#general | Arena | Page 222

cloud zinc Dec 15, 2025, 5:12 PM

#

yes

compact flame Dec 15, 2025, 5:12 PM

#

Got the auto again

echo aurora Dec 15, 2025, 5:20 PM

#

Yeah this is an experiment, meaning a small percentage of people are going to be seeing it currently. cc @cloud zinc

#

Also same with auto modality. :blobthumbsup:

proud bobcat Dec 15, 2025, 5:20 PM

#

5.2 still get mogged in every category by gemini

#

LONG LIVE GEMINI

compact flame Dec 15, 2025, 5:22 PM

#

echo aurora Yeah this is an experiment, meaning a small percentage of people are going to be...

Oh okay

#

Now I got some type of new ui

#

Instead of icons it's like a dropdown

cloud zinc Dec 15, 2025, 5:31 PM

#

proud bobcat 5.2 still get mogged in every category by gemini

neon idol Dec 15, 2025, 5:32 PM

#

what is it?

proud bobcat Dec 15, 2025, 5:33 PM

#

cloud zinc

benchmark unequal to general intelligence

#

people wont use gpt 5.2 because its too pricy

#

if gemini 3 pro is basically still the king why pay more for gpt

cloud zinc Dec 15, 2025, 5:42 PM

#

proud bobcat if gemini 3 pro is basically still the king why pay more for gpt

nobody is king

plucky sparrow Dec 15, 2025, 5:43 PM

#

Gpt useless 🤣

mild granite Dec 15, 2025, 5:49 PM

#

cloud zinc

#

cherry picking

hollow echo Dec 15, 2025, 6:10 PM

#

👋

echo aurora Dec 15, 2025, 6:12 PM

#

hollow echo 👋

:ablobwave:

haughty jetty Dec 15, 2025, 6:24 PM

#

👋

cinder gull Dec 15, 2025, 6:27 PM

#

hello everyone what's up? I want to know how to face swap a pic with her face and her into a video on Instagram from another lady, thanks

zealous sparrow Dec 15, 2025, 6:29 PM

#

Speaking of removed models, will speciale ever make a comeback? It's been a long while since it got deleted.

mild granite Dec 15, 2025, 6:30 PM

#

zealous sparrow Speaking of removed models, will speciale ever make a comeback? It's been a long...

deepseek 3.2 speciale?

zealous sparrow Dec 15, 2025, 6:30 PM

#

mild granite deepseek 3.2 speciale?

yea

mild granite Dec 15, 2025, 6:30 PM

#

zealous sparrow yea

has it ever been on lmarena?

zealous sparrow Dec 15, 2025, 6:31 PM

#

mild granite has it ever been on lmarena?

yes but it was deleted the same day it came to lmarena

mild granite Dec 15, 2025, 6:31 PM

#

oh

tardy plover Dec 15, 2025, 6:32 PM

#

grok removed?

whole sundial Dec 15, 2025, 6:32 PM

#

I also heard some complaints about `reve-v1` and `reve-fast-edit` being removed. They were replaced with the stealth models `epsilon` and `epsilon-fast`, and some people would like to be able to select these models again

zealous sparrow Dec 15, 2025, 6:34 PM

#

whole sundial I also heard some complaints about `reve-v1` and `reve-fast-edit` being removed,...

are they still reve v1 tho

#

they might be updated

whole sundial Dec 15, 2025, 6:34 PM

#

zealous sparrow are they still reve v1 tho

it's an upgraded model, but I know its removal and replacement with an unselectable model upset some people

zealous sparrow Dec 15, 2025, 6:35 PM

#

whole sundial it's an upgraded model, but I know its removal and replacement with an unselecta...

It was prob the company's decision, not LMArena's

mystic sluice Dec 15, 2025, 6:39 PM

#

is there a limit for gemini-3-pro?

plucky sparrow Dec 15, 2025, 6:39 PM

#

Haha this is beautiful

echo aurora Dec 15, 2025, 6:47 PM

#

mystic sluice is there a limit for gemini-3-pro?

Yes, all models will have some kind of rate limit associated with them.

zealous sparrow Dec 15, 2025, 7:06 PM

#

peak new model name @whole sundial

#

its a textarena model

#

i hope its not amazon

agile bloom Dec 15, 2025, 7:09 PM

#

so best overall ai model is gemini 3 pro for December 2025?

golden ocean Dec 15, 2025, 7:20 PM

#

plucky sparrow

erm sydney

#

artificial neural network vs biological neural network

cloud zinc Dec 15, 2025, 7:37 PM

#

agile bloom so best overall ai model is gemini 3 pro for December 2025?

gpt 5.2

agile bloom Dec 15, 2025, 7:38 PM

#

cloud zinc gpt 5.2

frr?

cloud zinc Dec 15, 2025, 7:39 PM

#

yes

cloud zinc Dec 15, 2025, 7:39 PM

#

agile bloom frr?

agile bloom Dec 15, 2025, 7:40 PM

#

cloud zinc

wow

#

is there any way i can upload a txt file to gpt 5.2 on lmarena?

cloud zinc Dec 15, 2025, 7:43 PM

#

lmarena doesnt allow pdf upload. you have to upload txt manually

empty stump Dec 15, 2025, 7:44 PM

#

i dont like 5.2

#

i will have to try it in api

agile bloom Dec 15, 2025, 7:45 PM

#

cloud zinc lmarena doesnt allow pdf upload. you have to upload txt manually

my txt file has 40,000 words in it, i wanted to use it to analyse using gpt5.2 as a txt file instead of the entire thing as text input

frosty lava Dec 15, 2025, 8:05 PM

#

yes and we're definitely improving everything you're talking about? each month a new model with better capabilities comes out

#

and it wont stop in 2026 for sure

#

less hallucination, follow task much better, better at coding

#

like everything we want its what they're working on

lunar glade Dec 15, 2025, 8:28 PM

#

anyone know any site that have nanobanana 4K model?

cloud zinc Dec 15, 2025, 8:31 PM

#

lunar glade anyone know any site that have nanobanana 4K model?

yes aistudio

#

no it is a scam

#

scam models

empty stump Dec 15, 2025, 8:44 PM

#

sota in what

weary galleon Dec 15, 2025, 8:50 PM

#

cloud zinc scam models

I agreed. They are not look too big.

weary galleon Dec 15, 2025, 8:51 PM

#

empty stump i dont like 5.2

Nobody likes it. Even Scam Altman.

cloud zinc Dec 15, 2025, 8:51 PM

#

weary galleon Nobody likes it. Even Scam Altman.

5.2 is #1 model

empty stump Dec 15, 2025, 8:51 PM

#

in benchmarks

zealous sparrow Dec 15, 2025, 8:52 PM

#

5.2 benchmaxxing

weary galleon Dec 15, 2025, 8:52 PM

#

It is designed for benchmarks only, not for real tasks.

cloud zinc Dec 15, 2025, 8:52 PM

#

weary galleon It is designed for benchmarks only, not for real tasks.

yes for real task also

weary galleon Dec 15, 2025, 8:53 PM

#

cloud zinc

It's a benchmark.

empty stump Dec 15, 2025, 8:53 PM

#

that openai made...

weary galleon Dec 15, 2025, 8:54 PM

#

GDPval is a benchmark.

cloud zinc Dec 15, 2025, 8:54 PM

#

for work tasks

empty stump Dec 15, 2025, 8:55 PM

#

of course a model by openai will perform best on an openai made benchmark

cloud zinc Dec 15, 2025, 8:56 PM

#

empty stump of course a model by openai will perform best on an openai made benchmark

show where its bad

weary galleon Dec 15, 2025, 8:59 PM

#

cloud zinc for work tasks

But still a benchmark.

weary galleon Dec 15, 2025, 9:00 PM

#

empty stump of course a model by openai will perform best on an openai made benchmark

Agreed. They design benchmarks so they can beat other models in them.

#

I will laugh when it will get lower place in Text Arena than GPT 5.1

#

🤣

cloud zinc Dec 15, 2025, 9:02 PM

#

weary galleon But still a benchmark.

so what are u basing ur opinion on

weary galleon Dec 15, 2025, 9:04 PM

#

cloud zinc so what are u basing ur opinion on

Truth.

#

Benchmark is a benchmark. Period.

cloud zinc Dec 15, 2025, 9:04 PM

#

weary galleon Truth.

ur truth is a lie

weary galleon Dec 15, 2025, 9:05 PM

#

cloud zinc ur truth is a lie

Is benchmark a benchmark or not?

cloud zinc Dec 15, 2025, 9:05 PM

#

weary galleon Is benchmark a benchmark or not?

weary galleon Dec 15, 2025, 9:07 PM

#

I won!

cloud zinc Dec 15, 2025, 9:08 PM

#

https://tenor.com/view/tom-senya-gif-26024901

weary galleon Dec 15, 2025, 9:14 PM

#

cloud zinc https://tenor.com/view/tom-senya-gif-26024901

This clown is GPT 5.2, not me. It lied for my demand, is it a smart output?

echo sinew Dec 15, 2025, 9:15 PM

#

Hello! Let's keep disagreements respectful and friendly.

weary galleon Dec 15, 2025, 9:16 PM

#

echo sinew Hello! Let's keep disagreements respectful and friendly.

Highly agreed👍

cloud zinc Dec 15, 2025, 9:16 PM

#

echo sinew Hello! Let's keep disagreements respectful and friendly.

yes, gpt 5.2 is best

weary galleon Dec 15, 2025, 9:20 PM

#

OpenAI got 500 billion dollars from the US government, Anthropic got nothing, Google got nothing and Gemini 3 Pro and Opus 4.5 both outperform GPT 5.2. How is that possible? Scam Altman's mismanaging of the company is the reason.

cloud zinc Dec 15, 2025, 9:22 PM

#

weary galleon OpenAI got 500 billion dollars from the US government, Anthropic got nothing, Go...

us didnt give openai money, u are wrong

#

us only approved the stargate project

weary galleon Dec 15, 2025, 9:23 PM

#

By the way, right answer is yes, GPT 5.2 said wrong answer, like always.

cloud zinc Dec 15, 2025, 9:23 PM

#

weary galleon OpenAI got 500 billion dollars from the US government, Anthropic got nothing, Go...

weary galleon Dec 15, 2025, 9:24 PM

#

cloud zinc us only approved the stargate project

This Stargate Project is a gift of the most powerful AI farm to OpenAI.

cloud zinc Dec 15, 2025, 9:24 PM

#

weary galleon This Stargate Project is a gift of the most powerful AI farm to OpenAI.

not a "gift"

weary galleon Dec 15, 2025, 9:25 PM

#

cloud zinc not a "gift"

Gift

cloud zinc Dec 15, 2025, 9:25 PM

#

its called investment

weary galleon Dec 15, 2025, 9:25 PM

#

cloud zinc its called investment

The same

cloud zinc Dec 15, 2025, 9:25 PM

#

weary galleon The same

#

gift to google

weary galleon Dec 15, 2025, 9:26 PM

#

cloud zinc

Contract isn't a gift. Investment is.

cloud zinc Dec 15, 2025, 9:27 PM

#

weary galleon Contract isn't a gift. Investment is.

weary galleon Dec 15, 2025, 9:27 PM

#

Even if a gift(but it's not) 500B and 200M is 2500X difference.

cloud zinc Dec 15, 2025, 9:27 PM

#

500 billion over several years

cloud zinc Dec 15, 2025, 9:28 PM

#

weary galleon Even if a gift(but it's not) 500B and 200M is 2500X difference.

google can't get that much gift cuz they are bad

weary galleon Dec 15, 2025, 9:28 PM

#

cloud zinc

Google to Anthropic and US government (taxpayers money) to OpenAI. Are you feeling the difference?

hollow flicker Dec 15, 2025, 9:29 PM

#

is it just me or do you guys think that most people use LMArena to get all the paid ai's for free

weary galleon Dec 15, 2025, 9:29 PM

#

cloud zinc google can't get that much gift cuz they are bad

And taxpayers are good 👍

weary galleon Dec 15, 2025, 9:29 PM

#

hollow flicker is it just me or do you guys think that most people use LMArena to get all the p...

Yes

cloud zinc Dec 15, 2025, 9:29 PM

#

weary galleon Google to Anthropic and US government (taxpayers money) to OpenAI. Are you feeli...

not us government, its softbank

weary galleon Dec 15, 2025, 9:29 PM

#

But some like me, don't

weary galleon Dec 15, 2025, 9:31 PM

#

cloud zinc

Anyway Scam Altman will get those money. Anthropic and Google will not.

cloud zinc Dec 15, 2025, 9:32 PM

#

cuz scam google are bad

weary galleon Dec 15, 2025, 9:32 PM

#

cloud zinc cuz scam google are bad

Why?

cloud zinc Dec 15, 2025, 9:34 PM

#

cuz gpt 5.2 is #1

stray aspen Dec 15, 2025, 9:40 PM

#

Lmao

#

Scam altman

weary galleon Dec 15, 2025, 9:41 PM

#

Let's go!

#

Vote, guys! Democracy will win!

cloud zinc Dec 15, 2025, 9:46 PM

#

weary galleon Vote, guys! Democracy will win!

empty stump Dec 15, 2025, 9:53 PM

#

its bad because of the safety

cloud zinc Dec 15, 2025, 9:57 PM

#

where is it? u are making a rumor, not actual reality

#

so far with my testing, it looks bad

rapid merlin Dec 15, 2025, 10:01 PM

#

Gemini hallucinates the fact it has dall e 3? Interesting....

echo aurora Dec 15, 2025, 10:02 PM

#

Is it just me, or are others experiencing a bug when trying to login? Essentially, after doing an email login, after entering email/password, clicking the login button doesn't do anything. Are others seeing the same?

cloud zinc Dec 15, 2025, 10:04 PM

#

echo aurora Is it just me, or are others experiencing a bug when trying to login? Essentiall...

yes

cloud zinc Dec 15, 2025, 10:04 PM

#

echo aurora Is it just me, or are others experiencing a bug when trying to login? Essentiall...

echo aurora Dec 15, 2025, 10:04 PM

#

cloud zinc yes

Thank you. Will share with the team.

weary galleon Dec 15, 2025, 10:10 PM

#

Maybe because this "test" is paid by Scam Altman?

weary galleon Dec 15, 2025, 10:12 PM

#

weary galleon

60% say bad, 40% say good. GPT 5.2 is feeling so bad.

cloud zinc Dec 15, 2025, 10:13 PM

#

weary galleon 60% say bad, 40% say good. GPT 5.2 is feeling so bad.

how many votes?

cloud zinc Dec 15, 2025, 10:13 PM

#

weary galleon 60% say bad, 40% say good. GPT 5.2 is feeling so bad.

u voted with ur alt ok

weary galleon Dec 15, 2025, 10:14 PM

#

@cloud zinc If GPT 5.2 is #1 as you said multiple times today, why members of LMArena hate it so much?

cloud zinc Dec 15, 2025, 10:14 PM

#

weary galleon @cloud zinc If GPT 5.2 is #1 as you said multiple times today, why mem...

no one is hating

weary galleon Dec 15, 2025, 10:14 PM

#

cloud zinc u voted with ur alt ok

Proofs!

cloud zinc Dec 15, 2025, 10:14 PM

#

yes u used ur alt to vote

#

vote is not over yet

#

23 hours left

weary galleon Dec 15, 2025, 10:15 PM

#

cloud zinc yes u used ur alt to vote

Give proofs instead of repeating false accusations!

cloud zinc Dec 15, 2025, 10:16 PM

#

weary galleon Give proofs instead of repeating false accusations!

vote not over 23 hours left

weary galleon Dec 15, 2025, 10:17 PM

#

cloud zinc vote not over 23 hours left

See you after 23 hours! Bye! 👋

weary galleon Dec 15, 2025, 10:22 PM

#

cloud zinc vote not over 23 hours left

Look my previous poll
#general message

cloud zinc Dec 15, 2025, 10:24 PM

#

weary galleon Look my previous poll https://discord.com/channels/1340554757349179412/134055475...

ur poll is rigged

weary galleon Dec 15, 2025, 10:24 PM

#

cloud zinc ur poll is rigged

Give proofs, false accuser!

echo aurora Dec 15, 2025, 10:25 PM

#

Hey, I'm going to ask that we move on from this conversation.

#

This doesn't seem to be very productive and is just escalating a bit here and there.

weary galleon Dec 15, 2025, 10:26 PM

#

echo aurora Hey, I'm going to ask that we move on from this conversation.

He said I vote from my alts in my polls with zero evidence.

#

Just accuse, accuse, accuse, again and again! Without proofs.

cloud zinc Dec 15, 2025, 10:34 PM

#

weary galleon Just accuse, accuse, accuse, again and again! Without proofs.

give proof u are not lying

night kelp Dec 15, 2025, 10:35 PM

#

can anyone help me a bit?

cloud zinc Dec 15, 2025, 10:35 PM

#

night kelp can anyone help me a bit?

yes

weary galleon Dec 15, 2025, 10:35 PM

#

cloud zinc give proof u are not lying

Troll! 😡

cloud zinc Dec 15, 2025, 10:35 PM

#

weary galleon Troll! 😡

so u have no proof

weary galleon Dec 15, 2025, 10:36 PM

#

cloud zinc so u have no proof

I added you to block list. Bye! 👋

cloud zinc Dec 15, 2025, 10:36 PM

#

thought so

night kelp Dec 15, 2025, 10:36 PM

#

cloud zinc yes

im working on something with python, been workin on it for a few days, then the chat decides to just stop, not being able to send messages (i mean like i can send but it gives me that annoying red prompt "something went wrong with this response, please try again") i refreshed, restarted, tried typing again, same thing

#

using ai ofc

#

also gemini 3.0

cloud zinc Dec 15, 2025, 10:37 PM

#

night kelp im working on something with python, been workin on it for a few days, then the ...

u need to start new chat

night kelp Dec 15, 2025, 10:38 PM

#

i have almost the whole stuff on that specific chat, any other way i can restore it?

cloud zinc Dec 15, 2025, 10:41 PM

#

that chat is bugged

#

always backup ur files after couple of hours

torn mantle Dec 15, 2025, 10:42 PM

#

@cloud zinc what did i miss

#

any new model

night kelp Dec 15, 2025, 10:43 PM

#

cloud zinc always backup ur files after couple of hours

how do i even back a chat up anws

torn mantle Dec 15, 2025, 10:45 PM

#

night kelp how do i even back a chat up anws

any new model

#

kaiser

night kelp Dec 15, 2025, 10:45 PM

#

uhh

stray aspen Dec 15, 2025, 10:46 PM

#

Why can't gpt 5.2 extra high think of another way of designing stuff

#

It all looks the same

night kelp Dec 15, 2025, 10:46 PM

#

gimini

stray aspen Dec 15, 2025, 10:46 PM

#

Wheres creativity

cloud zinc Dec 15, 2025, 10:46 PM

#

torn mantle @cloud zinc what did i miss

flash tomorrow

torn mantle Dec 15, 2025, 10:46 PM

#

cloud zinc flash tomorrow

how do u know

cloud zinc Dec 15, 2025, 10:46 PM

#

torn mantle how do u know

u will see

stray aspen Dec 15, 2025, 10:46 PM

#

Yeah it cooked for me but I just don't like that its not that creative

torn mantle Dec 15, 2025, 10:46 PM

#

cloud zinc u will see

bet

cloud zinc Dec 15, 2025, 10:46 PM

#

torn mantle how do u know

watch for logan tweet tonight

neat apex Dec 15, 2025, 10:46 PM

#

stray aspen Wheres creativity

Thats weird since even Qwen3 high manages that

torn mantle Dec 15, 2025, 10:47 PM

#

imma sleep early

#

_<

neat apex Dec 15, 2025, 10:47 PM

#

Likely because the gpt is sooo formal

#

So it will rarely try to outshine; besides, it writes well when you ask for something exactly

ocean vortex Dec 15, 2025, 10:49 PM

#

neat apex Thats weird since even Qwen3 high manages that

they copied the "high" thing from OpenAI? 🗿

neat apex Dec 15, 2025, 10:50 PM

#

😂

cloud zinc Dec 15, 2025, 10:50 PM

#

ocean vortex they copied the "high" thing from OpenAI? 🗿

yes, the word "high" is copyrighted by openai

golden ocean Dec 15, 2025, 10:50 PM

#

gpt 6 latent space reasoning

neat apex Dec 15, 2025, 10:50 PM

#

I think its general knowledge

ocean vortex Dec 15, 2025, 10:51 PM

#

cloud zinc yes, the word "high" is copyrighted by openai

That's not what I meant. Don't pretend you aren't very smart. Although maybe you...

neat apex Dec 15, 2025, 10:51 PM

#

I hope

ocean vortex Dec 15, 2025, 10:51 PM

#

I mean it's reasonable to assume they have bigger release in the works

#

there was less than 1 month between 5.1 and 5.2

#

this was like a small incremental update

neat apex Dec 15, 2025, 10:52 PM

#

ocean vortex That's not what I meant. Don't pretend you aren't very smart. Although maybe you...

Thats not what he meant. Dont pretend you arent very smart
Although maybe you...

ocean vortex Dec 15, 2025, 10:52 PM

#

No clue why they tried to oversell it this hard lol

#

Like pushing for benchmarks this hard with 5.2 wasn't necessary I feel like

cloud zinc Dec 15, 2025, 10:52 PM

#

neat apex Thats not what he meant. Dont pretend you arent very smart Although maybe you...

true

ocean vortex Dec 15, 2025, 10:52 PM

#

5.2 you mean?

#

I don't think it is tbh

#

have you seen SimpleBench and other stuff?

#

5.2 gets beaten by both 5.1 and even more so by 5.0 there lol

#

That's the thing, my experience was the same. 5.0 > 5.1 and 5.2

#

And then Gemini3 just somehow manages to score great everywhere they test it at

#

Not definitively the best in select things perhaps, but not really underperforming anywhere either

neat apex Dec 15, 2025, 11:19 PM

#

5.1 is way better than 5.0

#

Almost same smartness, but it actually makes an effort to give better responses

steel dune Dec 16, 2025, 12:02 AM

#

Hi everyone, a quick question “Which AI model will one recommend to aid in accounting work?”

echo aurora Dec 16, 2025, 12:05 AM

#

steel dune Hi everyone, a quick question “Which AI model will one recommend to aid in accou...

Hello :ablobwave: I would encourage you to check out our Text Arena leaderboard using the Business, Management, and Financial Ops occupational filter -> https://lmarena.ai/leaderboard/text/industry-business-and-management-and-financial-operations

steel dune Dec 16, 2025, 12:06 AM

#

Thank you

cloud zinc Dec 16, 2025, 12:40 AM

#

@torn mantle nice

thorn path Dec 16, 2025, 12:44 AM

#

Wait the text leaderboard updated and 5.2 isn't even in the top 10 lmfao wth

echo aurora Dec 16, 2025, 12:45 AM

#

thorn path Wait the text leaderboard updated and 5.2 isn't even in the top 10 lmfao wth

gpt-5.2 isn't yet on the Text leaderboard

thorn path Dec 16, 2025, 12:47 AM

#

echo aurora gpt-5.2 isn't yet on the Text leaderboard

That makes much more sense, between you and us where do you (personally) feel it's going to fall 👀 (if you're allowed to say, no worries if you're legally bound from doing so)

burnt sinew Dec 16, 2025, 12:49 AM

#

anyone notice gemini 3 is hallucinating thinking it has experiences?
"I personally have a database with over 800 entries."

burnt sinew Dec 16, 2025, 12:50 AM

#

echo aurora gpt-5.2 isn't yet on the Text leaderboard

when will it be?

#

if you can share

echo aurora Dec 16, 2025, 12:51 AM

#

thorn path That makes *much* more sense, between you and us where do you (personally) feel ...

I'm not legally barred from sharing my personal opinion, but I'd prefer not to, as some may interpret it the wrong way.

echo aurora Dec 16, 2025, 12:51 AM

#

burnt sinew when will it be?

TBD. Sorry to say I won't be able to give an estimated time.

burnt sinew Dec 16, 2025, 12:52 AM

#

thorn path That makes *much* more sense, between you and us where do you (personally) feel ...

i think it will score highly, though it does terrible on some edge cases

#

fails seahorse test

echo aurora Dec 16, 2025, 12:57 AM

#

thorn path That makes *much* more sense, between you and us where do you (personally) feel ...

Where do you think it'll land?

lucid geyser Dec 16, 2025, 1:03 AM

#

Really flipping it around

swift oyster Dec 16, 2025, 1:04 AM

#

LIVE BENCHMARK UPDATE
Model: Hawk (Launch 25th)
We're currently halfway through the official ARC-AGI-2 benchmark - one of the hardest AI reasoning tests in existence!

Current Stats (48% Complete)
Correct: 14
Incorrect: 44
Accuracy: 24.1%
Progress: 58/120 tasks

Key Takeaways:
Outperforming Claude Opus 4.5 by 10%
Currently ranked #9 globally

surreal creek Dec 16, 2025, 1:07 AM

#

thorn path That makes *much* more sense, between you and us where do you (personally) feel ...

“legally bound” bro it’s an AI benchmark site not classified information 😂

sullen quest Dec 16, 2025, 1:07 AM

#

swift oyster LIVE BENCHMARK UPDATE Model: Hawk (Launch 25th) We're currently halfway through...

Can we make LIVE BENCHMARK UPDATE a meme?

cloud zinc Dec 16, 2025, 1:13 AM

#

echo aurora Where do you think it'll land?

3rd

thorn path Dec 16, 2025, 1:25 AM

#

echo aurora Where do you think it'll land?

realistically 2nd, but it could really go either way depending on how varied opinions go imo

burnt sinew Dec 16, 2025, 1:28 AM

#

i think only o3 and opus 4.5 thinking gets seahorse question right?

jade egret Dec 16, 2025, 1:54 AM

#

burnt sinew i think only o3 and opus 4.5 thinking gets seahorse question right?

gemini does too i think?

burnt sinew Dec 16, 2025, 1:54 AM

#

jade egret gemini does too i think?

no

#

not without search i dont think

jade egret Dec 16, 2025, 1:55 AM

#

gemini 3?

#

o

#

lol

brisk turret Dec 16, 2025, 2:00 AM

#

Where is 5.2 on the leaderboard?

#

Wtf is going on

jade egret Dec 16, 2025, 2:23 AM

#

brisk turret Where is 5.2 on the leaderboard?

Not on it yet

#

Must be tomorrow?

tired plaza Dec 16, 2025, 3:15 AM

#

guys, what are all the site's keyboard shortcuts? is there a shortcut for new chat?

astral bloom Dec 16, 2025, 3:25 AM

#

@echo aurora when will the video arena be accessible, I've seen people here claim they had access to it.

frosty lava Dec 16, 2025, 3:44 AM

#

when will we get an ai with like 1% hallucination

#

that's all im waiting for

#

ofc its good to have genius ai but what if they always do errors

#

give me actual genius ai if you want but that are capable of actually doing things without mistake

cloud zinc Dec 16, 2025, 3:46 AM

#

frosty lava when will we get an ai with like 1% hallucination

2027

frosty lava Dec 16, 2025, 3:46 AM

#

cloud zinc 2027

you think ?

#

Cause like in 2026 we're supposed to see alot of new ai like grok 5 and all

#

and they're supposed to be like much better

cloud zinc Dec 16, 2025, 3:49 AM

#

frosty lava Cause like in 2026 we're supposed to see alot of new ai like grok 5 and all

overhyped products

frosty lava Dec 16, 2025, 3:50 AM

#

we'll see

#

no one know actually

delicate wagon Dec 16, 2025, 4:05 AM

#

Hi everyone

obtuse smelt Dec 16, 2025, 4:23 AM

#

hello

sullen quest Dec 16, 2025, 4:23 AM

#

astral bloom @echo aurora when will the video arena be accessible, I've seen people ...

?

#

@astral bloom video arena is right here in discord, is a discord exclusive feature, go to the channel how-to-video-bot for more info

cloud zinc Dec 16, 2025, 4:25 AM

#

sullen quest @astral bloom video arena is right here in discord, is a discord exclusi...

there is video and auto modality being experimented in the site

obtuse smelt Dec 16, 2025, 4:59 AM

#

hmm is happening
Something went wrong with this response, please try again.

left lodge Dec 16, 2025, 5:33 AM

#

Anyone wanna search with images?
I have a way.

#

Literally just go to lmarena.ai
Add images
Switch to search modality
Send Prompt.
Done.

#

I already reported it in #1449482775823515688
So act fast , :p

#

Also, Lmarena is working on 2 new possible updates;

A new modality picker

And

A new Video Modality in web
which will be available at
https://lmarena.ai/c/new?chat-modality=video
And
https://lmarena.ai/?chat-modality=video

#

Enjoy the information 💁

#

Currently these links are redirected to lmarena.ai
And video generation requires login.

cloud zinc Dec 16, 2025, 5:53 AM

#

left lodge Also, Lmarena is working on 2 new possible updates; > A new modality picker A...

how do i make video on site?

sour spear Dec 16, 2025, 5:58 AM

#

frosty lava Cause like in 2026 we're supposed to see alot of new ai like grok 5 and all

Hallucination is baked into Grok by design...

torn mantle Dec 16, 2025, 6:09 AM

#

cloud zinc @torn mantle nice

how did u know

#

so sus

mild harness Dec 16, 2025, 6:10 AM

#

hi, how can I use gpt 5.2 xhigh on lmarena? I only see 5.2 high. thanks

torn mantle Dec 16, 2025, 6:10 AM

#

mild harness hi, how can I use gpt 5.2 xhigh on lmarena? I only see 5.2 high. thanks

not available

vivid coral Dec 16, 2025, 6:17 AM

#

left lodge Also, Lmarena is working on 2 new possible updates; > A new modality picker A...

mother of God

left lodge Dec 16, 2025, 6:18 AM

#

cloud zinc how do i make video on site?

I got access randomly, but it is not currently available.
I tried but it required login and upon login it went away.

keen beacon Dec 16, 2025, 7:12 AM

#

Does anyone know why it no longer displays images in 1920x1080?

obtuse smelt Dec 16, 2025, 7:14 AM

#

is just 768 x1360

keen beacon Dec 16, 2025, 7:17 AM

#

obtuse smelt is just 768 x1360

Yesterday, if you asked for them in 1920x1080, it would do that.

frosty lava Dec 16, 2025, 7:36 AM

#

sour spear Hallucination is baked into Grok by design...

what do you mean its not only grok every llm

#

theres actually no llm without hallucination

obtuse smelt Dec 16, 2025, 7:38 AM

#

keen beacon Yesterday, if you asked for them in 1920x1080, it would do that.

oh i see

#

is gemini error ?

weary galleon Dec 16, 2025, 8:05 AM

#

🏆Sonnet 4.7 is coming... until Christmas ⛄

#

https://tenor.com/view/cheers-cheers-to-that-wink-steve-carell-gif-23705252507487769

sterile tartan Dec 16, 2025, 8:10 AM

#

weary galleon 🏆Sonnet 4.7 is coming... until Christmas ⛄

I had thought it would be 5 but 4.7 works too

weary galleon Dec 16, 2025, 8:10 AM

#

sterile tartan I had thought it would be 5 but 4.7 works too

I too

sterile tartan Dec 16, 2025, 8:10 AM

#

Expect quicker releases now from all companies

#

The competition is peak cutthroat now

#

:battle3d: :lmarenalogo:

weary galleon Dec 16, 2025, 8:13 AM

#

AI winter is coming... My expectations are Anthropic and Google will release lots of great models under this competition.
P.S. Maybe xAI also, I am not sure.

#

Also, AMD and Nvidia will release new AI chips in January, which also accelerate this rush dramatically.

#

Google's and Amazon's chips are also great 👍

#

My expectations are after Sonnet 4.7 release, it will get 🥇 first place in coding, Opus 4.5 🥈 second.

compact flame Dec 16, 2025, 8:18 AM

#

weary galleon AI winter is coming... My expectations are Anthropic and Google will release lot...

I doubt google will release any models soon

#

They gonna be either focused on other stuff or just training the new models for long time

weary galleon Dec 16, 2025, 8:20 AM

#

compact flame They gonna be either focused on other stuff or just training the new models for ...

I'm sure no one tech giant is focused on any other things.

#

Grok 4.20 is coming until Christmas ⛄ also. Not sure how much good it will be.

frosty lava Dec 16, 2025, 8:25 AM

#

Do you guys think ai will really replace like almost every jobs ?

#

many people is saying this is stupid and will never happen

#

but

#

why tho it can definitely happen

compact flame Dec 16, 2025, 8:27 AM

#

frosty lava Do you guys think ai will really replace like almost every jobs ?

Well it's based

#

Animators? Could be after I seen sora

frosty lava Dec 16, 2025, 8:28 AM

#

then why majority of people still think it wont happen isn't it like crazy

compact flame Dec 16, 2025, 8:28 AM

#

frosty lava then why majority of people still think it wont happen isn't it like crazy

They just can't accept that ai is getting too good

frosty lava Dec 16, 2025, 8:28 AM

#

how can we ignore it when we see the massive investment and the literal race

#

for ai

compact flame Dec 16, 2025, 8:29 AM

#

frosty lava how can we ignore it when we see the massive investment and the literal race

Well some believe the big ai advancement will slowdown someday

#

Its kinda based how well they train ai

frosty lava Dec 16, 2025, 8:30 AM

#

there's always a solution

compact flame Dec 16, 2025, 8:30 AM

#

frosty lava there's always a solution

Like, when do you think ai will just reach its limit and stop advancing?

frosty lava Dec 16, 2025, 8:32 AM

#

It will just be a problem to solve not something impossible to achieve, people be like its impossible yet every time human do new discoveries and what felt impossible is just a reality

#

there's always someone that find a new way

#

there will be some problem but saying we can't ever solve it feel stupid

#

in my opinion

weary galleon Dec 16, 2025, 8:37 AM

#

frosty lava Do you guys think ai will really replace like almost every jobs ?

Opus 4.5 Thinking 32K + prompt engineer can replace ANY programmer easily.

frosty lava Dec 16, 2025, 8:38 AM

#

weary galleon Opus 4.5 Thinking 32K + prompt engineer can replace ANY programmer easily.

Not really due to hallucinations

#

we just want less hallucinations and yes

weary galleon Dec 16, 2025, 8:39 AM

#

frosty lava Not really due to hallucinations

Prompt engineer is a human, they will give prompts to Opus until it would give an output without hallucinations.

frosty lava Dec 16, 2025, 8:40 AM

#

you're right, but are we already at the point where it would be as efficient as a team of programmers? i don't really know at all

#

i think definitely in 2026 what you're saying will be doable

#

we're close for sure

weary galleon Dec 16, 2025, 8:42 AM

#

I'm a programmer, and that's why I think LLMs are bad for us. To find a job is much harder for me even now, if to compare a few years ago.

frosty lava Dec 16, 2025, 8:43 AM

#

weary galleon I'm a programmer, and that's why I think LLMs are bad for us. To find a job is m...

its bad cause we only see what's happening right now, but in the future we won't just let people having no jobs and like die

#

it will just be very very different

#

it can definitly help us all if we think about medecine, new discoveries, and future

#

its just that theres this questions about jobs and what will happen but no one can answer already were not here yet

#

but for sure, we won't let people struggle forever

weary galleon Dec 16, 2025, 8:49 AM

#

frosty lava but for sure, we won't let people struggle forever

Only so groups of people, all people with intelligence work, people with physical work will feeling great.

frosty lava Dec 16, 2025, 8:50 AM

#

weary galleon Only so groups of people, all people with intelligence work, people with physica...

Sorry, english is not my native language and im not really sure about what you try to say

#

Physical work might also get replaced

weary galleon Dec 16, 2025, 8:51 AM

#

frosty lava Physical work might also get replaced

But not by LLMs.

frosty lava Dec 16, 2025, 8:51 AM

#

weary galleon But not by LLMs.

No, but by robotics maybe ? which is also growing much

#

In china its already happening

strong wren Dec 16, 2025, 8:52 AM

#

hi

weary galleon Dec 16, 2025, 8:53 AM

#

frosty lava No, but by robotics maybe ? which is also growing much

Current LLMs are very good, current robots are not good. They will be able to do all physical works, but not in near future.

frosty lava Dec 16, 2025, 8:53 AM

#

weary galleon Current LLMs are very good, current robots are not good. They will be able to do...

Already done in some factory in china, with this ai race its not only about LLM

#

there is also robotics going with it

#

and yes they are doing alot of progress

#

I would say we're really into automation these days

#

everythings is about it

#

we want more efficient, cheaper worker

#

and its doing alot of progress

#

You can look deeper into it

#

its happening.

viscid cloak Dec 16, 2025, 8:58 AM

#

nanobanana pro DIRECT keeps giving“something wrong”, anyone as well?

obtuse smelt Dec 16, 2025, 9:00 AM

#

viscid cloak nanobanana pro DIRECT keeps giving“something wrong”, anyone as well?

yeah gemini looks error

weary galleon Dec 16, 2025, 9:00 AM

#

frosty lava In china its already happening

Robots waiters are good.What about robots plumbers, electricians, carpenters, builders, etc.? It's impossible.

frosty lava Dec 16, 2025, 9:01 AM

#

weary galleon Robots waiters are good.What about robots plumbers, electricians, carpenters, bu...

impossible ? no its not, its just not done yet, like i said we're doing alot of progress in it you really can look into it you'll see its not a dream or something impossible

#

im not saying its already done, but definitly it seems that it will happen

#

and not like in 50 years

#

that's definitly something else than LLM

#

but its also doing progress

weary galleon Dec 16, 2025, 9:04 AM

#

frosty lava and not like in 50 years

I think even more. If robot (even futures one) would repair your electricity panel your house would burn.

frosty lava Dec 16, 2025, 9:04 AM

#

weary galleon I think even more. If robot (even futures one) would repair your electricity pan...

there is a problem, and no problem is impossible to solve

#

like we definitly won't make something that kill you

#

or is dangerous

weary galleon Dec 16, 2025, 9:05 AM

#

frosty lava there is a problem, and no problem is impossible to solve

We need 50 years or more to solve this problem.

frosty lava Dec 16, 2025, 9:06 AM

#

weary galleon We need 50 years or more to solve this problem.

wow that's a big numbers, we're progressing faster than that

#

you know how the world were 50 years ago ?

#

its changing much faster than that

weary galleon Dec 16, 2025, 9:09 AM

#

left lodge Enjoy the information 💁

You're not an admin. How did you get insider information?

frosty lava Dec 16, 2025, 9:10 AM

#

But we shouldn't see only the bad side of it honestly

#

nothing is all good or all bad

#

but from what we saw in history, when we progress its mostly good

robust sluice Dec 16, 2025, 9:13 AM

#

viscid cloak nanobanana pro DIRECT keeps giving“something wrong”, anyone as well?

yes, and errors count toward the limit for me

obtuse smelt Dec 16, 2025, 9:16 AM

#

need fix this

#

it's been showing errors for several hours

robust sluice Dec 16, 2025, 9:17 AM

#

mine its been 3 days

#

Gemini never appear in Battle only Flux models

obtuse smelt Dec 16, 2025, 9:18 AM

#

hmm

robust sluice Dec 16, 2025, 9:19 AM

#

and Direct keep error and said try again in 50 min

obtuse smelt Dec 16, 2025, 9:20 AM

#

hmm

plucky sparrow Dec 16, 2025, 9:20 AM

#

frosty lava but for sure, we won't let people struggle forever

not 'forever' but don't be so sure the rich will just help people out

#

the question is whether you will end up living the majority of your life in hell, or abundance

frosty lava Dec 16, 2025, 9:21 AM

#

plucky sparrow not 'forever' but don't be so sure the rich will just help people out

it won't depend on one person but on the government, i can't say hey everythings will be good, but i can definitly say if its bad anyway it will also be bad for them in that case

plucky sparrow Dec 16, 2025, 9:21 AM

#

the great depression was a pretty long period of time

frosty lava Dec 16, 2025, 9:22 AM

#

its a win win

plucky sparrow Dec 16, 2025, 9:22 AM

#

it will take a long time

frosty lava Dec 16, 2025, 9:22 AM

#

if its bad for most people then its bad for them too

#

so it most likely won't be bad

plucky sparrow Dec 16, 2025, 9:22 AM

#

at first they'll be like "oh, but, AI is creating new jobs. we'll just create more gov jobs."

#

they'll delay it as long as possible

#

we might even get a revolution before we get change

#

historically, that's how it plays out

frosty lava Dec 16, 2025, 9:23 AM

#

yes theres a time for it to be good actually they will delay it your right, but definitly the worst case will not happen

plucky sparrow Dec 16, 2025, 9:23 AM

#

yeah i agree. to be honest, worst case i'm worried about is, the rich people are in charge of AI right now

frosty lava Dec 16, 2025, 9:24 AM

#

plucky sparrow yeah i agree. to be honest, worst case i'm worried about is, the rich people are...

Im not sure of what your trying to say cause ai are made by rich people already

plucky sparrow Dec 16, 2025, 9:24 AM

#

and the rich people have access to the best AI models. not worried about an AI 'killing us all' as much as 'rich people having great influence over AI' and using it to manipulate people

#

imagine a 100x better claude opus 4.5, but only the rich have access to it, and they use it to exploit you

frosty lava Dec 16, 2025, 9:25 AM

#

plucky sparrow imagine a 100x better claude opus 4.5, but only the rich have access to it, and ...

that's the worst case, but it won't happen if it do you can already tell it will be a war

#

i don't believe we all just gonna accept something like that happening and they know it

plucky sparrow Dec 16, 2025, 9:25 AM

#

sure, but, like all wars, it's usually won by the people with the most power

#

can you win against a robot army?

#

i think we see great famine before we see great abundance

frosty lava Dec 16, 2025, 9:26 AM

#

why will they use a robot army, why will they want to be hated by everyone, and that mean like every country have to do it

plucky sparrow Dec 16, 2025, 9:26 AM

#

historically, that's always happened. not sure this time is 'different'

frosty lava Dec 16, 2025, 9:26 AM

#

if everyone like everyone do hate them they are loosing

#

they are human too

#

they don't want human to die

#

and nobody will let someone doing it

plucky sparrow Dec 16, 2025, 9:27 AM

#

have you not seen how many humans were enslaved historically?

frosty lava Dec 16, 2025, 9:27 AM

#

plucky sparrow have you not seen how many humans were enslaved historically?

Yes, but like what your saying is something different actually its not the same weight

plucky sparrow Dec 16, 2025, 9:27 AM

#

we're living in less than 0.01% of human history. we forget how bad things have been

frosty lava Dec 16, 2025, 9:27 AM

#

if something like a robot army controlling everyone happen humanity is dead

#

would mean even them at some point will die

plucky sparrow Dec 16, 2025, 9:28 AM

#

i don't think it will go on forever, i'm just saying, historically, we go through incredibly rough times before we get good times

#

and we forget, as a species, how good it is now, compared to how it was

#

and that cycle has repeated itself for as long as history was written

shrewd citrus Dec 16, 2025, 9:28 AM

#

Anyone else having issues with nano banana pro

plucky sparrow Dec 16, 2025, 9:29 AM

#

so either this time is different, or it's not

frosty lava Dec 16, 2025, 9:29 AM

#

plucky sparrow i don't think it will go on forever, i'm just saying, historically, we go throug...

Why do you think its the same thing ? when we created pcs and phone and everythings no such thing happened

#

not that bad

#

atleast

obtuse smelt Dec 16, 2025, 9:29 AM

#

shrewd citrus Anyone else having issues with nano banana pro

yeah is seem error

plucky sparrow Dec 16, 2025, 9:29 AM

#

the industrial revolution, the great depression, the last 2 wars..

frosty lava Dec 16, 2025, 9:29 AM

#

But also people getting smarter with more right

#

you can't compare something that happened when like everythings was different from now to now

plucky sparrow Dec 16, 2025, 9:30 AM

#

frosty lava Why do you think its the same thing ? when we created pcs and phone and everythi...

this is a very small % of human history. i'd argue we're still in the age of abundance, but we're on a downward curve into the age of famine

#

so you're saying. "this time is different"

#

it could be. but it rarely is

frosty lava Dec 16, 2025, 9:31 AM

#

it always are different, cause something like that happening in the world of today doesn't seem achievable and if it is like everyone will loose

#

even them

plucky sparrow Dec 16, 2025, 9:31 AM

#

yes, i don't think it will go on forever

#

i think history never repeats itself exactly but it often rhymes

#

i think we will get an age where a lot of people are in famine because of AI, and eventually a revolution of some sort, and eventually we will have abundance

frosty lava Dec 16, 2025, 9:31 AM

#

they need to be popular too

plucky sparrow Dec 16, 2025, 9:32 AM

#

the only question is whether we end up living in the majority of the famine and barely see the abundance

frosty lava Dec 16, 2025, 9:32 AM

#

to be popular it also mean they have to do good things somehow

plucky sparrow Dec 16, 2025, 9:32 AM

#

like many people who survived through WW2

#

we might end up telling our grand kids, "Back in my day, AIs made us all poor.."

frosty lava Dec 16, 2025, 9:32 AM

#

plucky sparrow like many people who survived through WW2

WW2 didn't started cause suddenly we got new technologies

plucky sparrow Dec 16, 2025, 9:32 AM

#

no, it started because of famine in Germany

frosty lava Dec 16, 2025, 9:32 AM

#

so its completely different

#

not even the same reason

#

Ai are new technologies

plucky sparrow Dec 16, 2025, 9:33 AM

#

we'll see. you're banking on the government being actually controlled by the poor majority instead of manipulated by the rich minority

frosty lava Dec 16, 2025, 9:33 AM

#

yes cause like you said majority and minority, and if the majority start to defend themselves the minority wont win

#

and they don't want that anyway

#

it would in every case make them loose

plucky sparrow Dec 16, 2025, 9:35 AM

#

i think what you're saying is right, i just think it'll take a lot longer than we'd like

frosty lava Dec 16, 2025, 9:35 AM

#

plucky sparrow i think what you're saying is right, i just think it'll take a lot longer than w...

your right too, people loosing job is not something avoidable now

#

but i dont know what will happen

#

i just know we can't go back anyway

plucky sparrow Dec 16, 2025, 9:36 AM

#

yeah i agree

#

people somehow think protesting AI will get anywhere 🤣

swift oyster Dec 16, 2025, 9:37 AM

#

Result are in for our upcoming model.

plucky sparrow Dec 16, 2025, 9:37 AM

#

if anything, it's better the country you live in, develops AI before the country that may not share your same values

sterile ore Dec 16, 2025, 9:37 AM

#

hello

obtuse smelt Dec 16, 2025, 9:37 AM

#

hi

frosty lava Dec 16, 2025, 9:38 AM

#

plucky sparrow people somehow think protesting AI will get anywhere 🤣

its like, when gun were created who will still use sword

#

its more efficient

#

and that's all we want

#

when people will realize they can't beat someone that use ai in the future

#

they will use ai

#

and you can't really blame them

plucky sparrow Dec 16, 2025, 9:40 AM

#

imagine trying to protest the development of guns 😄

frosty lava Dec 16, 2025, 9:41 AM

#

you just can't tell to everyone to stop

plucky sparrow Dec 16, 2025, 9:41 AM

#

imagine if your country had no guns but every other country does 😄

frosty lava Dec 16, 2025, 9:41 AM

#

and if someone still do it he will be more efficient

plucky sparrow Dec 16, 2025, 9:41 AM

#

not even in the army

frosty lava Dec 16, 2025, 9:41 AM

#

plucky sparrow imagine if your country had no guns but every other country does 😄

exactly that's why its not possible to go back when its proven its better

plucky sparrow Dec 16, 2025, 9:41 AM

#

i do agree with you 💯 on one thing, protesting AI is stupid

#

it won't lead anywhere

#

except your own misfortune

frosty lava Dec 16, 2025, 9:42 AM

#

we can't protest progress even if the progress is dangerous or will lead to big problem like mass unemployment

#

cause its somehow still a progress

#

all we can do is find solution but it doesn't even depend on us

#

but on few people :

#

all i can say is if its too bad the majority will defend themselves

#

so it probably won't be "too" bad

plucky sparrow Dec 16, 2025, 9:44 AM

#

hmm

#

but things are 'too bad' in many countries

#

and the majority isn't able to defend themselves

frosty lava Dec 16, 2025, 9:45 AM

#

its not equality everywhere, for the big countries it work like i said

plucky sparrow Dec 16, 2025, 9:46 AM

#

i think that's why AI development is actually super important

#

it will determine whether your country is a 'big' country or not, and how long the famine will last

frosty lava Dec 16, 2025, 9:46 AM

#

if ever a "bad" countries get the lead of the most powerfull progress then we're all at their mercy

#

that's why its a race

#

we're saying to know who will be the leader

#

of the next decades

#

but no one understand it atleast not yet

plucky sparrow Dec 16, 2025, 9:47 AM

#

will be interesting to see who wins

#

might not be who we think at all

frosty lava Dec 16, 2025, 9:47 AM

#

yes

#

we never know it can come from anyone

plucky sparrow Dec 16, 2025, 9:47 AM

#

e.g. may not be US or China

frosty lava Dec 16, 2025, 9:47 AM

#

it will be the smarter one

#

that's it

plucky sparrow Dec 16, 2025, 9:47 AM

#

might not even be smarter, could be luck

frosty lava Dec 16, 2025, 9:48 AM

#

yes your right

#

we just can't tell

#

but its a race and not for nothing

#

its much more important than people think

plucky sparrow Dec 16, 2025, 9:49 AM

#

one very interesting video I find is the time lapse videos of the world borders: https://www.youtube.com/watch?v=-6Wu0Q7x5D0

YouTube

Ollie Bye

The History of the World: Every Year

Since 200,000 BCE, humanity has spread around globe and enacted huge change upon the planet. This video shows every year of that story, right from the beginning.

Abriviations can be found in this document: https://docs.google.com/document/d/1_oJx72M75tuai2mo6yD13qqQB1g_auQ...

frosty lava Dec 16, 2025, 9:50 AM

#

when china and us are racing how can people still say hey its nothing

#

i don't understand sometime

plucky sparrow Dec 16, 2025, 9:50 AM

#

put it on 2x or whatever. the borders historically change like crazy, yet somehow, we think war is over

frosty lava Dec 16, 2025, 9:50 AM

#

how delusionnal we can be

frosty lava Dec 16, 2025, 9:51 AM

#

plucky sparrow put it on 2x or whatever. the borders historically change like crazy, yet someho...

Yeah i know already too many things happen but we see it as small and not important when it actually matter the most

plucky sparrow Dec 16, 2025, 9:51 AM

#

I do hope you're right, that this time really is different

frosty lava Dec 16, 2025, 9:51 AM

#

if its not big enough to see it through eyes people will still think its nothing

plucky sparrow Dec 16, 2025, 9:52 AM

#

otherwise in the next 7-30 years we're likely going to go through hell 😄

frosty lava Dec 16, 2025, 9:52 AM

#

plucky sparrow otherwise in the next 7-30 years we're likely going to go through hell 😄

yes and it can even be faster than in 7 years

#

by seeing at the progress of it this year for example

#

its crazy

#

all of it is being speed up cause its a race

plucky sparrow Dec 16, 2025, 9:53 AM

#

well i mean, people predicted self-driving cars everywhere 5 years ago 😄

frosty lava Dec 16, 2025, 9:53 AM

#

plucky sparrow well i mean, people predicted self-driving cars everywhere 5 years ago 😄

yes but theres a difference, cause self driving car doesn't mean power

#

when its about power you'll be surprised

#

how fast it happen

#

seeing Trillion dollars of investment lol

#

i don't even know who have this much money but yeah

#

its still happening

weary galleon Dec 16, 2025, 9:55 AM

#

swift oyster Result are in for our upcoming model.

It doesn't look real. In my experience really good models don't need hype, they get hype automatically just by existing. You post these pictures every single day, maybe because after the release the hype will be over.

frosty lava Dec 16, 2025, 9:55 AM

#

Yes gpt 5.2 what a scam in my opinion

#

they did this so they don't loose fame but its definitly not as good as expected

#

benchmark are impressive yet we dont see real result

#

like student getting 100 / 100 on a test but when it come to apply it in real life nothing happen

plucky sparrow Dec 16, 2025, 9:57 AM

#

frosty lava when its about power you'll be surprised

that is true

frosty lava Dec 16, 2025, 9:59 AM

#

plucky sparrow that is true

president themselves are involved in this

#

its not just a new fancy technologies

#

its power but people be underestimating this

#

while Trillion dollars being invested

hollow mist Dec 16, 2025, 10:11 AM

#

Tell me what the problem is. I've been nerfing everything for a long time, but now I keep getting the error "Something went wrong with this response, please try again." (Gemini 3 pro image preview nano banana)

#

Help please

compact flame Dec 16, 2025, 10:25 AM

#

hollow mist Tell me what the problem is. I've been nerfing everything for a long time, but n...

It's something with nano banana itself

#

It's not available on yupp either

robust sluice Dec 16, 2025, 10:36 AM

#

someone told me error result doesnt count on limit, but I try over and over when limit resets, I found it really count, cuz this hour I got only error msg and limits still runs out

obtuse smelt Dec 16, 2025, 10:37 AM

#

yeah me too, same is happening again

robust sluice Dec 16, 2025, 10:41 AM

#

how I test: generate something random so a Flux model appears (I recognize the style it draws cuz I see it a lot) next to the other model that errors. I keep clicking retry on the error model; it counts for about 11s, then errors again. I do this until it doesn't count anymore, which means the limit has already run out

hollow mist Dec 16, 2025, 10:44 AM

#

This is an update, the server just doesn't work.

magic ravine Dec 16, 2025, 10:55 AM

#

Nano Banana Pro is not working again

obtuse smelt Dec 16, 2025, 11:00 AM

#

hmm, well me too

exotic iris Dec 16, 2025, 11:12 AM

#

#

Everyone you love this image or not

storm summit Dec 16, 2025, 11:12 AM

#

any cods invite for sora?

exotic iris Dec 16, 2025, 11:13 AM

#

exotic iris

Everyone likes 👍???

pastel bone Dec 16, 2025, 11:26 AM

#

storm summit any cods invite for sora?

You can now use lmarena to generate videos(Open Chrome, whatever you like)If u are lucky enough!

storm summit Dec 16, 2025, 11:32 AM

#

pastel bone You can now use lmarena to generate videos(Open Chrome, whatever you like)If u a...

i can only generate image and web research and code. How to generate video

pastel bone Dec 16, 2025, 11:33 AM

#

storm summit i can only generate image and web research and code. How to generate video

Asking pineapple, probably release in a few weeks/months

storm summit Dec 16, 2025, 11:33 AM

#

but why u can generate video and i cant?

ocean vortex Dec 16, 2025, 11:33 AM

#

neat apex Almost same smartness, but it actually efforts to make better responses

"almost same"... For me the style feels kinda forced. And it isn't as reliable on some tasks.

pastel bone Dec 16, 2025, 11:33 AM

#

storm summit i can only generate image and web research and code. How to generate video

But you can try your luck and use #video-arena-1, #video-arena-2, #video-arena-3 or #video-arena-4

#

Maybe u can use Sora👍🏻Good luck

true harbor Dec 16, 2025, 11:34 AM

#

Is image generation not working at all in lm arena since yesterday? At least for Nanobanana

magic ravine Dec 16, 2025, 11:35 AM

#

pastel bone You can now use lmarena to generate videos(Open Chrome, whatever you like)If u a...

I don't see a video option.

pastel bone Dec 16, 2025, 11:35 AM

#

magic ravine I don't see a video option.

Yeah I do it several hours ago and it maybe a leak

#

I was amazed

#

And I saw many models like kling😆

magic ravine Dec 16, 2025, 11:36 AM

#

Damn NatsuSadChibi

ocean vortex Dec 16, 2025, 11:36 AM

#

with GPT5 they went all in for genuine performance. With 5.1 they went for response style, and then for 5.2 they had this "oh sh'it" moment, frantically responding to Gemini3

pastel bone Dec 16, 2025, 11:37 AM

#

magic ravine Damn NatsuSadChibi

I didn't have time to try it out actually.BTW, it is a good try to add video arena

obtuse smelt Dec 16, 2025, 11:37 AM

#

oh well is work again

ocean vortex Dec 16, 2025, 11:38 AM

#

that turnaround of less than 1 month between model versions is not normal tbh

#

We had similar with 2.5Pro 05-06 vs 06-05, but that was marginal differences and was advertised as such

magic ravine Dec 16, 2025, 11:39 AM

#

pastel bone I didn't have time to try it out actually.BTW, it is a good try to add video are...

The problem with video arena here is the randomness. I almost never get Sora or Veo 3. It's always the bad models

pastel bone Dec 16, 2025, 11:40 AM

#

magic ravine The problem with video arena here is the randomness. I almost never get Sora or ...

Yeah, the mods tell me that they may add a function for a model at random and a model you pick option

weary galleon Dec 16, 2025, 11:41 AM

#

frosty lava Yes gpt 5.2 what a scam in my opinion

They scammed Microsoft when Microsoft invested $500B into Stargate and now Microsoft wanna get their money back because models of OAI are terrible, but they can't.

magic ravine Dec 16, 2025, 11:42 AM

#

pastel bone Yeah, the mods tell me that they may add a function for a model at random and a ...

I hope so, but yeah, I'd prefer it on the site itself. Way more convenient and less traffic than in a public chat where it may get drowned by other gens.

frosty lava Dec 16, 2025, 11:42 AM

#

they wanted to release a new model fast due to them loosing people cause of gemini and claude

#

but it wasn't the best idea

weary galleon Dec 16, 2025, 11:44 AM

#

magic ravine The problem with video arena here is the randomness. I almost never get Sora or ...

There is a feature "Direct Chat" for text, visual, and image models. It would be a solid idea to add it for videos too.

weary galleon Dec 16, 2025, 11:45 AM

#

magic ravine I hope so, but yeah, I'd prefer it on the site itself. Way more convenient and l...

And all people see your inputs/outputs.

weary galleon Dec 16, 2025, 11:46 AM

#

frosty lava but it wasn't the best idea

It was the worst idea. Ever. GPT 5.1 outperforms GPT 5.2 in real tasks. GPT 5.2 is just a benchmark scam.

frosty lava Dec 16, 2025, 11:47 AM

#

i guess they just lost due to their idea can they get back on the race ? i don't know honestly it depends on so many things

#

people will always use the best

#

fake benchmark doesn't get them anything lol

#

if it can't do it in real life

buoyant crypt Dec 16, 2025, 12:06 PM

#

Nano banana is not working today

weary galleon Dec 16, 2025, 12:07 PM

#

frosty lava i guess they just lost due to their idea can they get back on the race ? i don't...

Trying not to lose short-term race, but don't care at all about long-term race proofs Scam Altman has negative IQ level.

viscid cloak Dec 16, 2025, 12:08 PM

#

buoyant crypt Nano banana is not working today

yea, about only 1/5 generation was successful

acoustic crater Dec 16, 2025, 12:11 PM

#

viscid cloak yea, about only 1/5 generation was successful

1/50

magic ravine Dec 16, 2025, 12:12 PM

#

buoyant crypt Nano banana is not working today

Yep. All we can do is wait.

#

And hope they fix it

#

NatsuSadChibi

plucky sparrow Dec 16, 2025, 12:19 PM

#

Probably because they're trying to rush out flash 3

magic ravine Dec 16, 2025, 12:21 PM

#

plucky sparrow Probably because they're trying to rush out flash 3

Flash 3? How's it different from Pro 3?

plucky sparrow Dec 16, 2025, 12:24 PM

#

Faster and cheaper for those paying

#

(High use people and api people)

#

And some claim it's better in some ways. We'll see today

vale vortex Dec 16, 2025, 12:29 PM

#

Quick question, which one is better for research,

Kimi, MiniMax Agent, or Deepseek?

compact flame Dec 16, 2025, 12:42 PM

#

vale vortex Quick question, which one is better for research, Kimi, MiniMax Agent, or Deeps...

Why these specifically though? There multiple different ones but probably deepseek though

plucky sparrow Dec 16, 2025, 12:47 PM

#

I haven't used deepseek or minimax but K2 is actually pretty good for research

#

surprisingly good

pale torrent Dec 16, 2025, 12:48 PM

#

How to use Sor 3?

plucky sparrow Dec 16, 2025, 12:50 PM

#

#1397655624103493813

vale vortex Dec 16, 2025, 12:51 PM

#

plucky sparrow I haven't used deepseek or minimax but K2 is actually pretty good for research

Thinking or Instruct?

plucky sparrow Dec 16, 2025, 12:51 PM

#

vale vortex Thinking or Instruct?

thinking, on their site (not api)

onyx shore Dec 16, 2025, 12:59 PM

#

vale vortex Quick question, which one is better for research, Kimi, MiniMax Agent, or Deeps...

go with k2 much better then those

hollow mist Dec 16, 2025, 1:06 PM

#

Is there any news when nano banana will work?

obtuse smelt Dec 16, 2025, 1:06 PM

#

hmm still not respond

sterile tartan Dec 16, 2025, 1:06 PM

#

vale vortex Quick question, which one is better for research, Kimi, MiniMax Agent, or Deeps...

Might as well Try Tongyi Deep Research in Qwen

obtuse smelt Dec 16, 2025, 1:07 PM

#

hollow mist Is there any news when nano banana will work?

still like this "Something went wrong with this response, please try again."

plucky sparrow Dec 16, 2025, 1:08 PM

#

sterile tartan Might as well Try Tongyi Deep Research in Qwen

thanks for the suggestion, i've not tried that

sterile tartan Dec 16, 2025, 1:08 PM

#

plucky sparrow thanks for the suggestion, i've not tried that

Yeah try all best and judge by personal experience

#

Is the research very important?

plucky sparrow Dec 16, 2025, 1:10 PM

#

what is 'low price'

sterile tartan Dec 16, 2025, 1:10 PM

#

plucky sparrow what is 'low price'

Cheaper then original

plucky sparrow Dec 16, 2025, 1:10 PM

#

so not very cheap then

sterile tartan Dec 16, 2025, 1:10 PM

#

Probably well cheaper

#

The black internet market of ai yeah

#

I have a a seat in ChatGPT business myself

compact flame Dec 16, 2025, 1:16 PM

#

Seems like advertisement

#

Should we call mods or something

sterile tartan Dec 16, 2025, 1:17 PM

#

compact flame Should we call mods or something

I feel like it yeah

#

💀

compact flame Dec 16, 2025, 1:18 PM

#

sterile tartan I feel like it yeah

Ping mods while replying to the message ig

sterile tartan Dec 16, 2025, 1:18 PM

#

compact flame Ping mods while replying to the message ig

Right

#

The Stage is Yours

compact flame Dec 16, 2025, 1:20 PM

#

sterile tartan Right

Ig I'll wait can't test if we can ping mods or not though

sterile tartan Dec 16, 2025, 1:20 PM

#

U can atleast ping pineapple afaik

zealous sparrow Dec 16, 2025, 1:35 PM

#

gemini flash 3 is defo today

#

google put out another model onto battle LMArena

#

Updated Ghostfalcon and Fiercefalcon

#

obtuse smelt Dec 16, 2025, 1:37 PM

#

what the

sour spear Dec 16, 2025, 1:50 PM

#

zealous sparrow gemini flash 3 is defo today

I'm so curious to see how Flash 3 will perform.

plucky sparrow Dec 16, 2025, 1:54 PM

#

preliminary reports are saying 'the sonnet to the opus'

#

we'll see soon enough if it is

plucky sparrow Dec 16, 2025, 1:56 PM

#

zealous sparrow google put out another model onto battle LMArena

don't think this is google. doesn't make sense for them to release a codename trial model on the same day as release

compact sleet Dec 16, 2025, 1:57 PM

#

It felt like a Google model to me, could be wrong though

sour spear Dec 16, 2025, 1:57 PM

#

If Ghostfalcon is Flash 3, then OpenAI is in even deeper s*** than they already are. I just got it in battle mode, and it did a fantastic job

zealous sparrow Dec 16, 2025, 1:58 PM

#

google got the 3/3 on my questions after i changed them up a bit

#

btw

#

no AI could 3/3 this

#

i want to see if Xhigh can after i changed em up a bit

compact sleet Dec 16, 2025, 1:59 PM

#

zealous sparrow no AI could 3/3 this

Yeah guess sundar wasn't lying it's their best model yet on that specific interview.

zealous sparrow Dec 16, 2025, 2:00 PM

#

ima test xhigh rq

#

also deepseek r1 turbo got it right

#

so if xhigh doesnt

#

OpenAi is cooked

fiery gull Dec 16, 2025, 2:05 PM

#

sour spear I'm so curious to see how Flash 3 will perform.

I hope it's good, I'll have to manipulate 300gb of documents in anti gravity, I need flash 3.0 to be good in agentic mode

zealous sparrow Dec 16, 2025, 2:05 PM

#

gpt 5.2 xhigh officialy lost to deepseek r1 turbo and the new gemini model

#

2/3 because of the Macy question

fiery gull Dec 16, 2025, 2:05 PM

#

zealous sparrow gpt 5.2 xhigh officialy lost to deepseek r1 turbo and the new gemini model

I cant use gpt 5.2 xhigh, my works it need more that 500 sec and crash

zealous sparrow Dec 16, 2025, 2:05 PM

#

if you argue fell behind means she fell off the hill it aint, it means that she is in the back of the hill

zealous sparrow Dec 16, 2025, 2:06 PM

#

fiery gull I cant use gpt 5.2 xhigh, my works it need more that 500 sec and crash

im usin on yupp

acoustic bolt Dec 16, 2025, 2:06 PM

#

Guys work the server on lm?

neat apex Dec 16, 2025, 2:06 PM

#

Gpt 5.2 tries so hard not to hallucinate that it ends up not reasoning through many things

fiery gull Dec 16, 2025, 2:06 PM

#

zealous sparrow im usin on yupp

Too

zealous sparrow Dec 16, 2025, 2:06 PM

#

neat apex Gpt 5.2 trys a lot to not allucinate and ends not reasoning many things

btw r1 turbo and ghostfalcon-20251215 got it right

fiery gull Dec 16, 2025, 2:06 PM

#

acoustic bolt Guys work the server on lm?

How so?

acoustic bolt Dec 16, 2025, 2:06 PM

#

Gemini 3 pro

neat apex Dec 16, 2025, 2:06 PM

#

When R1 was released it was the best model sometimes by far in everything, yes?

zealous sparrow Dec 16, 2025, 2:07 PM

#

neat apex When R1 was released it was the best model sometimes by far in everything, yes?

the discord designs it made had me impressed

#

it was then NERFED

#

a f- ton

#

It was able to recreate websites

neat apex Dec 16, 2025, 2:08 PM

#

Yeah, i am not tweaking

He was a buffed 4o, near the level of Sonnet 3.5 but with reasonable reasoning

compact sleet Dec 16, 2025, 2:08 PM

#

neat apex When R1 was released it was the best model sometimes by far in everything, yes?

It made the DeepSeek moment happen, if that is a thing lol

dull mason Dec 16, 2025, 2:08 PM

#

5.2 on text leaderboard wen

zealous sparrow Dec 16, 2025, 2:09 PM

#

dull mason 5.2 on text leaderboard wen

eventually

neat apex Dec 16, 2025, 2:09 PM

#

dull mason 5.2 on text leaderboard wen

He takes too much time answering, it will take weeks lmao

dull mason Dec 16, 2025, 2:09 PM

#

neat apex He takes too much time answering, it will take weeks lmao

I wonder how "code red" will hold up against Gemini

compact sleet Dec 16, 2025, 2:10 PM

#

Does LMArena have a precedent of a model being available in direct chat selection while not yet being on the leaderboards?

fleet lintel Dec 16, 2025, 2:10 PM

#

sour spear If Ghostfalcon is Flash 3, then OpenAI is in even deeper s*** than they already ...

ghostfalcon is 100% flash

sour spear Dec 16, 2025, 2:10 PM

#

neat apex He takes too much time answering, it will take weeks lmao

At this point, I wouldn't be at all surprised if Flash 3 would beat 5.2 High.

neat apex Dec 16, 2025, 2:10 PM

#

It's very true that OpenAI makes a benchmaxxed model first and only then makes it actually smart; it is very noticeable in 4o

fleet lintel Dec 16, 2025, 2:11 PM

#

zealous sparrow gemini flash 3 is defo today

I was almost certain that Flash 3 is tomorrow .. umm

compact sleet Dec 16, 2025, 2:11 PM

#

I feel like it can be used for boosting purposes if the side-by-side and battle votes are not separated.

zealous sparrow Dec 16, 2025, 2:11 PM

#

dull mason I wonder how "code red" will hold up against Gemini

Year summary: OpenAI vs Gemini

OpenAI:
2 new models that they said were good
Both flopped
Benchmaxxing king

Gemini:
2 new models
Both good
Gemini no diff

pale torrent Dec 16, 2025, 2:11 PM

#

Veo 3 please

zealous sparrow Dec 16, 2025, 2:11 PM

#

fleet lintel I was almost certain that Flash 3 is tomorrow .. umm

tuesday

neat apex Dec 16, 2025, 2:11 PM

#

Since he does many tasks perfectly, and gradually loses performance as the task gets more different

zealous sparrow Dec 16, 2025, 2:11 PM

#

They always release on a tuesday

neat apex Dec 16, 2025, 2:11 PM

#

It's sad that GPT 5.1 was a relative flop

#

It's GPT 5, but it actually makes an effort to give a better answer; it is great

zealous sparrow Dec 16, 2025, 2:12 PM

#

Google also gave us the best OCR this year

fleet lintel Dec 16, 2025, 2:12 PM

#

zealous sparrow tuesday

yeah, Logan's tweet also suggests that it will happen today. Good thing that I don't gamble, otherwise I would have placed a bet on Wednesday 🙂

zealous sparrow Dec 16, 2025, 2:12 PM

#

It will take a LOOOOOOOOONG time till someone beats gemini pro 3 OCR

dull mason Dec 16, 2025, 2:12 PM

#

sour spear At this point, I wouldn't be at all surprised if Flash 3 would beat 5.2 High.

Yeah, its teacher model is Gemini 3 Pro after all

neat apex Dec 16, 2025, 2:12 PM

#

zealous sparrow Google also gave us the best OCR this year

Ever since OCR has existed, Google has always had the best one, has the best one now, and likely always will

fiery gull Dec 16, 2025, 2:13 PM

#

zealous sparrow It will take a LOOOOOOOOONG time till someone beats gemini pro 3 OCR

I think it will be gemini 3.5 lol

compact sleet Dec 16, 2025, 2:13 PM

#

zealous sparrow It will take a LOOOOOOOOONG time till someone beats gemini pro 3 OCR

This is a no-brainer, they've been doing this since Google Lens.

zealous sparrow Dec 16, 2025, 2:13 PM

#

fiery gull I think will be gemini 3.5 lol

someone except google

compact sleet Dec 16, 2025, 2:13 PM

#

And probably no one can beat them.

zealous sparrow Dec 16, 2025, 2:13 PM

#

compact sleet This is a no brainer, they been doing this since Google lens.

yeah google lens was good

#

I also used gemini Live from the app and told it to identify things in my room

#

it didnt get one wrong

neat apex Dec 16, 2025, 2:13 PM

#

Well, Qwen3 Omni was at a level close to Gemini 2.5 and runs on a phone

compact sleet Dec 16, 2025, 2:14 PM

#

Hmm, ofc China had to have one lol

#

Perhaps you're right.

neat apex Dec 16, 2025, 2:14 PM

#

Was there a company whose prime was not its 3rd model?

weary galleon Dec 16, 2025, 2:14 PM

#

zealous sparrow gpt 5.2 xhigh officialy lost to deepseek r1 turbo and the new gemini model

GPT 5.2 is very bad.

sour spear Dec 16, 2025, 2:14 PM

#

dull mason Yeah, its teacher model is Gemini 3 Pro after all

True. Like Geralt and Ciri. 😉

zealous sparrow Dec 16, 2025, 2:15 PM

#

I want to know Flash score on simplebench

#

I hope its high

obtuse smelt Dec 16, 2025, 2:16 PM

#

well yeah need fix

plucky sparrow Dec 16, 2025, 2:17 PM

#

i don't get how some people like gpt 5.2

#

are they all paid shills?

neat apex Dec 16, 2025, 2:17 PM

#

Was each company's prime its 3rd model?
GPT 3/3.5 ✅ (GPT 4 was not way better than Claude 2)
Grok 3 ✅ (Grok 4 fast was great, but it surpassed the 1400 points)
Gemini 3 ✅ (By far)
Claude 3 ✅
Qwen 3 ✅ (By far)
Mistral 3 ❌ (The first Mixtral 8x22B was a beast, yet nowadays they barely manage to be competitive)

tiny halo Dec 16, 2025, 2:17 PM

#

we need Claude to have image import

plucky sparrow Dec 16, 2025, 2:17 PM

#

i really tried to like it, i tried various things on it, but, it feels like a model from 1.5 years ago

neat apex Dec 16, 2025, 2:18 PM

#

plucky sparrow i really tried to like it, i tried various things on it, but, it feels like a mo...

I know some people can be frustrated about gpt 5.2, but you are insane

#

What are you doing with that?

fleet lintel Dec 16, 2025, 2:19 PM

#

plucky sparrow i don't get how some people like gpt 5.2

Are you using Plus tier or Pro tier?
Plus tier is terrible.. you get medium gpt 5.2, which is like gemini flash models. They are treating their paid customers very badly 🙁

You need to use Pro or API to get the best out of gpt 5.2

compact sleet Dec 16, 2025, 2:20 PM

#

fleet lintel Are you using Plus tier or Pro tier? Plus tier is terrible.. you get medium gp...

Ouch.

neat apex Dec 16, 2025, 2:20 PM

#

Ah yes, gpt 5.2 medium

plucky sparrow Dec 16, 2025, 2:20 PM

#

i'm using it from api, even xhigh. i haven't tried 'pro' but i'm guessing it's similar to xhigh

neat apex Dec 16, 2025, 2:20 PM

#

It's only gpt 5.1 medium, but it hallucinates slightly less (to an unnoticeable level)

fleet lintel Dec 16, 2025, 2:20 PM

#

plucky sparrow i'm using it from api, even xhigh. i haven't tried 'pro' but i'm guessing it's s...

xhigh should perform .. what is your usecase?

plucky sparrow Dec 16, 2025, 2:20 PM

#

maybe 5.2 medium. but i find 5.1 way better

neat apex Dec 16, 2025, 2:21 PM

#

Why does non-xtra-high GPT 5.2 even exist? Lmao

plucky sparrow Dec 16, 2025, 2:21 PM

#

i tried it with coding, i tried it with logic, tried it with medical text, tried it with long context

sour spear Dec 16, 2025, 2:21 PM

#

neat apex I know some people can be frustrated about gpt 5.2, but you are insane

If you look at 5.2 in isolation, it is a really good model. But it's not excellent. Plus, it's behind the competition and, above all, it's excruciatingly slow. So all in all, it's neither fun to use nor impressive in any way.

plucky sparrow Dec 16, 2025, 2:21 PM

#

good at what?!?!

#

like can someone give an actual example, prompt and output?

neat apex Dec 16, 2025, 2:22 PM

#

If you use xtra high, it does very, very good analyses of documents

#

Sometimes better than opus 4.5 or Gemini 3

plucky sparrow Dec 16, 2025, 2:22 PM

#

give an example please

#

because every time i've tried it, it's worse than all the other models

neat apex Dec 16, 2025, 2:22 PM

#

My brother creates documents using Gemini 3, and then fixes them with Opus 4.5 and GPT 5.2 xtra high

plucky sparrow Dec 16, 2025, 2:23 PM

#

i might be using 5.2 medium, if xhigh is 5.2 medium

#

what kind of documents? what did it fix that opus 4.5 couldn't?

neat apex Dec 16, 2025, 2:23 PM

#

Most times 5.2 notices way more issues than opus 4.5

plucky sparrow Dec 16, 2025, 2:23 PM

#

proofreading?

neat apex Dec 16, 2025, 2:23 PM

#

I dont have it now xd

plucky sparrow Dec 16, 2025, 2:23 PM

#

ok but what kind of documents and what kind of errors?

neat apex Dec 16, 2025, 2:24 PM

#

But yes, you are right, besides xtra high, it looks to be worse than gpt 5.1

plucky sparrow Dec 16, 2025, 2:24 PM

#

i had xtra high try to create a game for me. it was terrible

#

not functional

#

or barely functional, rather

neat apex Dec 16, 2025, 2:24 PM

#

plucky sparrow ok but what kind of documents and what kind of errors?

Documents in general; he finds subtle errors and mistakes, and even gives good fix plans

plucky sparrow Dec 16, 2025, 2:25 PM

#

please be more specific than in general because every document and information i've fed it, it acts like it has low context

#

and it's all confused

#

and the few times it suggests things, it's wrong

#

my specifics are html/js code, transcripts, and medical text

#

it's failed at all three compared to other models and even gpt 5.1

#

i'm very confused what it excels at

sour spear Dec 16, 2025, 2:27 PM

#

plucky sparrow it's failed at all three compared to other models and even gpt 5.1

You have to use standardized benchmark tests, it's only optimized for that. 😉

plucky sparrow Dec 16, 2025, 2:27 PM

#

🤣

#

ok that is fair, i've not thrown any standardized texts at it, just actual use cases

tiny halo Dec 16, 2025, 2:37 PM

#

omg why don't any of the claude models have image import

worn flume Dec 16, 2025, 2:42 PM

#

Hey

#

I'm giving away an apple 🍎

lunar glade Dec 16, 2025, 2:55 PM

#

can we all agree that LMArena's banana pro is spoiled? I just can't get any result other than "Something went wrong with this response, please try again."

sour spear Dec 16, 2025, 2:58 PM

#

#

But it is a little crash-prone atm, needed two attempts to work. 😉

tribal kernel Dec 16, 2025, 3:01 PM

#

Hi everyone, I'm using Arena, but it always gives me this error, how can I fix it? Thanks

magic ravine Dec 16, 2025, 3:05 PM

#

Yep. It's been happening all day.

sour spear Dec 16, 2025, 3:11 PM

#

They're probably busy implementing Gemini 3 Flash Image. 😉

echo aurora Dec 16, 2025, 3:12 PM

#

tribal kernel Hi everyone, I'm using Arena, but it always gives me this error, how can I fix i...

Sorry to say we've been experiencing a higher than usual error rate with this model. It's also possible that you're hitting a rate limit.

#

If you haven't already, I'd recommend trying: a hard refresh of the site, clearing your cookies/cache, and, if no luck, starting a new chat. This may help.

tribal kernel Dec 16, 2025, 3:14 PM

#

echo aurora If you haven't already would recommend to try: hard refresh of site, clear your ...

Thanks for the reply, no.... I use the service 3 times a day so I don't think I exceeded the limit, also because the model usually tells me to wait 50 minutes, but this time it didn't tell me anything.

echo aurora Dec 16, 2025, 3:16 PM

#

tribal kernel Thanks for the reply, no.... I use the service 3 times a day so I don't think I ...

"the model tells me to wait 50 minutes, but it didn't tell me anything."
It's a bit confusing, but both error messages can be caused by a rate limit.

tribal kernel Dec 16, 2025, 3:16 PM

#

echo aurora Sorry to say we've been experiencing a higher than usual error rate with this mo...

echo aurora Dec 16, 2025, 3:19 PM

#

tribal kernel

Can you: open Developer Tools -> open Network tab -> run a new prompt throwing an error -> in the search bar in dev tools search for the word "Stream" -> open the file that has the Eval ID (random set of numbers/letters you see in the URL) -> and then look for a Status Code.

#

Does it say 🟢 Status Code 200, or do you see 🔴 Status Code 429, 400, etc?
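
For anyone following these steps, here is a rough sketch of how the status code you find could be read. It is only an illustration: `classify_status` is a hypothetical helper, not anything LMArena provides, and the labels are assumptions based on standard HTTP semantics (200 is OK, 429 is Too Many Requests).

```python
def classify_status(code: int) -> str:
    """Map an HTTP status code to a rough troubleshooting label (labels are assumed)."""
    if 200 <= code < 300:
        return "ok"
    if code == 429:
        # Standard HTTP meaning: Too Many Requests, i.e. a rate limit.
        return "rate_limited"
    if 400 <= code < 500:
        return "client_error"
    if 500 <= code < 600:
        return "server_error"
    return "other"


for code in (200, 400, 429, 500):
    print(code, classify_status(code))
```

A "rate_limited" result suggests waiting before retrying; anything else is worth reporting along with the code itself.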

sour spear Dec 16, 2025, 3:20 PM

#

#

Works after a handful of retries. 🙂

obtuse smelt Dec 16, 2025, 3:22 PM

#

is status red

tribal kernel Dec 16, 2025, 3:35 PM

#

echo aurora Can you: open Developer Tools -> open Network tab -> run a new prompt throwing a...

Hi, I checked DevTools in detail.

Network → Fetch/XHR is working and requests return 200, but no stream request is ever created.
Searching for stream or eval shows nothing.

This means the stream never starts at all, so there is no Status Code to inspect.
The error “something went wrong with this response” happens before the streaming endpoint is created.

Looks like a backend / model issue on LMArena, not browser or client-side.
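
The same check can be done offline if the Network tab is exported as a HAR file. The sketch below assumes the standard HAR layout (`log.entries[].request` and `.response`); the function name `stream_requests` and the example.com URLs are made up for illustration and are not LMArena's real endpoints.

```python
import json


def stream_requests(har_text: str, needle: str = "stream") -> list[tuple[str, str, int]]:
    """Return (method, url, status) for every HAR entry whose URL contains `needle`."""
    har = json.loads(har_text)
    hits = []
    for entry in har.get("log", {}).get("entries", []):
        url = entry["request"]["url"]
        if needle.lower() in url.lower():
            hits.append((entry["request"]["method"], url, entry["response"]["status"]))
    return hits


# Tiny hand-written HAR sample (hypothetical URLs).
sample_har = json.dumps({
    "log": {
        "entries": [
            {"request": {"method": "GET", "url": "https://example.com/api/models"},
             "response": {"status": 200}},
            {"request": {"method": "PUT", "url": "https://example.com/api/stream/abc123"},
             "response": {"status": 429}},
        ]
    }
})

hits = stream_requests(sample_har)
print(hits)
```

An empty result would match the situation described here: regular requests succeed, but no stream request was ever created.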

echo aurora Dec 16, 2025, 3:37 PM

#

tribal kernel Hi, I checked DevTools in detail. Network → Fetch/XHR is working and requests r...

Okay, good to know, thank you for sharing. And you're sure you had the Network tab open when running another prompt to throw the error message again, yeah?

tribal kernel Dec 16, 2025, 3:38 PM

#

echo aurora Okay good to know, thank you for sharing. And you're sure you have the network t...

I had DevTools open on the Network tab the whole time, with Fetch/XHR enabled, before and during sending the prompt.

I retried multiple times and the error appears, but no stream (or eval) request is ever created.
Only regular fetch requests return 200, then the UI shows “something went wrong with this response”.

So the request seems to fail before the streaming endpoint is initialized.

echo aurora Dec 16, 2025, 3:40 PM

#

tribal kernel I had DevTools open on the Network tab the whole time, with Fetch/XHR enabled, b...

Understood, thank you for the details. Yeah, this sounds like it's on our end then for sure. Like I mentioned, this model has been erroring out at a higher rate than usual. Our team has been looking to lower this as much as possible, but I'll be sure to bump this again. Thanks for the info.

tribal kernel Dec 16, 2025, 3:42 PM

#

echo aurora Understood, thank you for the details. Yeah this sounds like it's on our end the...

Thank you, if I can help I am at your disposal

viscid cloak Dec 16, 2025, 3:55 PM

#

No models 😢

echo aurora Dec 16, 2025, 3:56 PM

#

Hmm

#

What browser are you on? Seeing the same for Side by Side?

#

Can't say I'm seeing the same on my end.

viscid cloak Dec 16, 2025, 3:57 PM

#

i tried google ios and firefox ios, same results for Direct. lemme try sidebyside

#

Sidebyside worked well, and Direct turned normal. may just be a minor error

echo aurora Dec 16, 2025, 4:00 PM

#

Okay glad to hear it's working again. Keep me updated if things change.

viscid cloak Dec 16, 2025, 4:01 PM

#

hazel revealed?

torn mantle Dec 16, 2025, 4:02 PM

#

Last time i checked lmarena are adding 3 new things

#

Video models
Auto modality
New model selector

atomic lagoon Dec 16, 2025, 4:09 PM

#

#

Lmaoooo

#

The image is silly

molten cipher Dec 16, 2025, 4:15 PM

#

tribal kernel Hi everyone, I'm using Arena, but it always gives me this error, how can I fix i...

just go on gemini's web itself..

#

it free

steel steeple Dec 16, 2025, 4:21 PM

#

Why doesn't pro banana work anymore?

echo aurora Dec 16, 2025, 4:22 PM

#

@quaint raptor you'll want to review the information in #1397655624103493813 for a better understanding on how to use the bot.

steel steeple Dec 16, 2025, 4:22 PM

#

Screenshot_2025-12-16-16-31-36-58_40deb401b9ffe8e1df2f1cc5ba480b12.jpg

echo aurora Dec 16, 2025, 4:23 PM

#

steel steeple Why pro banana doesn't work anymore

Sorry to say this model has had a pretty high error rate. Our team is looking into a fix asap.

steel steeple Dec 16, 2025, 4:24 PM

#

echo aurora Sorry to say this model has been pretty high error rate. Our team is looking int...

Hope this will work soon🙏🙏🙏

echo aurora Dec 16, 2025, 4:24 PM

#

steel steeple Hope this will work soon🙏🙏🙏

Same. Fingers crossed.

steel steeple Dec 16, 2025, 4:24 PM

#

Love your service, thanks

zealous sparrow Dec 16, 2025, 4:25 PM

#

i wonder if ghostfalcon is flash too, Maybe gem 3 pro?

echo aurora Dec 16, 2025, 4:25 PM

#

steel steeple Love your service, thanks

Thank you! Very glad to hear it!

torn mantle Dec 16, 2025, 4:26 PM

#

zealous sparrow i wonder if ghostfalcon is flash too, Maybe gem 3 pro?

nah its flash

grand flame Dec 16, 2025, 4:29 PM

#

gemini died

ripe mountain Dec 16, 2025, 4:30 PM

#

poll_question_text

Open-Source SOTA for Agentic Coding

victor_answer_votes

1

total_votes

2

zealous sparrow Dec 16, 2025, 4:32 PM

#

torn mantle nah its flash

guess the leak only covers fiercefalcon for rn

fleet lintel Dec 16, 2025, 4:32 PM

#

zealous sparrow i wonder if ghostfalcon is flash too, Maybe gem 3 pro?

It's flash.

winter rain Dec 16, 2025, 4:32 PM

#

Is lmarena a permanent video generator

zealous sparrow Dec 16, 2025, 4:32 PM

#

fleet lintel It's flash.

yeah prob so

fleet lintel Dec 16, 2025, 4:32 PM

#

Only difference between ghost and fierce is search on or off

#

My guess is ghost is with search

echo aurora Dec 16, 2025, 4:33 PM

#

winter rain Is lmarena a permanent video generator

Hmm what do you mean by this?

fleet lintel Dec 16, 2025, 4:33 PM

#

zealous sparrow i wonder if ghostfalcon is flash too, Maybe gem 3 pro?

What is the link to the original source?

winter rain Dec 16, 2025, 4:34 PM

#

@echo aurora free lifetime?? Or any chance of premium

echo aurora Dec 16, 2025, 4:35 PM

#

winter rain @echo aurora free lifetime?? Or any chance of premium

chance of premium
Can't say I'm aware of these plans.

free lifetime
This is our intention.

zealous sparrow Dec 16, 2025, 4:35 PM

#

fleet lintel What is the link to the original source?

twitter post

grand flame Dec 16, 2025, 4:36 PM

#

i think gemini 3.0 pro died

#

since it gives me something went wrong everytime

zealous sparrow Dec 16, 2025, 4:36 PM

#

works in AI studio

winter rain Dec 16, 2025, 4:36 PM

#

@echo aurora we respect your hard work bro??

muted timber Dec 16, 2025, 4:37 PM

#

hello guys

#

i have a problem

echo aurora Dec 16, 2025, 4:37 PM

#

muted timber hello guys

hello ablobwave

muted timber Dec 16, 2025, 4:37 PM

#

on the AI

grand flame Dec 16, 2025, 4:37 PM

#

grand flame i think gemini 3.0 pro died

at lmarena

muted timber Dec 16, 2025, 4:37 PM

#

#

not working anymore

zealous sparrow Dec 16, 2025, 4:37 PM

#

um

muted timber Dec 16, 2025, 4:37 PM

#

and i dont want to create a new chat

zealous sparrow Dec 16, 2025, 4:37 PM

#

pineapple, is your endpoint down?

echo aurora Dec 16, 2025, 4:37 PM

#

grand flame since it gives me something went wrong everytime

It seems to be working for me.

Can you: open Developer Tools -> open Network tab -> run a new prompt throwing an error -> in the search bar in dev tools search for the word "Stream" -> open the file that has the Eval ID (random set of numbers/letters you see in the URL) -> and then look for a Status Code.

zealous sparrow Dec 16, 2025, 4:37 PM

#

nevermind huh

#

wait ill test

winter rain Dec 16, 2025, 4:38 PM

#

But how can u access those tool bro lmarena

grand flame Dec 16, 2025, 4:38 PM

#

echo aurora It seems to be working for me. Can you: open Developer Tools -> open Network t...

nvm fixed rn

echo aurora Dec 16, 2025, 4:38 PM

#

echo aurora It seems to be working for me. Can you: open Developer Tools -> open Network t...

It's most common that you're hitting a rate limit, can you try these steps @muted timber and let me know what that status code is?

muted timber Dec 16, 2025, 4:39 PM

#

What do you mean?

echo aurora Dec 16, 2025, 4:39 PM

#

muted timber What do you mean?

Can you: open Developer Tools -> open Network tab -> run a new prompt throwing an error -> in the search bar in dev tools search for the word "Stream" -> open the file that has the Eval ID (random set of numbers/letters you see in the URL) -> and then look for a Status Code.

muted timber Dec 16, 2025, 4:40 PM

#

open Developer Tools

#

where is it??

grand flame Dec 16, 2025, 4:40 PM

#

muted timber open Developer Tools

F12

echo aurora Dec 16, 2025, 4:41 PM

#

muted timber open Developer Tools

It's a browser setting/option

muted timber Dec 16, 2025, 4:41 PM

#

i know now

#

but

#

" run a new prompt throwing an error"

echo aurora Dec 16, 2025, 4:42 PM

#

muted timber " run a new prompt throwing an error"

Yeah so with it open, run a new prompt (in LMArena) that'll then result in an error

muted timber Dec 16, 2025, 4:42 PM

#

echo aurora Dec 16, 2025, 4:42 PM

#

This is how we get better information to understand what's going wrong.

muted timber Dec 16, 2025, 4:42 PM

#

so i need to chat another time ?

cosmic salmon Dec 16, 2025, 4:43 PM

#

muted timber

Oh yeah, same thing, but also with Nano Banana Pro sometimes. It seems like the fetch request returns nothing after a while, but still goes through with 200 code

muted timber Dec 16, 2025, 4:43 PM

#

i need for scripting

echo aurora Dec 16, 2025, 4:43 PM

#

muted timber

Follow the rest of the steps: in the search bar in dev tools search for the word "Stream" -> open the file that has the Eval ID (random set of numbers/letters you see in the URL) -> and then look for a Status Code.

zealous sparrow Dec 16, 2025, 4:44 PM

#

echo aurora Follow the rest of the steps: in the search bar in dev tools search for the word...

theres some 429 status showing in his network tab

echo aurora Dec 16, 2025, 4:45 PM

#

zealous sparrow theres some 429 status showing in his network tab

Okay yeah that's rate limit.

muted timber Dec 16, 2025, 4:45 PM

#

echo aurora Follow the rest of the steps: in the search bar in dev tools search for the word...

English, please

echo aurora Dec 16, 2025, 4:45 PM

#

Need to wait to use again

muted timber Dec 16, 2025, 4:45 PM

#

i waited one day

#

i texted for the first time today

#

15 minutes ago

echo aurora Dec 16, 2025, 4:46 PM

#

muted timber i waited one day

I was directing that at pro pro, because they were able to provide a Status Code

#

I'm not sure what your Status Code is

cosmic salmon Dec 16, 2025, 4:46 PM

#

cosmic salmon Oh yeah, same thing, but also with Nano Banana Pro sometimes. It seems like the ...

In my case i get this as the response:

b3:"Error during image generation with google-genai for model endpoint gemini-3-pro-image-preview: Failed to fetch image: Too Many Requests"

for https://lmarena.ai/nextjs-api/stream/retry-evaluation-session-message/ PUT request
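
Since the HTTP status can apparently be 200 while the body still carries an error line like the one quoted above, the status code alone may not tell the whole story. Below is a minimal sketch of scanning a response body for such lines. The `prefix:"json string"` line format is inferred from that single quoted line and is an assumption, as is the helper name `find_stream_errors`.

```python
import json


def find_stream_errors(body: str) -> list[str]:
    """Collect string payloads that start with 'Error' from `prefix:payload` lines."""
    errors = []
    for line in body.splitlines():
        _prefix, sep, payload = line.partition(":")
        if not sep:
            continue  # no "prefix:payload" shape on this line
        try:
            value = json.loads(payload)
        except json.JSONDecodeError:
            continue  # payload is not JSON; ignore it
        if isinstance(value, str) and value.startswith("Error"):
            errors.append(value)
    return errors


sample_body = (
    'b3:"Error during image generation with google-genai for model endpoint '
    'gemini-3-pro-image-preview: Failed to fetch image: Too Many Requests"'
)

errors = find_stream_errors(sample_body)
is_rate_limited = any("Too Many Requests" in e for e in errors)
print(len(errors), is_rate_limited)
```

In this sample the flag comes out true, which would point to an upstream rate limit even though the request itself returned 200.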

muted timber Dec 16, 2025, 4:46 PM

#

zealous sparrow Dec 16, 2025, 4:47 PM

#

echo aurora I was directing that at pro pro, because they were able to provide a Status Code

Well, It was actually at him. I read it off the image he scribbled on.

fossil fable Dec 16, 2025, 4:48 PM

#

3pig broken smh

echo aurora Dec 16, 2025, 4:48 PM

#

zealous sparrow Well, It was actually at him. I read it off the image he scribbled on.

Ah, thank you.

echo aurora Dec 16, 2025, 4:48 PM

#

muted timber

Okay yeah, a rate limit is why you're hitting this error

muted timber Dec 16, 2025, 4:48 PM

#

Yes, but when I open a new chat with him, he lets me use it, and I don't want to do that because I have all my information in that chat, and he already has all the information from all the files, because I have 20+.

muted timber Dec 16, 2025, 4:49 PM

#

echo aurora Okay yeah it's rate limit for why you're hitting this error

so i need only to wait? another one day?

fossil fable Dec 16, 2025, 4:49 PM

#

can ppl stop calling chatbots he smh

cosmic salmon Dec 16, 2025, 4:49 PM

#

muted timber Yes, but when I chat with him, he lets me use it, and I don't want to because I ...

Same. I've noticed that, when you come back to your chat with Opus after a while, it just stops working for some reason

echo aurora Dec 16, 2025, 4:50 PM

#

muted timber so i need only to wait? another one day?

There could be something else that's causing an error @cosmic salmon . However, if you're seeing a 429 Status Code, that means it's a rate limit. And yes, @muted timber will need to wait.

fossil fable Dec 16, 2025, 4:51 PM

#

where can i smooch off 3pig access whilst it's not working on lmarena

muted timber Dec 16, 2025, 4:52 PM

#

That's why yesterday he told me to wait 49 minutes (last night), and now I have to work continuously, it will be much harder for me this way... but what can you do...

cosmic salmon Dec 16, 2025, 4:52 PM

#

echo aurora There could be something else that's causing an error @cosmic salmon . Ho...

Nope, that's a 200 Status Code, so i assume that this is either related to the Gemini API or LMarena is losing my image somehow on the way to me

#

I'll try to log out and change my browser to see if this issue still persists or nah

zealous sparrow Dec 16, 2025, 4:52 PM

#

cosmic salmon Nope, that's a 200 Status Code, so i assume that this is either related to the G...

The Gemini API has to be working, I just used it on AI Studio

echo aurora Dec 16, 2025, 4:52 PM

#

cosmic salmon Nope, that's a 200 Status Code, so i assume that this is either related to the G...

You're getting a 🟢 Status Code 200 when you get an error?

fossil fable Dec 16, 2025, 4:53 PM

#

cosmic salmon In my case i get this as the response: ``` b3:"Error during image generation wit...

im getting a generic error

cosmic salmon Dec 16, 2025, 4:53 PM

#

echo aurora You're getting a 🟢 Status Code 200 when you get an error?

Yeah, for the PUT fetch request to https://lmarena.ai/nextjs-api/stream/retry-evaluation-session-message/ when retrying my Something went wrong response

cosmic salmon Dec 16, 2025, 4:54 PM

#

fossil fable im getting a generic error

It's always generic, dev tools shows this as the response to the request

long jackal Dec 16, 2025, 5:04 PM

#

Is there any way to upload files other than images for the AIs in LMArena?

zealous sparrow Dec 16, 2025, 5:08 PM

#

long jackal Is there any way to upload files other than images for the AIs in LMArena?

no

half mist Dec 16, 2025, 5:08 PM

#

ChatGPT’s new image model just released!

zealous sparrow Dec 16, 2025, 5:08 PM

#

half mist ChatGPT’s new image model just released!

images v2?

#

or well

#

gpt image 2

half mist Dec 16, 2025, 5:09 PM

#

zealous sparrow images v2?

It allows copyrighted characters and political figures, which were rejected in previous versions.

cloud zinc Dec 16, 2025, 5:09 PM

#

nano banana pro is better

half mist Dec 16, 2025, 5:09 PM

#

Also is way faster

#

than gpt image 1

fossil fable Dec 16, 2025, 5:09 PM

#

cosmic salmon It's always generic, dev tools shows this as the response to the request

right

fossil fable Dec 16, 2025, 5:10 PM

#

half mist ChatGPT’s new image model just released!

What

#

wait did it

half mist Dec 16, 2025, 5:10 PM

#

fossil fable wait did it

It did, just not with a huge announcement for some reason, just a post on the OpenAI Discord

zealous sparrow Dec 16, 2025, 5:10 PM

#

half mist It allows for copyrighted characters, and political figures which were originall...

what degree of copyright

cloud zinc Dec 16, 2025, 5:10 PM

#

half mist It did, just not with a huge announcement for some reason just a post on the Ope...

where to use it

half mist Dec 16, 2025, 5:11 PM

#

cloud zinc where to use it

On the ChatGPT website or app

#

It’s out right now

cosmic salmon Dec 16, 2025, 5:11 PM

#

half mist It did, just not with a huge announcement for some reason just a post on the Ope...

Yeah, that was a preview, a sneak-peek for the announcement that's going to be soon.

fossil fable Dec 16, 2025, 5:12 PM

#

holy sh t it's releasing

but it's not out right now is it

openai discord just says this though

half mist Dec 16, 2025, 5:12 PM

#

zealous sparrow what degree of copyright

Well, way less strict with copyrighted characters than Sora 2 for sure

half mist Dec 16, 2025, 5:12 PM

#

fossil fable holy sh t it's releasing but it's not out right now is it openai discord just ...

It is

#

It’s out

#

I tested it myself

fossil fable Dec 16, 2025, 5:12 PM

#

there's nothing on openai.com

half mist Dec 16, 2025, 5:12 PM

#

fossil fable there's nothing on openai.com

It just released, so it’s probably not there yet

grand flame Dec 16, 2025, 5:12 PM

#

bruh broke again

zealous sparrow Dec 16, 2025, 5:12 PM

#

half mist Well, way less strict with copyrighted characters than Sora 2 for sure

sure the whole disney uh scale
but if you can generate goku and stuff yeah OpenAI just wants a lawsuit

half mist Dec 16, 2025, 5:12 PM

#

Anyways, here is an image made with gpt image 2

zealous sparrow Dec 16, 2025, 5:13 PM

#

half mist Anyways, here is an image made with gpt image 2

im pretty sure they dont have the rights to this

fossil fable Dec 16, 2025, 5:13 PM

#

so openai launches it on the one day of the year i need image edit

#

...

#

half mist Dec 16, 2025, 5:13 PM

#

zealous sparrow im pretty sure they dont have the rights to this

Would say the quality degraded a bit in my opinion

compact flame Dec 16, 2025, 5:13 PM

#

half mist Anyways, here is an image made with gpt image 2

I still think nano banana looks better though

zealous sparrow Dec 16, 2025, 5:14 PM

#

half mist Would say the quality degraded a bit in my opinion

aren't they afraid to be sued by uh

#

who has spongebob rights

fossil fable Dec 16, 2025, 5:14 PM

#

compact flame I still think nano banana looks better though

it's one image you need to check out more

fossil fable Dec 16, 2025, 5:14 PM

#

zealous sparrow who has spongebob rights

paramount

half mist Dec 16, 2025, 5:14 PM

#

zealous sparrow who has spongebob rights

They will probably patch this soon and not allow copyrighted characters in the future

fossil fable Dec 16, 2025, 5:15 PM

#

at least now we know to rush before it's locked up

#

we didn't get to rush sora 2 because we thought it'd be normal

cloud zinc Dec 16, 2025, 5:15 PM

#

just use nano banana pro

zealous sparrow Dec 16, 2025, 5:16 PM

#

fossil fable paramount

paramount is dead livid about copyright, they are cooked if they don't act

magic ravine Dec 16, 2025, 5:16 PM

#

cloud zinc just use nano banana pro

Can't. Keeps giving me the error

half mist Dec 16, 2025, 5:16 PM

#

Here is an image of Donald Trump with Sam Altman using GPT Image 2

zealous sparrow Dec 16, 2025, 5:16 PM

#

half mist They will probably patch this soon and not allow copyrighted characters in the f...

defo

#

paramount is already planning to sue if they see this

compact flame Dec 16, 2025, 5:16 PM

#

half mist Here is an image of Donald Trump with Sam Altman using GPT Image 2

Okay this one looks kinda good

grand flame Dec 16, 2025, 5:17 PM

#

half mist Here is an image of Donald Trump with Sam Altman using GPT Image 2

now try gemini 3.0 pro image nanobanana

fossil fable Dec 16, 2025, 5:17 PM

#

half mist Here is an image of Donald Trump with Sam Altman using GPT Image 2

Holy fuuuuuuuuuuuuuuuuuuuuuuuuuuuuck we got a gold rush inbound

cloud zinc Dec 16, 2025, 5:17 PM

#

fossil fable Holy fuuuuuuuuuuuuuuuuuuuuuuuuuuuuck we got a gold rush inbound

u can already do that in nano banana pro

zealous sparrow Dec 16, 2025, 5:17 PM

#

half mist Here is an image of Donald Trump with Sam Altman using GPT Image 2

this will be blocked quickly

#

trust

half mist Dec 16, 2025, 5:17 PM

#

grand flame now try gemini 3.0 pro image nanobanana

Alright

fossil fable Dec 16, 2025, 5:17 PM

#

let me try and get it to expand an image like nbp can do

half mist Dec 16, 2025, 5:18 PM

#

grand flame now try gemini 3.0 pro image nanobanana

grand flame Dec 16, 2025, 5:18 PM

#

half mist

looks nice

fossil fable Dec 16, 2025, 5:19 PM

#

nbp lets me seamlessly expand images to 17:6 for use as a profile banner

i think i'm finally gonna update my banner after like 2-3 years

magic ravine Dec 16, 2025, 5:19 PM

#

half mist Here is an image of Donald Trump with Sam Altman using GPT Image 2

Image 2 isn't on LMArena

fossil fable Dec 16, 2025, 5:20 PM

#

magic ravine Image 2 isn't on LMArena

ik but they say that it's on chatgpt

magic ravine Dec 16, 2025, 5:20 PM

#

fossil fable ik but they say that it's on chatgpt

Ah, makes sense

half mist Dec 16, 2025, 5:20 PM

#

fossil fable ik but they say that it's on chatgpt

It is. Go test it out yourself

fossil fable Dec 16, 2025, 5:21 PM

#

i dont use twitter so i have no reason to generate 500 political images but enjoy the gold rush people

fossil fable Dec 16, 2025, 5:22 PM

#

half mist It is. Go test it out yourself

have you tested it enough to tell whether it's better than 2.5fig

half mist Dec 16, 2025, 5:23 PM

#

fossil fable have you tested it enough to tell whether it's better than 2.5fig

I would say GPT Image 2 is better than Nano Banana, but not better than Nano Banana Pro

fossil fable Dec 16, 2025, 5:24 PM

#

that's why i said 2.5fig

zealous sparrow Dec 16, 2025, 5:24 PM

#

half mist I would say GPT Image 2 is better than Nano Banana, but not better than Nano Banana...

you cant judge yet

#

you know why

#

google is dropping nano banana 3 flash today

#

lol

fossil fable Dec 16, 2025, 5:24 PM

#

zealous sparrow you cant judge yet

ik

that's why i asked if he's tested it enough

fossil fable Dec 16, 2025, 5:24 PM

#

zealous sparrow google is dropping nano banana 3 flash today

really

#

are you sure

void elm Dec 16, 2025, 5:24 PM

#

zealous sparrow google is dropping nano banana 3 flash today

too late

zealous sparrow Dec 16, 2025, 5:24 PM

#

fossil fable :really:

it is a tuesday

half mist Dec 16, 2025, 5:24 PM

#

zealous sparrow google is dropping nano banana 3 flash today

Wait, Gemini 3 Flash? I thought it was still 2.5 Flash

zealous sparrow Dec 16, 2025, 5:24 PM

#

tuesday is new day

zealous sparrow Dec 16, 2025, 5:25 PM

#

half mist Wait, Gemini 3 Flash? I thought it was still 2.5 Flash

not yet

#

it hasnt dropped yet

#

google always drops stuff on a tuesday

torn mantle Dec 16, 2025, 5:25 PM

#

https://x.com/testingcatalog/status/2000965497110573307

TestingCatalog News 🗞 (@testingcatalog)

BREAKING 🚨: It looks like the Image 2 model rollout has started already for ChatGPT.

It would also come along with a new UI for the Image tab with a style selector and a prompt bar.

Did you get it too? 👀

#

image 2 model is already out for some people

#

also gemini 3 flash prob tomorrow?

zealous sparrow Dec 16, 2025, 5:26 PM

#

torn mantle also gemini 3 flash prob tomorrow?

google releases stuff on a tuesday usually

#

and there is already an entry on vertex for 3 flash/fiercefalcon

half mist Dec 16, 2025, 5:27 PM

#

zealous sparrow and there is already an entry on vertex for 3 flash/fiercefalcon

Will Gemini 3 Flash be better at coding than 2.5 Flash?

void elm Dec 16, 2025, 5:27 PM

#

???

#

how is that a question

zealous sparrow Dec 16, 2025, 5:28 PM

#

half mist Will Gemini 3 Flash be better at coding than 2.5 Flash?

2.5 pro even

neat apex Dec 16, 2025, 5:30 PM

#

Flash 2.5 09 is near 2.5 Pro level (as always when they release a new flash model)

#

and it looks like nothing changed, 3.0 Flash should be very close to 3.0 Pro level, but without the overreasoning issue now

magic ravine Dec 16, 2025, 5:30 PM

#

Oh, image 2 has just released?

zealous sparrow Dec 16, 2025, 5:30 PM

#

yea

magic ravine Dec 16, 2025, 5:31 PM

#

How is it? Still nerfed with fictional characters?

neat apex Dec 16, 2025, 5:31 PM

#

nope, they fixed that issue

zealous sparrow Dec 16, 2025, 5:32 PM

#

magic ravine How is it? Still nerfed with fictional characters?

barely any 3rd party guardrails rn

magic ravine Dec 16, 2025, 5:32 PM

#

zealous sparrow barely any 3rd party guardrails rn

Nice, and it's available on the free tier?

zealous sparrow Dec 16, 2025, 5:33 PM

#

magic ravine Nice, and it's available on the free tier?

for a few gens i guess

#

this is neat too

torn mantle Dec 16, 2025, 5:37 PM

#

flash wen 😖

zealous sparrow Dec 16, 2025, 5:37 PM

#

this could be images v1 im not sure

#

i dont know if i have v2 yet

grand flame Dec 16, 2025, 5:38 PM

#

hell nah

#

status 200 btw

zealous sparrow Dec 16, 2025, 5:40 PM

#

also flowith tweeted this

#

so flash might be today

grand flame Dec 16, 2025, 5:40 PM

#

grand flame status 200 btw

@echo aurora

cloud zinc Dec 16, 2025, 5:40 PM

#

zealous sparrow also flowith tweeted this

they dont have insider info

torn mantle Dec 16, 2025, 5:41 PM

#

cloud zinc they dont have insider info

do u?

echo aurora Dec 16, 2025, 5:41 PM

#

grand flame hell nah

Thank you for sharing. I'm not surprised the infinite generation bug wouldn't have an error status code associated with it

zealous sparrow Dec 16, 2025, 5:42 PM

#

cloud zinc they dont have insider info

sure not but

#

they are already prepping

#

its this week sure

#

but when

neat apex Dec 16, 2025, 5:43 PM

#

i am hoping they launch Gemma 4 (which is very likely soon) and someone finetunes it to be responsive like the new generation

#

it would be at the same level as Opus 4.5 lmao

zealous sparrow Dec 16, 2025, 5:43 PM

#

neat apex i am hoping they launch Gemma 4 (which is very likely soon) and someone finetune ...

them asking us to stalk their huggingface page most likely says its this week

half mist Dec 16, 2025, 5:44 PM

#

zealous sparrow also flowith tweeted this

Gemini 3 flash is what I am more excited for since I don’t really care about the flash image generation

zealous sparrow Dec 16, 2025, 5:44 PM

#

I give em 4h to release it

#

if it aint released

#

oh well another day

autumn snow Dec 16, 2025, 5:45 PM

#

how to use prompt here

empty stump Dec 16, 2025, 5:46 PM

#

8% chance it releases today

zealous sparrow Dec 16, 2025, 5:46 PM

#

empty stump 8% chance it releases today

nah strong 50

neat apex Dec 16, 2025, 5:46 PM

#

i would say 50%

zealous sparrow Dec 16, 2025, 5:47 PM

#

logan tweeted 3 thunders today

neat apex Dec 16, 2025, 5:47 PM

#

they only delayed Gemini 3 pro a lot to make it 100% consistent

grand flame Dec 16, 2025, 5:47 PM

#

my chat died

#

sad

grand flame Dec 16, 2025, 5:47 PM

#

grand flame my chat died

i tried to make a new message and it instantly went to "something went wrong"