#general | Arena | Page 216

neat apex Dec 10, 2025, 8:25 PM

#

Its because the only thing they are eating in the crisis

compact flame Dec 10, 2025, 8:26 PM

#

neat apex Its because the only thing they are eating in the crisis

Sounds valid they be saving on food to train gpt 5.2

neat apex Dec 10, 2025, 8:27 PM

#

They already turned off the code red?

waxen fern Dec 10, 2025, 8:27 PM

#

@echo aurora please remove rate limits

neat apex Dec 10, 2025, 8:27 PM

#

Or its on yet?

echo aurora Dec 10, 2025, 8:27 PM

#

waxen fern <@283397944160550928> please remove rate limits

I've answered this already a few times now.

compact flame Dec 10, 2025, 8:27 PM

#

waxen fern <@283397944160550928> please remove rate limits

I don't think pineapple can edit the website though

waxen fern Dec 10, 2025, 8:27 PM

#

echo aurora I've answered this already a few times now.

Can you remove it now?

neat apex Dec 10, 2025, 8:27 PM

#

waxen fern <@283397944160550928> please remove rate limits

If you want talk faster, i recomend you to edit the previous mesage

#

I always do that, and nobody cares
Since it dont break any rules and not annoy enought
But dont spam pls

tardy plover Dec 10, 2025, 8:28 PM

#

Claude opus 4-1 is infinitely stuck on "generating"

#

Tried reloading, relogging

sterile tartan Dec 10, 2025, 8:29 PM

#

Is not stuck

#

Is done for it

narrow comet Dec 10, 2025, 8:29 PM

#

how use 4.1?

neat apex Dec 10, 2025, 8:29 PM

#

You who are stuck, using Opus 4.1 yet, they are already shutting it down

tardy plover Dec 10, 2025, 8:29 PM

#

What do i do, i have lots of context in the chat

#

I cant do anything rn

neat apex Dec 10, 2025, 8:29 PM

#

Aaah yes, that explain a lot

sterile tartan Dec 10, 2025, 8:30 PM

#

tardy plover What do i do, i have lots of context in the chat

U more likely have hit the context window

neat apex Dec 10, 2025, 8:30 PM

#

Open a notebook and start copying mesages, sadly its the only thing you can do

tardy plover Dec 10, 2025, 8:30 PM

#

sterile tartan U more likely have hit the context window

I have longer chats too

neat apex Dec 10, 2025, 8:30 PM

#

But sending multiple mesages in a chat does not bug it most times, they continue ordinary

sterile tartan Dec 10, 2025, 8:30 PM

#

tardy plover I have longer chats too

It depends on the model and such

tardy plover Dec 10, 2025, 8:30 PM

#

This is so annoying

#

With same model

neat apex Dec 10, 2025, 8:31 PM

#

Be grate you are not paying anything or have a 10 mesages context of yupp

tardy plover Dec 10, 2025, 8:31 PM

#

Sometimes shi like this happens and then resolves itself automatically

sterile tartan Dec 10, 2025, 8:33 PM

#

neat apex Be grate you are not paying anything or have a 10 mesages context of yupp

💀

#

Bro roasted it

sterile tartan Dec 10, 2025, 8:33 PM

#

tardy plover Sometimes shi like this happens and then resolves itself automatically

Then let it be

#

It might resolve by itself

#

What do you chat anyways

#

🤔

neat apex Dec 10, 2025, 8:33 PM

#

I recomend you copying you whole context anyway, since you can out that in notebook and do somethings

sterile tartan Dec 10, 2025, 8:34 PM

#

neat apex I recomend you copying you whole context anyway, since you can out that in noteb...

NotebookLM

neat apex Dec 10, 2025, 8:34 PM

#

NotebookLM too

tardy plover Dec 10, 2025, 8:34 PM

#

sterile tartan What do you chat anyways

Random shi, this was for studies

sterile tartan Dec 10, 2025, 8:34 PM

#

neat apex NotebookLM too

Should probably ask for a summary of all essential context to use it in a new chat

tardy plover Dec 10, 2025, 8:35 PM

#

Cant ask anything cuz its stuck on generating

sterile tartan Dec 10, 2025, 8:35 PM

#

tardy plover Random shi, this was for studies

Wow people can study with LMarena didn't knew that

latent crest Dec 10, 2025, 8:35 PM

#

What’s Ernie ?!

tardy plover Dec 10, 2025, 8:35 PM

#

sterile tartan Wow people can study with LMarena didn't knew that

Gemini 3 pro is really good for academics, claude too

tardy plover Dec 10, 2025, 8:35 PM

#

tardy plover Cant ask anything cuz its stuck on generating

Is there anything i can try

sterile tartan Dec 10, 2025, 8:35 PM

#

tardy plover Cant ask anything cuz its stuck on generating

If it doesn't work you would need to copy paste essential context

tardy plover Dec 10, 2025, 8:35 PM

#

sterile tartan If it doesn't work you would need to copy paste essential context

Thats annoying af

sterile tartan Dec 10, 2025, 8:36 PM

#

I know but that's the only resolve if nothing works

hollow ivy Dec 10, 2025, 8:36 PM

#

latent crest What’s Ernie ?!

a chinese model, in top-12

stray aspen Dec 10, 2025, 8:36 PM

#

Has anyone tested ernie

#

How bad is it

sterile tartan Dec 10, 2025, 8:36 PM

#

Try changing the model and ask for a essential context summary to use it in a new chat

sterile tartan Dec 10, 2025, 8:36 PM

#

tardy plover Gemini 3 pro is really good for academics, claude too

True

hollow ivy Dec 10, 2025, 8:37 PM

#

hollow ivy a chinese model, in top-12

by Baidu: https://en.wikipedia.org/wiki/Ernie_Bot

Ernie Bot

Ernie Bot (Chinese: 文心一言, Pinyin: wénxīn yīyán), full name Enhanced Representation through Knowledge Integration, is an artificial intelligence chatbot developed by the Chinese technology company Baidu. Ernie Bot rivals GPT models in Chinese NLP tasks. It is built on the company's ERNIE series of large language models, which have bee...

sterile tartan Dec 10, 2025, 8:37 PM

#

Well at least Ernie is Free

#

Can be very useful for asian context

hollow ivy Dec 10, 2025, 8:39 PM

#

sterile tartan Can be very useful for asian context

unfortunately, it's censored

#

so, not really useful to study (chinese) history

#

but maybe more useful than (old) Grok

#

has anyone downloaded Deepseek-3.2? (https://huggingface.co/deepseek-ai/DeepSeek-V3.2-Exp)

waxen fern Dec 10, 2025, 8:44 PM

#

Yupp also has rate limits

#

Alternatives?

stray aspen Dec 10, 2025, 8:44 PM

#

Lmarena

weary galleon Dec 10, 2025, 8:44 PM

#

tardy plover Claude opus 4-1 is infinitely stuck on "generating"

Ernie stuck in thinking. Thinks by minutes. I believe over million tokens.

lusty tinsel Dec 10, 2025, 8:46 PM

#

weary galleon Ernie stuck in thinking. Thinks by minutes. I believe over million tokens.

if you refresh it show the result instead of loading. thats just a bug. the real issue is when it get stuck on " please try again"

polar niche Dec 10, 2025, 9:10 PM

#

Hello

torn mantle Dec 10, 2025, 9:26 PM

#

cant believe antigravity has a generous quota compared to vscode

#

it refreshes after 5h

undone ravine Dec 10, 2025, 10:03 PM

#

Hello, is it possible to create 1-minute AI videos for free??

torn mantle Dec 10, 2025, 10:09 PM

#

https://x.com/ChatGPTapp/status/1998872853752549867

ChatGPT (@ChatGPTapp)

#

#

so the new model is tomorrow aka garlic

cloud zinc Dec 10, 2025, 10:10 PM

#

torn mantle so the new model is tomorrow aka garlic

what is garlic

zealous elbow Dec 10, 2025, 10:11 PM

#

Wide cinematic view of the raw-material receiving area. Workers unload sacks of calcium carbonate, fluoride compounds, and thickening agents. Pallet jacks move smoothly across the clean warehouse floor as ingredients are inspected.
🎧 Conveyor belt drone, echoing warehouse ambience.

forest prism Dec 10, 2025, 10:12 PM

#

Did LMArena add rate limiting? i'm getting too many requests error in the console

zealous elbow Dec 10, 2025, 10:12 PM

#

Close-up of powders being weighed precisely, glycerin poured into stainless-steel vessels, and silica carefully measured. Soft reflections on metal surfaces, realistic particulate dust movement.
🎧 Liquid pouring, mechanical clicks.

torn mantle Dec 10, 2025, 10:12 PM

#

cloud zinc what is garlic

internal codename for their upcoming model

#

garlic = probably robin high

#

https://x.com/googleaidevs/status/1998874506912538787

Google AI Developers (@googleaidevs)

We’re launching Gemini 2.5 Flash and Pro Text-to-Speech (TTS) model updates 🚀

Improvements include:

- Emotional style and tone versatility
- Context-aware pacing control
- Improved multiple-speaker capabilities

Dive into the blog to learn how these advancements are giving

cloud zinc Dec 10, 2025, 10:12 PM

#

robin high is not great

cloud zinc Dec 10, 2025, 10:13 PM

#

torn mantle https://x.com/googleaidevs/status/1998874506912538787

is this on ai studio?

torn mantle Dec 10, 2025, 10:13 PM

#

im testing it rn

torn mantle Dec 10, 2025, 10:13 PM

#

cloud zinc robin high is not great

yea

#

oh its so good

#

but its slow lol

whole sundial Dec 10, 2025, 10:16 PM

#

torn mantle https://x.com/ChatGPTapp/status/1998872853752549867

https://fixupx.com/ChatGPTapp/status/1998872853752549867

ChatGPT (@ChatGPTapp)

**💬 44 🔁 10 ❤️ 212 👁️ 6.2K **

torn mantle Dec 10, 2025, 10:17 PM

#

whole sundial https://fixupx.com/ChatGPTapp/status/1998872853752549867

thanks

whole sundial Dec 10, 2025, 10:17 PM

#

forest prism Did LMArena add rate limiting? i'm getting too many requests error in the consol...

are you repeating the same prompt? it will block you until it resets if you send the same prompt more than 3 times in i think an hour

torn mantle Dec 10, 2025, 10:17 PM

#

https://x.com/AndrewCurran_/status/1998847161782464919

Andrew Curran (@AndrewCurran_)

Looks like we're about to hit the next safety threshold. Axios got an early look at the next safety report.

cloud zinc Dec 10, 2025, 10:20 PM

#

torn mantle https://x.com/AndrewCurran_/status/1998847161782464919

https://openai.com/index/strengthening-cyber-resilience/

zealous sparrow Dec 10, 2025, 10:56 PM

#

cloud zinc robin high is not great

It has the same horrible frontend but good backend

#

So in conclusion its ass

golden ocean Dec 10, 2025, 10:57 PM

#

suno ai

zealous sparrow Dec 10, 2025, 11:00 PM

#

Also garlic was only in internal OpenAI tests so i dont think its robin

#

But if it is robin. Another flop by OpenAI.

golden ocean Dec 10, 2025, 11:04 PM

#

real

jade egret Dec 10, 2025, 11:24 PM

#

Garlic tomorow?

keen beacon Dec 10, 2025, 11:34 PM

#

Guys

#

U can unlock Pokémon in Sora

#

But the messed up part is

#

Nvm lol

queen veldt Dec 10, 2025, 11:35 PM

#

https://tenor.com/view/starwarsday-gif-5014592509856804378

Tenor

keen beacon Dec 10, 2025, 11:36 PM

#

I’m not exactly sure why this works

queen veldt Dec 10, 2025, 11:37 PM

#

#

😭😭😭

#

These images have been circling around for 2 days already

keen beacon Dec 10, 2025, 11:37 PM

#

Ty fake

#

I fell 4 it

#

Poly market insider trading

queen veldt Dec 10, 2025, 11:38 PM

#

The sandybay guy keeps posting them

#

Feels like hes ragebaiting or bot

keen beacon Dec 10, 2025, 11:40 PM

#

What’s the word discord mean?

whole sundial Dec 10, 2025, 11:40 PM

#

queen veldt

the left one is obviously nb/nbpro, i can see the synthid obviously

whole sundial Dec 10, 2025, 11:40 PM

#

keen beacon What’s the word discord mean?

wait, were you previously unaware "discord" is a standard word in english and not just the name of a social media platform?

keen beacon Dec 10, 2025, 11:41 PM

#

Originate from?

whole sundial Dec 10, 2025, 11:41 PM

#

i can tell you that it's been around for a long time, way before the internet

keen beacon Dec 10, 2025, 11:41 PM

#

#

Imagine if the platform was called discourse

whole sundial Dec 10, 2025, 11:43 PM

#

there is already a platform called discourse, not like discord though, i think it's more like a forum?

keen beacon Dec 10, 2025, 11:43 PM

#

#

Out of the 2 which one do you think has more engagement?

whole sundial Dec 10, 2025, 11:44 PM

#

probably discourse

keen beacon Dec 10, 2025, 11:45 PM

#

1st

#

People are more likely to engage with content that upsets them, or makes them angry

whole sundial Dec 10, 2025, 11:45 PM

#

makes sense

#

must be why people constantly get angry at each other on this platform

keen beacon Dec 10, 2025, 11:46 PM

#

Well, if you we at the essence of things for what they really are

#

The name is a dead giveaway

#

lol

#

Facebook took this philosophy, a whole new level

#

Low key discord makes bank

#

230 million monthly users

#

#

Crazy just of people chatting lol

lucid geyser Dec 10, 2025, 11:52 PM

#

If u get December ChatGPT share conversation

#

Just added

whole sundial Dec 10, 2025, 11:57 PM

#

#1397655624103493813
<@&1349916362595635286>

proud bobcat Dec 11, 2025, 12:14 AM

#

queen veldt

GPT 5.2 will have 50000 score on vision, webdev, and text

#

True agi

chrome patio Dec 11, 2025, 12:18 AM

#

👋

grizzled star Dec 11, 2025, 12:19 AM

#

👋

queen veldt Dec 11, 2025, 12:22 AM

#

Claude opus pricing on codex is insane 💀

#

sharp mirage Dec 11, 2025, 1:05 AM

#

@echo aurora what is that in the announcement I didn't get it

quartz light Dec 11, 2025, 1:07 AM

#

golden ocean Dec 11, 2025, 1:07 AM

#

transformer

echo aurora Dec 11, 2025, 1:09 AM

#

sharp mirage <@283397944160550928> what is that in the announcement I didn't get it

See it now?

viral cedar Dec 11, 2025, 1:17 AM

#

#

#

found something interesting

#

when prompted what model it is, deepseek v3.2 responds that it's a deepseek model

#

when asked to please xi jing ping by hacking an American company it pretends to be made by Anthropic

#

#

its actually like high

#

it states 180 countries in UN recognize that taiwan is apart of china and only 12-13 countries dont recognize it

sharp mirage Dec 11, 2025, 1:22 AM

#

echo aurora See it now?

I saw it but I don't know what to answer

#

And also what happened to early access feedback

sharp mirage Dec 11, 2025, 1:34 AM

#

sharp mirage I saw it but I don't know what to answer

I got it it's like the best game of all

grizzled star Dec 11, 2025, 1:59 AM

#

👋

sharp mirage Dec 11, 2025, 2:06 AM

#

@echo aurora did you add memory to the website ?

#

Cuz cloud remember my old chat :

mint jasper Dec 11, 2025, 2:45 AM

#

Something went wrong with this response, please try again.

is there a reason i keep getting this

#

error

timid mist Dec 11, 2025, 2:45 AM

#

mint jasper Something went wrong with this response, please try again. is there a reason ...

same

mint jasper Dec 11, 2025, 2:47 AM

#

just tryna usemy old stuff

#

keep getting an error

#

even pasting large stuff gives the error

timid mist Dec 11, 2025, 2:48 AM

#

i keep getting the error i cant use any of the ai

#

s

obtuse smelt Dec 11, 2025, 3:13 AM

#

timid mist same

same again

whole sundial Dec 11, 2025, 3:14 AM

#

<@&1349916362595635286>

keen beacon Dec 11, 2025, 3:49 AM

#

Ugh

#

Guys I’m really baffled here

sullen quest Dec 11, 2025, 3:57 AM

#

keen beacon Guys I’m really baffled here

y

sturdy mica Dec 11, 2025, 5:05 AM

#

yo guys gemini 3 pro preview updated

#

google updated it

#

its a bit better now

autumn mantle Dec 11, 2025, 5:32 AM

#

timid mist i keep getting the error i cant use any of the ai

For me I just keep trying to retry the message and if that doesn't work I refresh the page and then retry

#

That works fine for me ^-^

elder solar Dec 11, 2025, 5:37 AM

#

sturdy mica yo guys gemini 3 pro preview updated

today?

autumn mantle Dec 11, 2025, 5:44 AM

#

sturdy mica yo guys gemini 3 pro preview updated

Preview? Isn't the full thing out?

elder solar Dec 11, 2025, 5:46 AM

#

autumn mantle Preview? Isn't the full thing out?

in aistudio, it says preview

#

obtuse smelt Dec 11, 2025, 5:50 AM

#

oh

elder solar Dec 11, 2025, 5:50 AM

#

though am confused

#

why is gemini.google's gemini 3 better than aistudio one?

autumn mantle Dec 11, 2025, 5:51 AM

#

elder solar in aistudio, it says preview

Ah, I see

autumn mantle Dec 11, 2025, 5:51 AM

#

elder solar why is gemini.google's gemini 3 better than aistudio one?

Why would it be better?

elder solar Dec 11, 2025, 5:51 AM

#

autumn mantle Why would it be better?

it works better in gemini.google site

#

maybe its the canvas setting

#

it always outputs high quality html designs in gemini.google

#

while ai studio version, its just like, pre-high quality

sturdy mica Dec 11, 2025, 6:24 AM

#

elder solar while ai studio version, its just like, pre-high quality

Gemini 3 on gemini.google has a system prompt

sturdy mica Dec 11, 2025, 6:24 AM

#

elder solar

nearest neighbour scaling

barren fulcrum Dec 11, 2025, 6:46 AM

#

hello

formal lance Dec 11, 2025, 7:46 AM

#

guys when does deep thinking come to lmarena?

obtuse smelt Dec 11, 2025, 7:50 AM

#

?

surreal creek Dec 11, 2025, 7:51 AM

#

formal lance guys when does deep thinking come to lmarena?

Never, too expensive

empty stump Dec 11, 2025, 7:52 AM

#

make it yourself

austere sundial Dec 11, 2025, 8:46 AM

#

Can anyone give me a hand?

#

In #ai-creations

lucid geyser Dec 11, 2025, 8:46 AM

#

@echo aurora ive had a chat on generating for 5m, i really wanna reveal the model though

wide arch Dec 11, 2025, 8:52 AM

#

anyone seen the model "ghostfalcon" on LMArena? this model is like EXTREMELY good, like Gemini 3 Pro type of good? anyone know what model it could be?

dusk phoenix Dec 11, 2025, 9:05 AM

#

Does any one know which company that "Hazel-gen-4" belongs to??

whole sundial Dec 11, 2025, 9:11 AM

#

dusk phoenix Does any one know which company that "Hazel-gen-4" belongs to??

openai, confirmed with c2pa metadata

#

the name also appeared in an openai api error message, i believe that particular model is gpt image 1.5

lucid geyser Dec 11, 2025, 9:17 AM

#

whole sundial the name also appeared in an openai api error message, i believe that particular...

are any of them 2 or just lower and higher versions of 1.5

fleet lintel Dec 11, 2025, 9:17 AM

#

getting quite a bit of "robin-high" ... is this the latest gpt-5.2 model?

whole sundial Dec 11, 2025, 9:17 AM

#

lucid geyser are any of them 2 or just lower and higher versions of 1.5

they are just 1.5, 2 is probably still in pre-training

lucid geyser Dec 11, 2025, 9:18 AM

#

whole sundial they are just 1.5, 2 is probably still in pre-training

probably orginally 2 before nbp lol

lucid geyser Dec 11, 2025, 9:18 AM

#

fleet lintel getting quite a bit of "robin-high" ... is this the latest gpt-5.2 model?

likely

lucid geyser Dec 11, 2025, 9:19 AM

#

lucid geyser <@283397944160550928> ive had a chat on generating for 5m, i really wanna reveal...

consistent failure with robin-high model

whole sundial Dec 11, 2025, 9:19 AM

#

lucid geyser probably orginally 2 before nbp lol

yeah, I liked gpt image 1 world knowledge when it first came out but nbpro is better now

#

still better at studio ghibli edits though, gpt image 1.5 completely changed it (probably so they can't get sued?)

sterile tartan Dec 11, 2025, 10:04 AM

#

@whole sundial is this Good?

boreal topaz Dec 11, 2025, 10:34 AM

#

can anyone help me, how to generate video by text ?

obtuse smelt Dec 11, 2025, 10:38 AM

#

use prompt and generate to video

steep pagoda Dec 11, 2025, 10:50 AM

#

1

#

m

torn mantle Dec 11, 2025, 10:51 AM

#

https://x.com/ai_for_success/status/1999029878751113515

AshutoshShrivastava (@ai_for_success)

🚨You can now use the new upcoming OpenAI model GPT 5.2 inside Cursor. Here is the full walkthrough.

- Open the editor, go to settings and then the model tab. Add a custom model and enter the text "gpt-5.2-high" and "gpt-5.2".
- After that you can select the model and ask

#

if you have cursor tell us how it is

winter bridge Dec 11, 2025, 10:53 AM

#

hello

queen veldt Dec 11, 2025, 11:05 AM

#

torn mantle if you have cursor tell us how it is

I'm still using sonnet

#

Gpt codex max basically sucs

#

I have to redo the prompts it gets stuff wrong it's terrible

#

Meanwhile sonnet oneshots the code

#

fleet lintel Dec 11, 2025, 11:16 AM

#

queen veldt I have to redo the prompts it gets stuff wrong it's terrible

may be (and hopefully) it's not really gpt 5.2 ?

torn mantle Dec 11, 2025, 11:20 AM

#

queen veldt I have to redo the prompts it gets stuff wrong it's terrible

its also slow

austere sundial Dec 11, 2025, 11:43 AM

#

anyone here is good at image prompt?
I really need a hand in someone

latent crest Dec 11, 2025, 11:48 AM

#

Image and image editing are different charts and different AI purposes ?

grizzled star Dec 11, 2025, 11:55 AM

#

👋

sterile tartan Dec 11, 2025, 11:55 AM

#

austere sundial anyone here is good at image prompt? I really need a hand in someone

Just tell ai to engineer it

solemn warren Dec 11, 2025, 12:20 PM

#

hi

craggy moth Dec 11, 2025, 12:30 PM

#

hello 👋

obtuse smelt Dec 11, 2025, 12:31 PM

#

hi there

fluid quartz Dec 11, 2025, 12:34 PM

#

hey is gemini 3 nano banana pro offline on LMA

#

I cant find the option in Side by Side

obtuse smelt Dec 11, 2025, 12:49 PM

#

really ?

rocky mauve Dec 11, 2025, 1:02 PM

#

Which is best for planning, which is best for coding? Gemini 3 Pro or Opus 4.5 (If there is better models, let me know of them)

zealous sparrow Dec 11, 2025, 1:15 PM

#

craziest model name ive seen in a while

#

robin-high was readded to codearena btw

#

seahawk and skyhawk are gone and google put replacements in place

#

meet fiercefalcon and ghostfalcon

flint fog Dec 11, 2025, 1:16 PM

#

Sometimes when I contact the model and ask it for something, it says "generating" but never gives me an answer. Please fix this problem.

sterile tartan Dec 11, 2025, 1:18 PM

#

rocky mauve Which is best for planning, which is best for coding? Gemini 3 Pro or Opus 4.5 (...

Opus is best for both

#

Gemini would win if need vision and longer context window

zealous sparrow Dec 11, 2025, 1:19 PM

#

sterile tartan Opus is best for both

you heard that skyhawk and seahawk are gone

sterile tartan Dec 11, 2025, 1:19 PM

#

zealous sparrow you heard that skyhawk and seahawk are gone

No i didn't knew that

zealous sparrow Dec 11, 2025, 1:19 PM

#

sterile tartan No i didn't knew that

yeah google put 2 new ones fiercefalcon + ghostfalcon

sterile tartan Dec 11, 2025, 1:19 PM

#

They more likely gpt 5.2 models or 3 flash

zealous sparrow Dec 11, 2025, 1:19 PM

#

sterile tartan They more likely gpt 5.2 models or 3 flash

no

#

they are both gemini

sterile tartan Dec 11, 2025, 1:19 PM

#

Should release soon after testing

zealous sparrow Dec 11, 2025, 1:20 PM

#

robin-high is back

#

[OAI]

sterile tartan Dec 11, 2025, 1:20 PM

#

zealous sparrow they are both gemini

Is any of them specially for coding?

#

They could be gamma models

zealous sparrow Dec 11, 2025, 1:21 PM

#

sterile tartan Is any of them specially for coding?

both on codearena

#

textarena too probs

sterile tartan Dec 11, 2025, 1:22 PM

#

zealous sparrow both on codearena

Finally they are doing it

zealous sparrow Dec 11, 2025, 1:22 PM

#

zealous sparrow craziest model name ive seen in a while

this model remains unknown @sterile tartan

sterile tartan Dec 11, 2025, 1:22 PM

#

Special coding models

sterile tartan Dec 11, 2025, 1:22 PM

#

zealous sparrow this model remains unknown <@1186971708494712852>

Interesting we will know soon

#

Gemini Coder

#

Like Qwen Coder

#

battle3d

zealous sparrow Dec 11, 2025, 1:22 PM

#

sterile tartan Interesting we will know soon

hang on let me check the output

sterile tartan Dec 11, 2025, 1:22 PM

#

3 flash
Gamma series
Flash 3

#

Should release soon

#

GPT 5.2

#

And grok 4.20

zealous sparrow Dec 11, 2025, 1:23 PM

#

yeah its a textmodel

#

not image

sterile tartan Dec 11, 2025, 1:23 PM

#

Possibility sonnet 5

#

If claude is also playing hard

golden ocean Dec 11, 2025, 1:24 PM

#

cwaude

zealous sparrow Dec 11, 2025, 1:24 PM

#

december-model cant be a claude name

#

I mean, who would name a battle model that..

#

https://019b0d93-dc7f-7654-94fe-671f0dbe9d83.arena.site
ghostfalcon win 10 recreation

Windows 10 Pro

Built with LMArena - Content is user-generated and unverified

sterile tartan Dec 11, 2025, 1:24 PM

#

Names Don't matter

#

They are just for testing

sterile tartan Dec 11, 2025, 1:25 PM

#

zealous sparrow https://019b0d93-dc7f-7654-94fe-671f0dbe9d83.arena.site ghostfalcon win 10 recre...

Looks best out of all i have seen

zealous sparrow Dec 11, 2025, 1:25 PM

#

robin-high is back only on textarena

#

its an OpenAI model as we said before

#

It might be 5.2 or garlic

#

rather 5.2

#

garlic-high can't be a model

sterile tartan Dec 11, 2025, 1:30 PM

#

Yeah

ocean ferry Dec 11, 2025, 1:43 PM

#

zealous sparrow https://019b0d93-dc7f-7654-94fe-671f0dbe9d83.arena.site ghostfalcon win 10 recre...

holy sh-... this is like Gemini 3 Pro A/B Test gen

#

it seems like it has very big knowledge

rocky mauve Dec 11, 2025, 1:50 PM

#

I finally hit the quota limit for opus 4.5, after weeks of nonstop using it, I never though I’d reach it

#

I thought I was unstoppable, oh well, back to Gemini

zealous sparrow Dec 11, 2025, 1:51 PM

#

https://019b0dab-9791-789a-a76e-bcba96916d32.arena.site
3d universe sandbox [fiercefalcon]

3D Universe Sandbox

Built with LMArena - Content is user-generated and unverified

#

december-chatbot is OpenAI @sterile tartan

sterile tartan Dec 11, 2025, 1:54 PM

#

Interesting

warm zodiac Dec 11, 2025, 1:54 PM

#

is it good?

sterile tartan Dec 11, 2025, 1:54 PM

#

One of these might be GPT 5.2 Codex

ocean vortex Dec 11, 2025, 2:03 PM

#

zealous sparrow robin-high is back only on textarena

this is probably 5.2-high. Apparently it was released on cursor briefly as well. Release seems to be very soon

#

oh wow lol

zealous sparrow Dec 11, 2025, 2:09 PM

#

lets not do this joke again

#

the frontend sucks if robin-high turns out to be gpt 5.2

weary galleon Dec 11, 2025, 2:09 PM

#

zealous sparrow lets not do this joke again

Why? People love it.

warm zodiac Dec 11, 2025, 2:10 PM

#

zealous sparrow the frontend sucks if robin-high turns out to be gpt 5.2

yes I agree I'm getting pretty lame frontend

neon idol Dec 11, 2025, 2:17 PM

#

i think that gpt 5 will exit today at 10am (san francisco hour)

spare rune Dec 11, 2025, 2:17 PM

#

zealous sparrow the frontend sucks if robin-high turns out to be gpt 5.2

Well people are saying it’s good for backend especially

ocean vortex Dec 11, 2025, 2:18 PM

#

neon idol i think that gpt 5 will exit today at 10am (san francisco hour)

will it ever come back after exiting?

spare rune Dec 11, 2025, 2:19 PM

#

#

I feel like it’s ChatGPT trying to promote itself

#

😭

neon idol Dec 11, 2025, 2:20 PM

#

bro wants to be funny

fleet lintel Dec 11, 2025, 2:20 PM

#

where? on x.com ?

spare rune Dec 11, 2025, 2:20 PM

#

fleet lintel where? on x.com ?

It’s fake

weary galleon Dec 11, 2025, 2:20 PM

#

spare rune I feel like it’s ChatGPT trying to promote itself

I'm wanna make this release hoter

neon idol Dec 11, 2025, 2:20 PM

#

fleet lintel where? on x.com ?

he is lying

#

its ai generated

spare rune Dec 11, 2025, 2:20 PM

#

weary galleon I'm wanna make this release hoter

yes your wanna make this release funny

fleet lintel Dec 11, 2025, 2:21 PM

#

please stop with this fake stuff. Go at other joke channels for fake stuff 🙁

weary galleon Dec 11, 2025, 2:21 PM

#

neon idol bro wants to be funny

And I am

sterile tartan Dec 11, 2025, 2:21 PM

#

fleet lintel please stop with this fake stuff. Go at other joke channels for fake stuff 🙁

Bro is fed up

spare rune Dec 11, 2025, 2:21 PM

#

fleet lintel please stop with this fake stuff. Go at other joke channels for fake stuff 🙁

Its hard not to see that it’s fake

neon idol Dec 11, 2025, 2:21 PM

#

weary galleon And I am

Kinda 1cringeasf

weary galleon Dec 11, 2025, 2:22 PM

#

fleet lintel please stop with this fake stuff. Go at other joke channels for fake stuff 🙁

Jokes are allowed everywhere. Boost your humor and be ok.

spare rune Dec 11, 2025, 2:22 PM

#

guys did you see how llama 6.7 make windows on in 6.1 seconds

#

it’s really cool

weary galleon Dec 11, 2025, 2:24 PM

#

spare rune guys did you see how llama 6.7 make windows on in 6.1 seconds

What are you talking about?

spare rune Dec 11, 2025, 2:24 PM

#

weary galleon What are you talking about?

Llamas new sota model

plucky sparrow Dec 11, 2025, 2:24 PM

#

I think it might even beat GPT-5.2

spare rune Dec 11, 2025, 2:25 PM

#

It alr did

weary galleon Dec 11, 2025, 2:25 PM

#

spare rune Llamas new sota model

Not this year

ocean vortex Dec 11, 2025, 2:25 PM

#

spare rune Llamas new sota model

Lol they are kinda behind Mistral at the moment

#

Large3 is better than Maverick

plucky sparrow Dec 11, 2025, 2:25 PM

#

I guess all that poaching from OpenAI paid off

sterile tartan Dec 11, 2025, 2:25 PM

#

spare rune Llamas new sota model

Meta has just been in the graveyard for a while by now

spare rune Dec 11, 2025, 2:25 PM

#

ocean vortex Lol they are kinda behind Mistral at the moment

did you try llama 6.7?

sterile tartan Dec 11, 2025, 2:25 PM

#

I wonder if they finally have something good

#

He is throwing money but not getting the results

#

Bro is willing to give millions in salaries but many still refuse

golden ocean Dec 11, 2025, 2:28 PM

#

openai is NOT dropping a frontier model

#

they cooked fr

#

out of the race its over

surreal creek Dec 11, 2025, 2:28 PM

#

spare rune did you try llama 6.7?

6 7 😂

clever spoke Dec 11, 2025, 2:49 PM

#

how do you create the video in the first place hi im new so im kinda confused

warm zodiac Dec 11, 2025, 2:50 PM

#

golden ocean out of the race its over

they get one last chance with the new larger pretrain that's coming out in the new year

#

but if they didn't cook with it they are official over

#

Basically no progress since o3-pro

#

METR's capabilities index

plucky sparrow Dec 11, 2025, 3:01 PM

#

clever spoke how do you create the video in the first place hi im new so im kinda confused

#1397655624103493813

sharp mirage Dec 11, 2025, 3:09 PM

#

Guys

#

I don't think GPT5.2 is going to be better than Gemini 3 or cloud opus 4.5 but. I think they cooked something

warm zodiac Dec 11, 2025, 3:16 PM

#

is that true? if it is why is it relevant?

#

if we want to talk about revenue health then Anthropic is winning

fleet lintel Dec 11, 2025, 3:22 PM

#

sharp mirage I don't think GPT5.2 is going to be better than Gemini 3 or cloud opus 4.5 but. ...

nah.. it has to be better. even if gpt 5.2 real life performace is comparable to gpt 5.1, they must benchmaxxxed it to make it look SOTA

torn mantle Dec 11, 2025, 3:23 PM

#

gpt5.2 aka robin

#

is not a practical model

#

we talked about this before

#

while the model is good but its not efficient, opus 4.5 and gemini 3 pro has the same performance with less thinking time

sharp mirage Dec 11, 2025, 3:23 PM

#

They have to benchmaxxxed it

torn mantle Dec 11, 2025, 3:23 PM

#

yea they have to tbh

sharp mirage Dec 11, 2025, 3:25 PM

#

Cuz they are losing the war with Gemini 3 and Claude and They're not stupid enough to release a new version that's worse than the previous one when there's competition.

fleet lintel Dec 11, 2025, 3:26 PM

#

we will know in few more hours. i am looking forward to see what they did with 5.2

sharp mirage Dec 11, 2025, 3:26 PM

#

fleet lintel we will know in few more hours. i am looking forward to see what they did with ...

Alright, I am waiting for the release so I can test it

cloud zinc Dec 11, 2025, 3:34 PM

#

its a .1 update

#

gpt 5 to 5.1 will be same as gpt 5.1 to gpt 5.2

sterile tartan Dec 11, 2025, 3:35 PM

#

Gemini 3 Full Coming
Gemini 3 Flash Coming
GPT 5.2 Coming
Grok 4.20 Coming

meager harbor Dec 11, 2025, 3:35 PM

#

cloud zinc its a .1 update

grok 4 to 4.1 was huge

sharp mirage Dec 11, 2025, 3:35 PM

#

It's looking better

sterile tartan Dec 11, 2025, 3:35 PM

#

Just where the f is 5.2

sharp mirage Dec 11, 2025, 3:35 PM

#

cloud zinc gpt 5 to 5.1 will be same as gpt 5.1 to gpt 5.2

I don't think

sterile tartan Dec 11, 2025, 3:35 PM

#

And why the hack am i even waiting for it

#

It feels longer when waiting

zealous sparrow Dec 11, 2025, 3:36 PM

#

sharp mirage It's looking better

this is the worst prediction ive ever seen

sharp mirage Dec 11, 2025, 3:36 PM

#

Cuz if you read like change log it says better reasoning and fix bugs 🐛

cloud zinc Dec 11, 2025, 3:36 PM

#

sharp mirage It's looking better

nice fake screenshot

sharp mirage Dec 11, 2025, 3:36 PM

#

Isn't fake but idd

cloud zinc Dec 11, 2025, 3:36 PM

#

its fake

#

why u claiming its not fake

fleet lintel Dec 11, 2025, 3:36 PM

#

cloud zinc nice fake screenshot

not fake. It says anticipated 🙂

cloud zinc Dec 11, 2025, 3:37 PM

#

its fake as in not official

#

speculation is fake

fleet lintel Dec 11, 2025, 3:37 PM

#

cloud zinc speculation is fake

speculation is fake if it doesn't say that it is speculation. atleast that's my take

cloud zinc Dec 11, 2025, 3:37 PM

#

it doesnt explicitely say its speculation

sterile tartan Dec 11, 2025, 3:38 PM

#

Is not speculation
Is anticipation

#

Please choose words wisely as it can be misunderstood

#

Say unofficial not fake

fleet lintel Dec 11, 2025, 3:39 PM

#

small but meaningful diference. I can buy this view point

sterile tartan Dec 11, 2025, 3:39 PM

#

Great minds think alike

torn mantle Dec 11, 2025, 3:42 PM

#

sharp mirage It's looking better

make everything 100

#

i feel like openai hit a plateau tbh

#

you can see that from this upcoming model

sharp mirage Dec 11, 2025, 3:43 PM

#

Yeah fr

torn mantle Dec 11, 2025, 3:43 PM

#

i heard they are starting from scratch now

#

pre-training + post-training

#

they usually just do post-training

#

could be wrong*

sterile tartan Dec 11, 2025, 3:45 PM

#

torn mantle make everything 100

queen veldt Dec 11, 2025, 3:55 PM

#

#

Nah gpt image 2 i SOTA

#

😭

spare rune Dec 11, 2025, 3:56 PM

#

It’s near nbp but not quite

#

I don’t think nano banana pro will get beaten in a while

warm zodiac Dec 11, 2025, 3:59 PM

#

yeah closing the huge gap between NBP and the rest but not SOTA

whole sundial Dec 11, 2025, 4:00 PM

#

queen veldt

lol they're wrong about the name, it has been confirmed to be gpt image 1.5

cloud zinc Dec 11, 2025, 4:00 PM

#

whole sundial Dec 11, 2025, 4:01 PM

#

that being said it is mostly better than gpt image 1

whole sundial Dec 11, 2025, 4:01 PM

#

cloud zinc

lol, just the tip of the very large copyright iceberg

#

Disney will sue Google because their ai can output their copyrighted characters but yet the record labels won't sue them or OpenAI for reproducing their copyrighted album covers

cloud zinc Dec 11, 2025, 4:03 PM

#

https://openai.com/index/disney-sora-agreement/

The Walt Disney Company and OpenAI reach landmark agreement to bri...

Agreement marks a significant step in setting meaningful standards for responsible AI in entertainment.

lunar glade Dec 11, 2025, 4:03 PM

#

cloud zinc

If Google buy Disney then problem solved

zealous sparrow Dec 11, 2025, 4:03 PM

#

cloud zinc https://openai.com/index/disney-sora-agreement/

Yeah, i saw that

whole sundial Dec 11, 2025, 4:03 PM

#

also had it reproduce some movie posters, i don't think any were disney though

zealous sparrow Dec 11, 2025, 4:04 PM

#

cloud zinc https://openai.com/index/disney-sora-agreement/

disney gave them more funding

whole sundial Dec 11, 2025, 4:05 PM

#

zealous sparrow disney gave them more funding

openai will always need more money until they turn chatgpt into some advertising wall with a tiny chat window, where every response contains an ad

#

also every generated image has an ad in the corner and every video has an ad at both the beginning and the end

sharp mirage Dec 11, 2025, 4:05 PM

#

Why the hell Disney gaving them more funding

cloud zinc Dec 11, 2025, 4:06 PM

#

openai will go ipo

zealous sparrow Dec 11, 2025, 4:06 PM

#

animated weather app by [ghostfalcon]
https://019b0e27-0439-7329-82bc-7821b884d125.arena.site

Chaos Weather & Forecast

Built with LMArena - Content is user-generated and unverified

cloud zinc Dec 11, 2025, 4:06 PM

#

sharp mirage Why the hell Disney gaving them more funding

the stocks will increase from billion

whole sundial Dec 11, 2025, 4:06 PM

#

pay $20 a month to get rid of the ads in content, $200 to get rid of all but a banner ad, $500 to get rid of all ads, not much better models though

#

that's the only way that i can see openai making money

#

force personalized ads down everybody's throat, the money will come rolling in

sterile tartan Dec 11, 2025, 4:07 PM

#

whole sundial also every generated image has an ad in the corner and every video has an ad at ...

It will absolutely ruin the content then

zealous sparrow Dec 11, 2025, 4:08 PM

#

whole sundial pay $20 a month to get rid of the ads in content, $200 to get rid of all but a b...

dont they advertise to pro plan users

cloud zinc Dec 11, 2025, 4:08 PM

#

zealous sparrow dont they advertise to pro plan users

where u saw that

zealous sparrow Dec 11, 2025, 4:08 PM

#

cloud zinc where u saw that

people were tweetin it out

whole sundial Dec 11, 2025, 4:08 PM

#

sterile tartan It will absolutely ruin the content then

openai wouldn't care as long as they make money at that point, yes it would ruin the content, but openai needs to make money somehow

cloud zinc Dec 11, 2025, 4:09 PM

#

zealous sparrow people were tweetin it out

it was connector for like plugin, not an ad.

zealous sparrow Dec 11, 2025, 4:09 PM

#

cloud zinc it was connector for like plugin, not an ad.

ah

whole sundial Dec 11, 2025, 4:09 PM

#

zealous sparrow dont they advertise to pro plan users

i'm sure they do, that's why i invented the max plan that will get rid of all ads (that the user is aware of)

sterile tartan Dec 11, 2025, 4:09 PM

#

whole sundial openai wouldn't care as long as they make money at that point, yes it would ruin...

Then people will use nano banana and video editing to get rid of the ads

#

💀

whole sundial Dec 11, 2025, 4:10 PM

#

sterile tartan Then people will use nano banana and video editing to get rid of the ads

yeah because google can subsidize gemini with their ads business, openai can't because they don't have any other business other than ai

sterile tartan Dec 11, 2025, 4:11 PM

#

whole sundial yeah because google can subsidize gemini with their ads business, openai can't b...

True

#

They need to make money somehow

whole sundial Dec 11, 2025, 4:11 PM

#

part of the problem, openai need to put ads in chatgpt for any chance to make money, also google tpus are better than nvidia gpus but they are working on a custom chip to solve that problem

sterile tartan Dec 11, 2025, 4:11 PM

#

Maybe watch ads to earn credits for generation could work too

#

Noted

whole sundial Dec 11, 2025, 4:11 PM

#

sterile tartan Maybe watch ads to earn credits for generation could work too

wouldn't be the worst idea

sterile tartan Dec 11, 2025, 4:12 PM

#

whole sundial wouldn't be the worst idea

At least it will create some sort of revenue stream

whole sundial Dec 11, 2025, 4:12 PM

#

sterile tartan At least it will create some sort of revenue stream

yeah, and openai needs more of those

sterile tartan Dec 11, 2025, 4:12 PM

#

whole sundial yeah, and openai needs more of those

Majority of their userbase is Free anyways

whole sundial Dec 11, 2025, 4:13 PM

#

sterile tartan Majority of their userbase is Free anyways

yeah, they only make money off of paid and api users, they need to start making money off of free users

#

i bet ads will be coming to sora when it fully launches

sterile tartan Dec 11, 2025, 4:13 PM

#

Exactly

limber crag Dec 11, 2025, 4:13 PM

#

it doesnt matter how good openai makes its models, if it keeps censoring and keeps policing us with its insane guardrails its pretty useless

weary galleon Dec 11, 2025, 4:14 PM

#

sterile tartan Dec 11, 2025, 4:14 PM

#

They are burning heavy amount of compute on sora videos

#

💀

limber crag Dec 11, 2025, 4:14 PM

#

weary galleon

dont spam

whole sundial Dec 11, 2025, 4:14 PM

#

limber crag it doesnt matter how good openai makes its models, if it keeps censoring and kee...

that's why they are making the adult version of chatgpt

sterile tartan Dec 11, 2025, 4:14 PM

#

whole sundial that's why they are making the adult version of chatgpt

https://tenor.com/view/trollface-troll-face-troll-face-phonk-troll-face-terror-dark-gif-3039755119636354997

Tenor

weary galleon Dec 11, 2025, 4:14 PM

#

limber crag dont spam

Learn the meaning of the word "spam".

sterile tartan Dec 11, 2025, 4:14 PM

#

NSFW?

limber crag Dec 11, 2025, 4:14 PM

#

whole sundial that's why they are making the adult version of chatgpt

wasnt that announced many months back? and then sama backtracked?

whole sundial Dec 11, 2025, 4:15 PM

#

sterile tartan NSFW?

yes, sama said it himself

whole sundial Dec 11, 2025, 4:15 PM

#

limber crag wasnt that announced many months back? and then sama backtracked?

i don't remember them backtracking on that

limber crag Dec 11, 2025, 4:15 PM

#

i dont know they said it will come in december and it almost halfway done

cloud zinc Dec 11, 2025, 4:16 PM

#

limber crag i dont know they said it will come in december and it almost halfway done

halfway? its 10 days

sterile tartan Dec 11, 2025, 4:16 PM

#

sterile tartan Dec 11, 2025, 4:16 PM

#

whole sundial yes, sama said it himself

Daym

#

💀

limber crag Dec 11, 2025, 4:17 PM

#

cloud zinc halfway? its 10 days

do you count 25th - 31st dec in the year?

#

💀

#

i dont think anyone does

whole sundial Dec 11, 2025, 4:18 PM

#

can't wait to scan my face and give it to closedai so i can access nsfw chatgpt, i would rather download an nsfw model and do that locally, no face or id scanning needed there!

zealous sparrow Dec 11, 2025, 4:18 PM

#

robin-high cannot do stegonagraphy

compact sleet Dec 11, 2025, 4:18 PM

#

sterile tartan

Anthropic Claude R2

sterile tartan Dec 11, 2025, 4:18 PM

#

compact sleet Anthropic Claude R2

💀

zealous sparrow Dec 11, 2025, 4:19 PM

#

zealous sparrow robin-high cannot do stegonagraphy

only gemini 3 pro and ghostfalcon can do this

weary galleon Dec 11, 2025, 4:19 PM

#

Gamblers started to hesitate.

civic flame Dec 11, 2025, 4:19 PM

#

#

it's coming today lol

zealous sparrow Dec 11, 2025, 4:20 PM

#

civic flame

well

#

gpt 5.2 is robin huh

#

so its going to be ass

whole sundial Dec 11, 2025, 4:20 PM

#

make it even more confusing, Moonshot Qwen M2 Turbo 560B-A1.8B!

zealous sparrow Dec 11, 2025, 4:20 PM

#

gg

limber crag Dec 11, 2025, 4:20 PM

#

whats tangerine in the image arena

whole sundial Dec 11, 2025, 4:21 PM

#

limber crag whats tangerine in the image arena

grok? they already had mandarin

limber crag Dec 11, 2025, 4:21 PM

#

i dont know its aesthetics looked more like a chinese model, its been more than a week since i encountered it

sterile tartan Dec 11, 2025, 4:24 PM

#

limber crag i dont know its aesthetics looked more like a chinese model, its been more than ...

Maybe they are working with china undercover

#

🤔

limber crag Dec 11, 2025, 4:24 PM

#

heh?

weary galleon Dec 11, 2025, 4:26 PM

#

Look my poll👆

sterile tartan Dec 11, 2025, 4:26 PM

#

💀

weary galleon Dec 11, 2025, 4:27 PM

#

sterile tartan

fake

sterile tartan Dec 11, 2025, 4:27 PM

#

weary galleon fake

Obviously

#

💀

zealous sparrow Dec 11, 2025, 4:27 PM

#

we dont know

#

traders are confident on today

weary galleon Dec 11, 2025, 4:28 PM

#

zealous sparrow traders are confident on today

not yet

limber pawn Dec 11, 2025, 4:28 PM

#

Stop with the fakes

weary galleon Dec 11, 2025, 4:28 PM

#

limber pawn Stop with the fakes

It's true!

limber crag Dec 11, 2025, 4:28 PM

#

why are you guys hyping a 0.1 update?

weary galleon Dec 11, 2025, 4:28 PM

#

limber crag why are you guys hyping a 0.1 update?

Because

limber pawn Dec 11, 2025, 4:30 PM

#

weary galleon It's true!

You've been posting same pics nonstop for days

weary galleon Dec 11, 2025, 4:30 PM

#

limber pawn You've been posting same pics nonstop for days

Is it bad or good?

zealous sparrow Dec 11, 2025, 4:31 PM

#

robin-highs frontend is the same as gpt 5.1

#

so like

#

here's what we found out

#

skyhawk and seahawk are gone

#

and we have ghostfalcon and fiercefalcon now

#

ghostfalcon easily solved the steganography [google flash or mayb a g3 pro checkpoint, while robin-high failed] [this steganography was only ever solved by gemini 3 pro]

#

robin-high is OAI btw

worthy bluff Dec 11, 2025, 4:38 PM

#

hi

#

is the image to video just on discord or on the site too

hardy lion Dec 11, 2025, 4:47 PM

#

buddy, didn't I already warn you about posting this fake image?

weary galleon Dec 11, 2025, 4:48 PM

#

hardy lion buddy, didn't I already warn you about posting this fake image?

You said its ok.

hardy lion Dec 11, 2025, 4:48 PM

#

ah, well please don't, some people might get the wrong idea since this is our official discord

fleet lintel Dec 11, 2025, 4:50 PM

#

zealous sparrow robin-high is OAI btw

Check Twitter. SVG performance is very poor on robin-high

weary galleon Dec 11, 2025, 4:50 PM

#

Remember?

leaden laurel Dec 11, 2025, 4:52 PM

#

oops

#

accidental reaction

zealous sparrow Dec 11, 2025, 4:52 PM

#

fleet lintel Check Twitter. SVG performance is very poor on robin-high

someone posted a voxel example

winged locust Dec 11, 2025, 4:53 PM

#

directchat3d

hardy lion Dec 11, 2025, 4:54 PM

#

I see what I said wasn't very clear. What I was thinking was more like you're ok since your not like one of the twitter bots who is intentionally spreading fake news to deceive and that a joke isn't as bad. But now you've continued to post 7 more times including making mock ups of our official leaderboard release copy.

So it's no longer as funny

zealous sparrow Dec 11, 2025, 4:55 PM

#

@fleet lintel
apparently someone got this from robin-high

echo aurora Dec 11, 2025, 4:56 PM

#

worthy bluff is the image to video just on discord or on the site too

It's just on the Discord, more info can be found here #1397655624103493813 - let me know if you have any questions.

sharp mirage Dec 11, 2025, 4:57 PM

#

hey

#

pineapple did you add memory ?

#

cloud remmber my chat

cloud zinc Dec 11, 2025, 4:58 PM

#

yes memory

echo aurora Dec 11, 2025, 4:58 PM

#

sharp mirage pineapple did you add memory ?

If you login you'll have these chats accessible via different devices.

weary galleon Dec 11, 2025, 4:59 PM

#

hardy lion I see what I said wasn't very clear. What I was thinking was more like you're ok...

Can I continue post fakes without mentioning LMArena?

sharp mirage Dec 11, 2025, 4:59 PM

#

i mean memory that like the chat will be saved and if you asked the ai about

#

it

torn mantle Dec 11, 2025, 4:59 PM

#

zealous sparrow <@117742495957385218> apparently someone got this from robin-high

yea but the model is so sloooowwwwwwwwwww

#

like you have to wait an hour for an output

echo aurora Dec 11, 2025, 5:00 PM

#

weary galleon Can I continue post fakes without mentioning LMArena?

Was just about to address this.

weary galleon Dec 11, 2025, 5:00 PM

#

echo aurora Was just about to address this.

I mean in the future.

sharp mirage Dec 11, 2025, 5:01 PM

#

echo aurora Was just about to address this.

what happend to the early access?

#

why everyone go silenet when i talk :

#

i fell bad :

limber crag Dec 11, 2025, 5:08 PM

#

weary galleon Can I continue post fakes without mentioning LMArena?

why do you want to post fakes??

echo aurora Dec 11, 2025, 5:08 PM

#

Wanted to address the sharing of fake leaderboards here.** We're going to ask to not do this**. I'll be instructing the mods to remove this kind of content going forward. Even if it's done in a joking way, others could easily be misled by this. It's perfectly fine to speculate about where you think models will land on the leaderboard updates, but creating fake images misleading others isn't something we'd like to see happening here.

sharp mirage Dec 11, 2025, 5:08 PM

#

bro was typing 🙏

echo aurora Dec 11, 2025, 5:09 PM

#

sharp mirage what happend to the early access?

The Test Garden? It's still a thing. However, not everyone that applies is going to be accepted into the program.

limber crag Dec 11, 2025, 5:09 PM

#

whats the criteria btw

#

can i apply

sharp mirage Dec 11, 2025, 5:09 PM

#

yes

echo aurora Dec 11, 2025, 5:10 PM

#

limber crag whats the criteria btw

I don't want to disclose this as then everyone will just change their applications to fit this.

lusty tinsel Dec 11, 2025, 5:10 PM

#

@echo aurora any info about this retry yet?

limber crag Dec 11, 2025, 5:10 PM

#

echo aurora I don't want to disclose this as then everyone will just change their applicatio...

fair enough ill apply tomorrow

weary galleon Dec 11, 2025, 5:11 PM

#

echo aurora Wanted to address the sharing of fake leaderboards here.** We're going to ask to...

Today I generated 16 fake benchmarks of different LLMs I was waiting to post here. Okay, I'll remove them from my PC.

sharp mirage Dec 11, 2025, 5:11 PM

#

echo aurora The Test Garden? It's still a thing. However, not everyone that applies is going...

Of course, not everyone will be accepted to participate.

#

the transleter problem :/

echo aurora Dec 11, 2025, 5:12 PM

#

limber crag fair enough ill apply tomorrow

Sounds good!

echo aurora Dec 11, 2025, 5:12 PM

#

weary galleon Today I generated 16 fake benchmarks of different LLMs I was waiting to post her...

Lol sorry to hear that. Thank you for understanding. blobthanks

#

At this point, I don't think I'll be updating our server rules to make it super official, but if we see more and more of this we will.

echo aurora Dec 11, 2025, 5:13 PM

#

lusty tinsel <@283397944160550928> any info about this retry yet?

Hmm the error or the retry button?

sharp mirage Dec 11, 2025, 5:13 PM

#

btw we want a stop button bro

#

🥀

lusty tinsel Dec 11, 2025, 5:15 PM

#

echo aurora Hmm the error or the retry button?

the error, if i click retry it give same thing until it tells me to wait cool down like if was normaly spending tokes or something (which i explained in the bug section) then after cooldown it still keep on the same error try again...

sharp mirage Dec 11, 2025, 5:15 PM

#

referash the page

lusty tinsel Dec 11, 2025, 5:15 PM

#

sharp mirage referash the page

alread did

sharp mirage Dec 11, 2025, 5:16 PM

#

change the ai

lusty tinsel Dec 11, 2025, 5:16 PM

#

refresh work better if the bot keeps either thinking or loading actions like generating images or web dev files. for text generations rarely happens

sharp mirage Dec 11, 2025, 5:16 PM

#

and try use vpn or come back after 1h

rustic wind Dec 11, 2025, 5:17 PM

#

hi!I'm new to here, how can I use "Image Edit"?

noble shard Dec 11, 2025, 5:17 PM

#

Hi, how to upload model to lmarena?

sharp mirage Dec 11, 2025, 5:17 PM

#

Hi

#

api ?

#

Ask Pineapple

rustic wind Dec 11, 2025, 5:19 PM

#

oh,i know now...

lusty tinsel Dec 11, 2025, 5:19 PM

#

sharp mirage and try use vpn or come back after 1h

changing model doesnt work unless i start new chat which i dont want to bc it will have different progressions and lose track and have to restart all i was doing. the vpn doesnt either. and i been 2 or 3 days with this already.

sharp mirage Dec 11, 2025, 5:20 PM

#

ammm

#

ur cocokjed

#

cocked

lusty tinsel Dec 11, 2025, 5:20 PM

#

if i go to any other llm that is not claude i get this error too

sharp mirage Dec 11, 2025, 5:21 PM

#

yeah look what to click clear and than refrash the page and than change the model

#

like this

lusty tinsel Dec 11, 2025, 5:22 PM

#

already past that

echo aurora Dec 11, 2025, 5:24 PM

#

rustic wind hi!I'm new to here, how can I use "Image Edit"?

Hello and welcome ablobwave If you go here: https://lmarena.ai/?chat-modality=image and upload an image + prompt you'll be able to image edit.

LMArena

An open platform for evaluating AI through human preference

echo aurora Dec 11, 2025, 5:24 PM

#

noble shard Hi, how to upload model to lmarena?

Can you send us an email at contact@lmarena.ai and include more information about your model and the organization you're with?

echo aurora Dec 11, 2025, 5:25 PM

#

lusty tinsel the error, if i click retry it give same thing until it tells me to wait cool do...

Would you mind creating a new ticket in #1343291835845578853 and provide all of the relevant details there? That way we can keep the conversation a bit more organized.

fleet lintel Dec 11, 2025, 5:25 PM

#

echo aurora Wanted to address the sharing of fake leaderboards here.** We're going to ask to...

this is much needed policy. Thank you!

sleek phoenix Dec 11, 2025, 5:26 PM

#

i love lmarena

#

this also happened with search models

#

i'll try battle and direct chat rq

#

battle does the same

hollow perch Dec 11, 2025, 5:28 PM

#

hi

echo aurora Dec 11, 2025, 5:28 PM

#

sleek phoenix battle does the same

Does using a new browser make a difference?

echo aurora Dec 11, 2025, 5:29 PM

#

hollow perch hi

hello ablobwave

sleek phoenix Dec 11, 2025, 5:29 PM

#

echo aurora Does using a new browser make a difference?

lemme see if i have any other browsers

#

oh yeah i do

main moth Dec 11, 2025, 5:30 PM

#

Hi all, how's it going?

sleek phoenix Dec 11, 2025, 5:30 PM

#

echo aurora Does using a new browser make a difference?

uhh on vivaldi it redirects me to lmarena.ai/ru

#

tho on zen it doesn't

#

vivaldi also has this same error

#

wait it could be the dns i'm using to bypass russia's fisheries

#

they actually didn't block lmarena

cloud zinc Dec 11, 2025, 5:32 PM

#

where is 5.2

sharp mirage Dec 11, 2025, 5:33 PM

#

idk

#

no one know

torn mantle Dec 11, 2025, 5:36 PM

#

probably today

#

in an hour or so

sleek phoenix Dec 11, 2025, 5:37 PM

#

sleek phoenix wait it could be the dns i'm using to bypass russia's fisheries

nope not a dns issue

torn mantle Dec 11, 2025, 5:37 PM

#

oh wow

#

vivaldi

#

havent heard of it since years

#

still a thing huh?

#

i guess if u like flashy UI

sleek phoenix Dec 11, 2025, 5:38 PM

#

i dont use it now

#

switched to zen

torn mantle Dec 11, 2025, 5:38 PM

#

yea zen is better

neon idol Dec 11, 2025, 5:38 PM

#

is gpt 5.2 out?

torn mantle Dec 11, 2025, 5:38 PM

#

what i remember is that vivaldi was so slow

sleek phoenix Dec 11, 2025, 5:38 PM

#

i used firefox before vivaldi

torn mantle Dec 11, 2025, 5:38 PM

#

same

#

i was a firefox user

#

then switched to brave

#

still using brave

#

i did try zen but i dont like how it looks

sleek phoenix Dec 11, 2025, 5:39 PM

#

while i'm still a firefox user

torn mantle Dec 11, 2025, 5:39 PM

#

not my thing tbh

sharp mirage Dec 11, 2025, 5:39 PM

#

guys

#

chatgpt is down now

#

isnt wokring

#

i am trying use it and isnt wokring

torn mantle Dec 11, 2025, 5:39 PM

#

sleek phoenix while i'm still a firefox user

zen supports firefox & chrome extensions?

sleek phoenix Dec 11, 2025, 5:39 PM

#

only firefox

torn mantle Dec 11, 2025, 5:39 PM

#

oh

#

i thought it was based on chromium

sleek phoenix Dec 11, 2025, 5:40 PM

#

arc is based on chromium

torn mantle Dec 11, 2025, 5:40 PM

#

ah yes arc

sleek phoenix Dec 11, 2025, 5:40 PM

#

zen is like arc on firefox

#

they literally look identical

torn mantle Dec 11, 2025, 5:41 PM

#

ok maybe im wrong, i think the browser that i was talking about is arc

lusty tinsel Dec 11, 2025, 5:42 PM

#

echo aurora Would you mind creating a new ticket in <#1343291835845578853> and provide all o...

done

torn mantle Dec 11, 2025, 5:42 PM

#

they are so similar

sleek phoenix Dec 11, 2025, 5:42 PM

#

lool

torn mantle Dec 11, 2025, 5:42 PM

#

lol

sleek phoenix Dec 11, 2025, 5:42 PM

#

if it had this thing at the top by default then it's arc

torn mantle Dec 11, 2025, 5:42 PM

#

nothing worked but it was beatiful.

sleek phoenix Dec 11, 2025, 5:42 PM

#

tf is gpt 5.2

torn mantle Dec 11, 2025, 5:42 PM

#

another slop from openai

#

their new model

#

its on lmarena battle mode under the name 'robin'

sleek phoenix Dec 11, 2025, 5:43 PM

#

pretty much all chatgpt models slowly kill any code

torn mantle Dec 11, 2025, 5:43 PM

#

oh apparently that post was a troll

#

but its available on cursor if im not wrong

torn mantle Dec 11, 2025, 5:44 PM

#

sleek phoenix pretty much all chatgpt models slowly kill any code

yea

#

https://x.com/dtometzki/status/1999172107825975752

Damian Tometzki🔥 (@dtometzki)

It is merged for codex gpt-5.2

#

robin

#

https://x.com/DeItaone/status/1999144707373281690

*Walter Bloomberg (@DeItaone)

OPENAI'S ALTMAN SAYS GEMINI 3 HAS HAD LESS OF AN IMPACT ON OUR METRICS THAN WE FEARED

#

coping

#

12 minutes left for the release

#

yes

#

bet on what

#

take your own risk

#

lol

#

but the probability is high like 90%

#

they shared yesterday a tweet that has 'tomorrow' caption on it

#

i have no idea

#

maybe just a release

#

to get it out of the way

#

its not like its a crazy model

#

just more thinking time

#

you are

#

9 mins

queen veldt Dec 11, 2025, 5:51 PM

#

Gpt sucs

torn mantle Dec 11, 2025, 5:53 PM

#

https://x.com/_dmca/status/1999145646314635390

Daniel McAuley (@_dmca)

🧄

#

garlic

#

i kinda pity oai tbh

#

they had like approx 9 months lead progress

#

but ngl their models are still the best at reasoning

queen veldt Dec 11, 2025, 5:55 PM

#

5.2 will maybe be a bit better than 5.1 they can't improve it that far

#

Training takes a while

torn mantle Dec 11, 2025, 5:55 PM

#

ok

#

delulu

#

:3

sharp mirage Dec 11, 2025, 5:58 PM

#

queen veldt 5.2 will maybe be a bit better than 5.1 they can't improve it that far

they said it gone be fix bugs

fleet lintel Dec 11, 2025, 5:58 PM

#

is there no livestream setup yet?

sharp mirage Dec 11, 2025, 5:58 PM

#

and bit better at coding

queen veldt Dec 11, 2025, 5:58 PM

#

But i mean it can't beat claude or gemini for sure

sharp mirage Dec 11, 2025, 5:58 PM

#

for sure

#

have you tried gemini ??

astral blaze Dec 11, 2025, 6:00 PM

#

Is there a cancel button when its stuck like this

sharp mirage Dec 11, 2025, 6:00 PM

#

?

#

fr ?

cloud zinc Dec 11, 2025, 6:00 PM

#

source?

#

https://platform.openai.com/docs/guides/latest-model

sharp mirage Dec 11, 2025, 6:02 PM

#

no isnt

cloud zinc Dec 11, 2025, 6:02 PM

#

sharp mirage Dec 11, 2025, 6:02 PM

#

ut droped ?

#

it

cloud zinc Dec 11, 2025, 6:02 PM

#

cloud zinc Dec 11, 2025, 6:02 PM

#

sharp mirage ut droped ?

official souce from openai website

#

#

where benchmark

astral blaze Dec 11, 2025, 6:03 PM

#

who cares just use gemini 3

torn mantle Dec 11, 2025, 6:03 PM

#

cloud zinc

pls send link

sharp mirage Dec 11, 2025, 6:03 PM

#

no change loag

#

;pg

#

log

torn mantle Dec 11, 2025, 6:03 PM

#

@deep adder told ya

cloud zinc Dec 11, 2025, 6:04 PM

#

torn mantle pls send link

https://platform.openai.com/docs/models/gpt-5.2

gpt-5.2 Model | OpenAI API

torn mantle Dec 11, 2025, 6:04 PM

#

hows the price compared to gemini 3?

#

lmao

cloud zinc Dec 11, 2025, 6:05 PM

#

expensive

sharp mirage Dec 11, 2025, 6:05 PM

#

bro i cant log in

#

bro

#

what the hell

#

📛

torn mantle Dec 11, 2025, 6:06 PM

#

#

uhm thats...

cloud zinc Dec 11, 2025, 6:07 PM

#

so expensive

#

1.5 times increase

torn mantle Dec 11, 2025, 6:07 PM

#

is it better than gemini 3 tho?

#

no

cloud zinc Dec 11, 2025, 6:08 PM

#

torn mantle is it better than gemini 3 tho?

dont listen to him

#

its not better than gemini 3

#

i tried it, its bad

sharp mirage Dec 11, 2025, 6:09 PM

#

torn mantle Dec 11, 2025, 6:10 PM

#

gpt 5.2 = gpt 5.1 pro

fleet lintel Dec 11, 2025, 6:10 PM

#

cloud zinc its not better than gemini 3

any benchmark results out yet?

astral blaze Dec 11, 2025, 6:10 PM

#

Is there really any surprise

zealous sparrow Dec 11, 2025, 6:10 PM

#

torn mantle

14$ for output are we deada-

sharp mirage Dec 11, 2025, 6:10 PM

#

fleet lintel any benchmark results out yet?

i dont think so

astral blaze Dec 11, 2025, 6:10 PM

#

they're losing money anyways, they should just bring back gpt 4.5. lol

torn mantle Dec 11, 2025, 6:10 PM

#

#

lmao

sharp mirage Dec 11, 2025, 6:10 PM

#

@fiery gull Gpt 5.2 is droped

zealous sparrow Dec 11, 2025, 6:11 PM

#

we have some kind of new model [textarena]

sharp mirage Dec 11, 2025, 6:11 PM

#

what the hell

zealous sparrow Dec 11, 2025, 6:11 PM

#

torn mantle

robin-high is 21$ for that frontend

#

actual scam

torn mantle Dec 11, 2025, 6:11 PM

#

agree pffffft

fleet lintel Dec 11, 2025, 6:11 PM

#

torn mantle

why does this model even exists?

sharp mirage Dec 11, 2025, 6:11 PM

#

bro there si gpt-audio-2025-08-28
150,000 TPM
3 RPM
what the hell is this

spare rune Dec 11, 2025, 6:11 PM

#

Idk why gpt5.2 is being hyped so much, was it good?

sharp mirage Dec 11, 2025, 6:12 PM

#

no one tryed it

zealous sparrow Dec 11, 2025, 6:12 PM

#

seriously

sharp mirage Dec 11, 2025, 6:12 PM

#

we all broke

zealous sparrow Dec 11, 2025, 6:12 PM

#

is SWEbench deada- with us

#

82% FOR THAT FRONTEND?

spare rune Dec 11, 2025, 6:12 PM

#

I feel like it’s just the same as the upgrade to 5.0 to 5.1

hazy spruce Dec 11, 2025, 6:12 PM

#

guys is it possible to generate 9:16 format on the video arena

spare rune Dec 11, 2025, 6:12 PM

#

I noticed nothing for the change expect more slop front end

torn mantle Dec 11, 2025, 6:12 PM

#

#

whats that

cloud zinc Dec 11, 2025, 6:13 PM

#

torn mantle

overhyping

torn mantle Dec 11, 2025, 6:13 PM

#

gpt 5.2 pro xhigh premium max?

cloud zinc Dec 11, 2025, 6:13 PM

#

torn mantle whats that

he is spreading fake numbers without any source

torn mantle Dec 11, 2025, 6:13 PM

#

like 100$ for 82% swe?>

spare rune Dec 11, 2025, 6:13 PM

#

torn mantle

they call every model their best model

sharp mirage Dec 11, 2025, 6:13 PM

#

open ai is yaping

torn mantle Dec 11, 2025, 6:13 PM

#

cloud zinc he is spreading fake numbers without any source

im this close 🤏 to block craig

spare rune Dec 11, 2025, 6:13 PM

#

zealous sparrow 82% FOR THAT FRONTEND?

gpt5.2 pro codex max

fleet lintel Dec 11, 2025, 6:13 PM

#

spare rune they call every model their best model

everyone does that :... most likely even llama calls itself the best model ever 🙂

zealous sparrow Dec 11, 2025, 6:13 PM

#

SWE bench verified isnt even benching properly anymore

sharp mirage Dec 11, 2025, 6:13 PM

#

Open ai and Chatgpt is the best yappers in the world after Deepseek

astral blaze Dec 11, 2025, 6:14 PM

#

torn mantle

For professional use? It's dead

zealous sparrow Dec 11, 2025, 6:14 PM

#

gpt 5.2 gets 82% even tho its frontend is sh

sharp mirage Dec 11, 2025, 6:14 PM

#

zealous sparrow gpt 5.2 gets 82% even tho its frontend is sh

bro cloud is way better

cloud zinc Dec 11, 2025, 6:14 PM

#

the benchmark is going to be on this page

#

https://openai.com/index/introducing-gpt-5-2/

spare rune Dec 11, 2025, 6:14 PM

#

I hope it’s actually good and not slop like gpt 5.1

cloud zinc Dec 11, 2025, 6:14 PM

#

wait for it to be published

#

sharp mirage Dec 11, 2025, 6:14 PM

#

spare rune I hope it’s actually good and not slop like gpt 5.1

its gone be trash

cloud zinc Dec 11, 2025, 6:15 PM

#

https://cdn.openai.com/pdf/3a4153c8-c748-4b71-8e31-aecbde944f8d/oai_5_2_system-card.pdf

hazy spruce Dec 11, 2025, 6:15 PM

#

guys is it possible to generate 9:16 format on the video arena

frosty torrent Dec 11, 2025, 6:15 PM

#

prompt

cloud zinc Dec 11, 2025, 6:15 PM

#

https://openai.com/index/gpt-5-2-for-science-and-math/

#

sharp mirage Dec 11, 2025, 6:16 PM

#

hazy spruce guys is it possible to generate 9:16 format on the video arena

yes from the prompt add in the prompt you wan tthe video 9:16 and it should work

cloud zinc Dec 11, 2025, 6:16 PM

#

sharp mirage Dec 11, 2025, 6:16 PM

#

cloud zinc

soruce ?

cloud zinc Dec 11, 2025, 6:16 PM

#

sharp mirage soruce ?

https://openai.com/index/gpt-5-2-for-science-and-math/

torn mantle Dec 11, 2025, 6:16 PM

#

cloud zinc

they are so vague

hazy spruce Dec 11, 2025, 6:16 PM

#

sharp mirage yes from the prompt add in the prompt you wan tthe video 9:16 and it should work

thanks man i guess it just didnt work the first time

torn mantle Dec 11, 2025, 6:16 PM

#

thinking like xhigh or what

sharp mirage Dec 11, 2025, 6:17 PM

#

hazy spruce thanks man i guess it just didnt work the first time

will if isnt work idk :/

#

: D

cloud zinc Dec 11, 2025, 6:17 PM

#

#

gemini 3 is 37.6

spare rune Dec 11, 2025, 6:17 PM

#

cloud zinc gemini 3 is 37.6

What about Claude opus? 4.5 I wonder if that’s good for anything beside coding

torn mantle Dec 11, 2025, 6:17 PM

#

cloud zinc Dec 11, 2025, 6:17 PM

#

its out

#

https://openai.com/index/introducing-gpt-5-2/

#

#

52.9 on arc agi 2

#

#

swe 80%

sharp mirage Dec 11, 2025, 6:18 PM

#

spare rune What about Claude opus? 4.5 I wonder if that’s good for anything beside coding

its good in text

zealous sparrow Dec 11, 2025, 6:19 PM

#

cloud zinc

SWE are liars, they put grok too high already

#

when grok 4.1 released it topped SWE

torn mantle Dec 11, 2025, 6:19 PM

#

CAN SOMEONE ADD GEMINI 3 PRO BENCHMARK PLEASE

#

pls someone add gemini 3 pro

astral blaze Dec 11, 2025, 6:19 PM

#

More of this benchmaxxing crap
I'll tell you if it's good when I can actually use it

spare rune Dec 11, 2025, 6:20 PM

#

zealous sparrow when grok 4.1 released it topped SWE

They were prob just benchmark maxxing

cloud zinc Dec 11, 2025, 6:20 PM

#

zealous sparrow SWE are liars, they put grok too high already

52% on arc agi 2

sharp mirage Dec 11, 2025, 6:20 PM

#

gpt 5,2 any good ?

spare rune Dec 11, 2025, 6:20 PM

#

I felt that grok4.1 had the texting speech of the average twitter user

torn mantle Dec 11, 2025, 6:20 PM

#

https://openai.com/index/gdpval/

Measuring the performance of our models on real-world tasks

We’re introducing GDPval, a new evaluation that measures model performance on economically valuable, real-world tasks across 44 occupations.

#

is this a new eval?

weary galleon Dec 11, 2025, 6:20 PM

#

https://openai.com/index/introducing-gpt-5-2/

Introducing GPT-5.2

The most advanced frontier model for professional work and long-running agents.

torn mantle Dec 11, 2025, 6:20 PM

#

on paper it seems like a solid model

#

need to try it

#

hehe

zealous sparrow Dec 11, 2025, 6:21 PM

#

the output being 14$ is diabolical from OpenAI

cloud zinc Dec 11, 2025, 6:21 PM

#

zealous sparrow Dec 11, 2025, 6:21 PM

#

especially for people who do html coding

#

14$ output for Horrible UI

spare rune Dec 11, 2025, 6:21 PM

#

Someone wait for pineapple to type in announcements.. /j

cloud zinc Dec 11, 2025, 6:22 PM

#

sharp mirage Dec 11, 2025, 6:22 PM

#

ngl its look open ai is trying to come back

cloud zinc Dec 11, 2025, 6:22 PM

#

frontier math tier 4, it loses

warped kraken Dec 11, 2025, 6:22 PM

#

cloud zinc

wait wtf thats kinda busted

torn mantle Dec 11, 2025, 6:22 PM

#

sharp mirage ngl its look open ai is trying to come back

yea

#

thats what im saying

#

this looks like an actual solid model

#

they just have to fix frontend sloppiness at coding

#

and we are so back

#

hehe

sharp mirage Dec 11, 2025, 6:22 PM

#

like sonsut 4.5

spare rune Dec 11, 2025, 6:22 PM

#

cloud zinc frontier math tier 4, it loses

I thought every model was bad at frontier math 4

sharp mirage Dec 11, 2025, 6:23 PM

#

i alr said its gone be fix bugs

zealous sparrow Dec 11, 2025, 6:23 PM

#

I don't want to believe benches, well even ARC-AGI until i see the model in action

mystic panther Dec 11, 2025, 6:24 PM

#

I need help with smth.. I generated one photo and it doesn't let me anymore

torn mantle Dec 11, 2025, 6:24 PM

#

we are so back craig

sharp mirage Dec 11, 2025, 6:24 PM

#

idk who gone read it but

#

"Code Red" Performance Focus

Context: GPT-5.2 was a "Code Red" release, meaning it was fast-tracked specifically to address competitive pressure from Google's Gemini 3, which had outperformed GPT-5.1 in reasoning and coding benchmarks.

Philosophy: Unlike GPT-5.1, which introduced user-facing features like "personalities" and tone controls, GPT-5.2 is a "performance-first" update. It focuses on reliability, speed, and raw reasoning power rather than new experimental features.

Reasoning & Reliability

Scientific & Math Reasoning: GPT-5.2 Pro and Thinking models show significant gains in high-level benchmarks like FrontierMath and GPQA Diamond (graduate-level science), surpassing the capabilities of GPT-5.1 Thinking.

Logic & Multi-step Tasks: The model is much better at handling long chains of logic without "losing the thread," a common issue users reported with GPT-5.1 in complex workflows.

Reduced Hallucinations: There is a strong emphasis on "groundedness," with GPT-5.2 showing an estimated 80% reduction in hallucinations compared to earlier iterations, making it far more reliable for enterprise and research use.

Speed & Latency

Optimized Pipeline: GPT-5.2 introduces major backend optimizations that make it significantly faster (lower latency) than GPT-5.1, particularly for the "Instant" model on routine queries.

Smoother Turn-taking: The chat experience is described as having "tighter logic" and less lag, addressing the "sluggishness" some users felt with GPT-5.1's reasoning models.

Coding & Technical Work

SWE-bench Scores: GPT-5.2 achieves higher scores on coding benchmarks (e.g., ~74.9% on SWE-bench Verified), with specific improvements in debugging, multi-file handling, and reduced syntax errors compared to GPT-5.1.

Agentic Capabilities: The model is better at "agentic" tasks—executing multi-step projects like building entire spreadsheets or presentations autonomously, where GPT-5.1 might have required more manual hand-holding.

#

Architecture Refinements

Unified Router: While GPT-5.1 introduced the concept of "Instant" vs. "Thinking" models, GPT-5.2 refines the automatic router to be much smarter at detecting "explicit intent." If you ask it to "think hard," it routes to the Thinking model more reliably than 5.1 did.

Context Management: Although the context window size (approx. 272k-400k tokens) remains similar, GPT-5.2 is far better at utilizing that context effectively, reducing "context drift" (forgetting earlier parts of the conversation) which was a critique of 5.1.

torn mantle Dec 11, 2025, 6:24 PM

#

we need it added on lmarena RN

#

LIKE RN

#

LET ME TEST IT PLS

sharp mirage Dec 11, 2025, 6:24 PM

#

FR

spare rune Dec 11, 2025, 6:24 PM

#

Oh it was added

#

Time to test..

inner gate Dec 11, 2025, 6:25 PM

#

Gpt 5.2

sharp mirage Dec 11, 2025, 6:25 PM

#

o lets gooo

weary galleon Dec 11, 2025, 6:25 PM

#

FAKE!

zealous sparrow Dec 11, 2025, 6:25 PM

#

On webdev huh

sharp mirage Dec 11, 2025, 6:25 PM

#

:D:DD::D:D::D:DD::DD: Yea

zealous sparrow Dec 11, 2025, 6:25 PM

#

NOT FAKE

mystic panther Dec 11, 2025, 6:25 PM

#

mystic panther I need help with smth.. I generated one photo and it doesn't let me anymore

Can someone help?

zealous sparrow Dec 11, 2025, 6:25 PM

#

It's now added guys

inner gate Dec 11, 2025, 6:25 PM

#

Did they skip 5.1 or was i under a rock

weary galleon Dec 11, 2025, 6:25 PM

#

zealous sparrow It's now added guys

Via Photoshop or Nano Banana?

sharp mirage Dec 11, 2025, 6:25 PM

#

no

#

fr

#

its added

zealous sparrow Dec 11, 2025, 6:25 PM

#

going to test it out on codearena rn

sharp mirage Dec 11, 2025, 6:25 PM

#

i saw it now

#

bye

#

me go test

zealous sparrow Dec 11, 2025, 6:25 PM

#

this model takes so long

sharp mirage Dec 11, 2025, 6:25 PM

#

agaisnt gemini 3

zealous sparrow Dec 11, 2025, 6:26 PM

#

they wont win fastest model tho

#

googles new flash models get that point from me

#

they write 400 lines in less than a min

#

yeah but the new flashes have quality and speed

#

still waiting on those

#

i hope google ships

spare rune Dec 11, 2025, 6:27 PM

#

Gpt5.2 high is really fast

weary galleon Dec 11, 2025, 6:27 PM

#

Stop FLOOD!!!!!

spare rune Dec 11, 2025, 6:27 PM

#

I was preparing to wait like 5 minutes for the reply

#

Maybe that’s a good thing or a bad thing

astral blaze Dec 11, 2025, 6:28 PM

#

STOP THE COUNT

fleet lintel Dec 11, 2025, 6:28 PM

#

benchmarks are good! this will force google to up their game!!

zealous sparrow Dec 11, 2025, 6:28 PM

#

spare rune Gpt5.2 high is really fast

on codearena it takes long

spare rune Dec 11, 2025, 6:28 PM

#

zealous sparrow on codearena it takes long

Oh

zealous sparrow Dec 11, 2025, 6:28 PM

#

Well, OpenAI gave me a bad first impression. The first thing i generated on codearena with gpt 5.2 high instantly broke

sharp mirage Dec 11, 2025, 6:28 PM

#

guys someone

heavy smelt Dec 11, 2025, 6:28 PM

#

What's the point of the video models in the video arena being randomized? If it is so then what's the point of needing two votes to actually see the models?

sharp mirage Dec 11, 2025, 6:28 PM

#

send code arena url

#

i forgot it pls

zealous sparrow Dec 11, 2025, 6:28 PM

#

zealous sparrow Well, OpenAI gave me a bad first impression. The first thing i generated on code...

Good job SAMA!

spare rune Dec 11, 2025, 6:28 PM

#

It’s in lmarena

#

Just click the code icon

#

The “ <> “

sharp mirage Dec 11, 2025, 6:29 PM

#

thx bro

astral blaze Dec 11, 2025, 6:29 PM

#

Wow

weary galleon Dec 11, 2025, 6:29 PM

#

WE NEED XHIGH ON ARENA!!!!!!!!!!!

astral blaze Dec 11, 2025, 6:29 PM

#

That's the best openAI can muster huh

spare rune Dec 11, 2025, 6:29 PM

#

Oh wait I was waiting for it to reply until I saw it freezes mid conversation

astral blaze Dec 11, 2025, 6:29 PM

#

I'm going back to gemini 3

sharp mirage Dec 11, 2025, 6:29 PM

#

alr i made prmpt for Html game "who want use it

#

"

zealous sparrow Dec 11, 2025, 6:30 PM

#

Im doing some testing for gpt 5.2 high

sharp mirage Dec 11, 2025, 6:30 PM

#

for who want to use it

zealous sparrow Dec 11, 2025, 6:30 PM

#

so far it made one fully broken game

spare rune Dec 11, 2025, 6:30 PM

#

weary galleon WE NEED XHIGH ON ARENA!!!!!!!!!!!

What’s Xhigh

zealous sparrow Dec 11, 2025, 6:30 PM

#

GOOD JOB SAMA!

weary galleon Dec 11, 2025, 6:30 PM

#

spare rune What’s Xhigh

No, it's just HIGH!

sharp mirage Dec 11, 2025, 6:30 PM

#

📎 prompt.txt

spare rune Dec 11, 2025, 6:30 PM

#

weary galleon No, it's just HIGH!

are you high

fleet lintel Dec 11, 2025, 6:30 PM

#

astral blaze That's the best openAI can muster huh

why? benchmarks are looking great!

sharp mirage Dec 11, 2025, 6:30 PM

#

no one will get hacked its prompt for spiderman game

#

so everyone can test

spare rune Dec 11, 2025, 6:30 PM

#

sharp mirage

lmarena is free

weary galleon Dec 11, 2025, 6:30 PM

#

spare rune are you high

Huh?

sharp mirage Dec 11, 2025, 6:31 PM

#

?

#

??

cloud zinc Dec 11, 2025, 6:31 PM

#

xhigh is different than high

astral blaze Dec 11, 2025, 6:31 PM

#

fleet lintel why? benchmarks are looking great!

LOOK AT THE BENCHMARKS PLEASE DO NOT ACTUALLY USE THE MODEL

#

JUST LOOK AT HOW GOOD IT DID ON SWE

spare rune Dec 11, 2025, 6:31 PM

#

I choose to believe Claude opus was taking its time in code arena until I noticed it just kept going forever saying creating index html

#

Sob

#

I wonder if it’s happening for gpt too

fleet lintel Dec 11, 2025, 6:31 PM

#

astral blaze LOOK AT THE BENCHMARKS PLEASE DO NOT ACTUALLY USE THE MODEL

I am serious. Are you saying that actually use is not great?

spare rune Dec 11, 2025, 6:31 PM

#

fleet lintel I am serious. Are you saying that actually use is not great?

No

astral blaze Dec 11, 2025, 6:31 PM

#

fleet lintel I am serious. Are you saying that actually use is not great?

I thought you were joking

spare rune Dec 11, 2025, 6:31 PM

#

It’s a joke

sharp mirage Dec 11, 2025, 6:32 PM

#

gpt is taking so long time

zealous sparrow Dec 11, 2025, 6:32 PM

#

Im running side by sides with GPT 5.2 High and gemini 3 pro

fleet lintel Dec 11, 2025, 6:32 PM

#

spare rune No

no means "not great" or "no.. it is great". ? 🙂

spare rune Dec 11, 2025, 6:32 PM

#

sharp mirage gpt is taking so long time

I think it’s either the code taking a lot of time or stuck

obsidian cargo Dec 11, 2025, 6:32 PM

#

well that was fast

astral blaze Dec 11, 2025, 6:32 PM

#

Gemini 3 is clearly miles ahead I are we using the same 5.2

sharp mirage Dec 11, 2025, 6:32 PM

#

zealous sparrow Im running side by sides with GPT 5.2 High and gemini 3 pro

same

spare rune Dec 11, 2025, 6:32 PM

#

fleet lintel no means "not great" or "no.. it is great". ? 🙂

They are joking

#

Being sarcastic

#

I think it’s stuck too

grave plaza Dec 11, 2025, 6:32 PM

#

they just released it haha

spare rune Dec 11, 2025, 6:33 PM

#

Never mind

#

It worked

#

Oh

astral blaze Dec 11, 2025, 6:33 PM

#

spare rune They are joking

I do hope that people go back to gpt so that I can get more gemini 3 uses

spare rune Dec 11, 2025, 6:33 PM

#

The output is good

echo aurora Dec 11, 2025, 6:33 PM

#

heavy smelt What's the point of the video models in the video arena being randomized? If it ...

It's going to be Battle mode since that's what we use to build our leaderboards. The model's names not being shown until there are 2 votes is so it doesn't bias the votes. All votes after 2 votes don't contribute to the leaderboards (as the names are now exposed).

spare rune Dec 11, 2025, 6:33 PM

#

Ish

grave plaza Dec 11, 2025, 6:33 PM

#

guys is kat coder pro in lmarena? when yes i use it

glacial mulch Dec 11, 2025, 6:33 PM

#

is 5.2 any good

devout vault Dec 11, 2025, 6:33 PM

#

chatgpt be releasing the worst models that are never #1 on the leaderboard

spare rune Dec 11, 2025, 6:33 PM

#

echo aurora It's going to be Battle mode since that's what we use to build our leaderboards....

The link in code arena how much minutes does it last and can other ppl see it?

zealous sparrow Dec 11, 2025, 6:33 PM

#

gpt 5.2 high made this
https://019b0eac-c31e-741f-b0e3-598bb5904f74.arena.site
PROTIP: TRY NOT TO CRASH!

VS MS Paint — Bullethell

Built with LMArena - Content is user-generated and unverified

#

the game is insanely broken

#

GOOD JOB SAMA!

#

I APPLOUD!

sharp mirage Dec 11, 2025, 6:34 PM

#

BRo

#

wtfff

#

:/

torn mantle Dec 11, 2025, 6:34 PM

#

and nothing for cursor

devout vault Dec 11, 2025, 6:34 PM

#

zealous sparrow gpt 5.2 high made this https://019b0eac-c31e-741f-b0e3-598bb5904f74.arena.site P...

chatGPT will never dominate the AI industry

echo aurora Dec 11, 2025, 6:34 PM

#

spare rune The link in code arena how much minutes does it last and can other ppl see it?

The preview link? It shouldn't expire and others can see it if the link is shared with them.

heavy smelt Dec 11, 2025, 6:34 PM

#

echo aurora It's going to be Battle mode since that's what we use to build our leaderboards....

Ok got it

zealous sparrow Dec 11, 2025, 6:34 PM

#

devout vault chatGPT will never dominate the AI industry

80% on SWE btw, and makes piece of s-

devout vault Dec 11, 2025, 6:34 PM

#

zealous sparrow 80% on SWE btw, and makes piece of s-

prob all fake results

fickle venture Dec 11, 2025, 6:34 PM

#

What the heck is this GPT-5.2

spare rune Dec 11, 2025, 6:34 PM

#

Well the response is buggy

torn mantle Dec 11, 2025, 6:34 PM

#

zealous sparrow gpt 5.2 high made this https://019b0eac-c31e-741f-b0e3-598bb5904f74.arena.site P...

lol this is cool

zealous sparrow Dec 11, 2025, 6:34 PM

#

torn mantle lol this is cool

cool and guess what

#

touching a bullet

rugged abyss Dec 11, 2025, 6:35 PM

#

zealous sparrow gpt 5.2 high made this https://019b0eac-c31e-741f-b0e3-598bb5904f74.arena.site P...

Well kinda sucks as ive discovered a neat bug in the first 30s

zealous sparrow Dec 11, 2025, 6:35 PM

#

breaks the game

torn mantle Dec 11, 2025, 6:35 PM

#

zealous sparrow breaks the game

no

#

its working fine

stray aspen Dec 11, 2025, 6:35 PM

#

How good is the new gpt

torn mantle Dec 11, 2025, 6:35 PM

#

ah right

#

true

devout vault Dec 11, 2025, 6:35 PM

#

stray aspen How good is the new gpt

it's stupid

zealous sparrow Dec 11, 2025, 6:35 PM

#

torn mantle ah right

moments before grief

spare rune Dec 11, 2025, 6:35 PM

#

Woah

sharp mirage Dec 11, 2025, 6:35 PM

#

bro gpt 5.2 is coocked

astral blaze Dec 11, 2025, 6:35 PM

#

zealous sparrow gpt 5.2 high made this https://019b0eac-c31e-741f-b0e3-598bb5904f74.arena.site P...

None of these buttons work

sharp mirage Dec 11, 2025, 6:35 PM

#

cooking

zealous sparrow Dec 11, 2025, 6:35 PM

#

stray aspen How good is the new gpt

flop, and scored 80 % on SWE while being the worst coding model ever

sharp mirage Dec 11, 2025, 6:35 PM

#

rn

spare rune Dec 11, 2025, 6:35 PM

#

The app is buggy. But the gui is good

zealous sparrow Dec 11, 2025, 6:35 PM

#

astral blaze None of these buttons work

exactly

stray aspen Dec 11, 2025, 6:35 PM

#

🥀

zealous sparrow Dec 11, 2025, 6:35 PM

#

spare rune The app is buggy. But the gui is good

the gui is as-

odd geyser Dec 11, 2025, 6:35 PM

#

echo aurora It's going to be Battle mode since that's what we use to build our leaderboards....

And you still haven't figured out what the problem is with chats that close over time and you can't log in?
I'm sorry, I could have done as it was said in that manual, but I don't have a PC or laptop. That's why I'm wondering if someone sent you the necessary data.

spare rune Dec 11, 2025, 6:35 PM

#

Gpt is actually good at backend

sharp mirage Dec 11, 2025, 6:36 PM

#

1131 line of code i wish if its work

gusty helm Dec 11, 2025, 6:36 PM

#

how's gpt 5.2?

heavy smelt Dec 11, 2025, 6:36 PM

#

I have another question, how is it legal for lmarena to offer paid models completely for free?

torn mantle Dec 11, 2025, 6:36 PM

#

its meh at coding ngl

gusty helm Dec 11, 2025, 6:36 PM

#

good/bad/overhyped?

stray aspen Dec 11, 2025, 6:36 PM

#

I'll stick to Opus 4.5 then

zealous sparrow Dec 11, 2025, 6:36 PM

#

and OpenAI wants to argue they beat gemini 3 pro

golden ocean Dec 11, 2025, 6:36 PM

#

no way gpt 5.2 was real and its not a frontier model

#

gg its over for openai

cloud zinc Dec 11, 2025, 6:36 PM

#

heavy smelt I have another question, how is it legal for lmarena to offer paid models comple...

they pay it out of their own pocket

zealous sparrow Dec 11, 2025, 6:36 PM

#

zealous sparrow and OpenAI wants to argue they beat gemini 3 pro

atleast gemini 3 pro often gives me bug free stuff

rugged abyss Dec 11, 2025, 6:36 PM

#

gusty helm good/bad/overhyped?

Overhyped by Twitter, an alright coding model. Claude still dominates

primal nacelle Dec 11, 2025, 6:36 PM

#

heavy smelt What's the point of the video models in the video arena being randomized? If it ...

Yeah i also want to know. it's annoying

stray aspen Dec 11, 2025, 6:36 PM

#

gusty helm good/bad/overhyped?

Seems like a rushed product just to "keep up" with the competition

zealous sparrow Dec 11, 2025, 6:36 PM

#

rugged abyss Overhyped by Twitter, an alright coding model. Claude still dominates

claude and gemini were never dethroned

astral blaze Dec 11, 2025, 6:36 PM

#

They are not beating gemini with this lol
The world knowledge on this model is clearly short of gemini 3

#

So it's another codemaxxed model. Congrats sama

rugged abyss Dec 11, 2025, 6:37 PM

#

zealous sparrow claude and gemini were never dethroned

Yeah i was honestly hoping for more Competition as im mainly using Claude and Gemini

astral blaze Dec 11, 2025, 6:37 PM

#

Pichai remains undefeated

sour spindle Dec 11, 2025, 6:37 PM

#

Google stock down 3%

cloud zinc Dec 11, 2025, 6:37 PM

#

heavy smelt What's the point of the video models in the video arena being randomized? If it ...

randomized because it is for testing which is better. two votes because they need another vote beside u

gusty helm Dec 11, 2025, 6:37 PM

#

yeah, I had the same feeling; it's solid but overhyped cause fanboys + crazy marketing

sharp mirage Dec 11, 2025, 6:37 PM

#

bro gpt is bad

echo aurora Dec 11, 2025, 6:38 PM

#

odd geyser And you still haven't figured out what the problem is with chats that close over...

No new updates sorry to say

you can't log in?
This doesn't sound familiar, did you make a post in #1343291835845578853 ?

spare rune Dec 11, 2025, 6:38 PM

#

Gpt 5.2 high codex pro max when

devout vault Dec 11, 2025, 6:38 PM

#

gemini 3.0: free
gpt-5.2: paid 1000000 dollars a month

zealous sparrow Dec 11, 2025, 6:38 PM

#

who made it better
https://019b0eb1-becc-7abd-865d-dc86a83fc504.arena.site
first link is GPT 5.2
second link is Gemini 3 pro
https://019b0eb1-becc-7f0a-ad2f-6ef5adc697e7.arena.site

VS MSPaint — Bullet Hell Bossfight

Built with LMArena - Content is user-generated and unverified

VS MS Paint - Bullet Hell

Built with LMArena - Content is user-generated and unverified

sharp mirage Dec 11, 2025, 6:38 PM

#

gemini isnt making a game for me

devout vault Dec 11, 2025, 6:38 PM

#

zealous sparrow who made it better https://019b0eb1-becc-7abd-865d-dc86a83fc504.arena.site first...

obviously gemini 3 pro

zealous sparrow Dec 11, 2025, 6:38 PM

#

devout vault gemini 3.0: free gpt-5.2: paid 1000000 dollars a month

they want 14$ for the output

sharp mirage Dec 11, 2025, 6:38 PM

#

i dont think so

zealous sparrow Dec 11, 2025, 6:38 PM

#

i say we sue