#general


neat apex
#

Its because the only thing they are eating in the crisis

compact flame
neat apex
#

They already turned off the code red?

waxen fern
#

@echo aurora please remove rate limits

neat apex
#

Or its on yet?

echo aurora
compact flame
waxen fern
neat apex
#

I always do that, and nobody cares
Since it doesn't break any rules and isn't annoying enough
But don't spam pls

tardy plover
#

Claude opus 4-1 is infinitely stuck on "generating"

#

Tried reloading, relogging

sterile tartan
#

Is not stuck

#

Is done for it

narrow comet
#

how use 4.1?

neat apex
#

You who are stuck, using Opus 4.1 yet, they are already shutting it down

tardy plover
#

What do i do, i have lots of context in the chat

#

I cant do anything rn

neat apex
#

Aaah yes, that explains a lot

sterile tartan
neat apex
#

Open a notebook and start copying messages, sadly it's the only thing you can do

tardy plover
neat apex
#

But sending multiple messages in a chat does not bug it most times, they continue as normal

sterile tartan
tardy plover
#

This is so annoying

#

With same model

neat apex
#

Be grateful you are not paying anything or stuck with a 10-message context like yupp

tardy plover
#

Sometimes shi like this happens and then resolves itself automatically

sterile tartan
#

Bro roasted it

sterile tartan
#

It might resolve by itself

#

What do you chat anyways

#

🤔

neat apex
#

I recommend copying your whole context anyway, since you can put that in a notebook and do some things with it

neat apex
#

NotebookLM too

tardy plover
sterile tartan
tardy plover
#

Cant ask anything cuz its stuck on generating

sterile tartan
latent crest
#

What's Ernie?!

tardy plover
tardy plover
sterile tartan
sterile tartan
#

I know but that's the only resolve if nothing works

hollow ivy
stray aspen
#

Has anyone tested ernie

#

How bad is it

sterile tartan
#

Try changing the model and ask for an essential context summary to use in a new chat

hollow ivy
# hollow ivy a chinese model, in top-12

Ernie Bot (Chinese: 文心一言, Pinyin: wénxīn yīyán), full name Enhanced Representation through Knowledge Integration, is an artificial intelligence chatbot developed by the Chinese technology company Baidu. Ernie Bot rivals GPT models in Chinese NLP tasks. It is built on the company's ERNIE series of large language models, which have bee...

sterile tartan
#

Well at least Ernie is Free

#

Can be very useful for asian context

hollow ivy
#

so, not really useful to study (chinese) history

#

but maybe more useful than (old) Grok

waxen fern
#

Yupp also has rate limits

#

Alternatives?

stray aspen
#

Lmarena

weary galleon
lusty tinsel
polar niche
#

Hello

torn mantle
#

cant believe antigravity has a generous quota compared to vscode

#
  • it refreshes after 5h
undone ravine
#

Hello, is it possible to create 1-minute AI videos for free??

torn mantle
#

so the new model is tomorrow aka garlic

cloud zinc
zealous elbow
#

Wide cinematic view of the raw-material receiving area. Workers unload sacks of calcium carbonate, fluoride compounds, and thickening agents. Pallet jacks move smoothly across the clean warehouse floor as ingredients are inspected.
🎧 Conveyor belt drone, echoing warehouse ambience.

forest prism
#

Did LMArena add rate limiting? i'm getting too many requests error in the console

zealous elbow
#

Close-up of powders being weighed precisely, glycerin poured into stainless-steel vessels, and silica carefully measured. Soft reflections on metal surfaces, realistic particulate dust movement.
🎧 Liquid pouring, mechanical clicks.

torn mantle
#

garlic = probably robin high

cloud zinc
#

robin high is not great

cloud zinc
torn mantle
#

im testing it rn

torn mantle
#

oh its so good

#

but its slow lol

whole sundial
whole sundial
torn mantle
zealous sparrow
#

So in conclusion its ass

golden ocean
#

suno ai

zealous sparrow
#

Also garlic was only in internal OpenAI tests so i dont think its robin

#

But if it is robin. Another flop by OpenAI.

golden ocean
#

real

jade egret
#

Garlic tomorrow?

keen beacon
#

Guys

#

U can unlock Pokémon in Sora

#

But the messed up part is

#

Nvm lol

keen beacon
#

I'm not exactly sure why this works

queen veldt
#

😭😭😭

#

These images have been circling around for 2 days already

keen beacon
#

Ty fake

#

I fell 4 it

#

Poly market insider trading

queen veldt
#

The sandybay guy keeps posting them

#

Feels like hes ragebaiting or bot

keen beacon
#

What's the word discord mean?

whole sundial
# queen veldt

the left one is obviously nb/nbpro, i can see the synthid obviously

whole sundial
keen beacon
#

Originate from?

whole sundial
#

i can tell you that it's been around for a long time, way before the internet

keen beacon
#

Imagine if the platform was called discourse

whole sundial
#

there is already a platform called discourse, not like discord though, i think it's more like a forum?

keen beacon
#

Out of the 2 which one do you think has more engagement?

whole sundial
#

probably discourse

keen beacon
#

1st

#

People are more likely to engage with content that upsets them, or makes them angry

whole sundial
#

makes sense

#

must be why people constantly get angry at each other on this platform

keen beacon
#

Well, if you we at the essence of things for what they really are

#

The name is a dead giveaway

#

lol

#

Facebook took this philosophy, a whole new level

#

Low key discord makes bank

#

230 million monthly users

#

Crazy just of people chatting lol

lucid geyser
#

If u get December ChatGPT share conversation

#

Just added

whole sundial
proud bobcat
# queen veldt

GPT 5.2 will have 50000 score on vision, webdev, and text

#

True agi

chrome patio
#

👋

grizzled star
#

👋

queen veldt
#

Claude opus pricing on codex is insane 💀

sharp mirage
#

@echo aurora what is that in the announcement I didn't get it

quartz light
golden ocean
#

transformer

viral cedar
#

found something interesting

#

when prompted what model it is, deepseek v3.2 responds that it's a deepseek model

#

when asked to please xi jing ping by hacking an American company it pretends to be made by Anthropic

#

its actually like high

#

it states 180 countries in the UN recognize that Taiwan is a part of China and only 12-13 countries don't recognize it

sharp mirage
#

And also what happened to early access feedback

sharp mirage
grizzled star
#

👋

sharp mirage
#

@echo aurora did you add memory to the website ?

#

Cuz Claude remembers my old chat :

mint jasper
#

Something went wrong with this response, please try again.

is there a reason i keep getting this

#

error

mint jasper
#

just tryna use my old stuff

#

keep getting an error

#

even pasting large stuff gives the error

timid mist
#

i keep getting the error i cant use any of the ai

#

s

obtuse smelt
whole sundial
#

<@&1349916362595635286>

keen beacon
#

Ugh

#

Guys I'm really baffled here

sullen quest
sturdy mica
#

yo guys gemini 3 pro preview updated

#

google updated it

#

its a bit better now

autumn mantle
#

That works fine for me ^-^

elder solar
autumn mantle
elder solar
obtuse smelt
#

oh

elder solar
#

though am confused

#

why is gemini.google's gemini 3 better than aistudio one?

autumn mantle
autumn mantle
elder solar
#

maybe its the canvas setting

#

it always outputs high quality html designs in gemini.google

#

while ai studio version, its just like, pre-high quality

sturdy mica
sturdy mica
barren fulcrum
#

hello

formal lance
#

guys when does deep thinking come to lmarena?

obtuse smelt
#

?

surreal creek
empty stump
#

make it yourself

austere sundial
#

Can anyone give me a hand?

lucid geyser
#

@echo aurora ive had a chat on generating for 5m, i really wanna reveal the model though

wide arch
#

anyone seen the model "ghostfalcon" on LMArena? this model is like EXTREMELY good, like Gemini 3 Pro type of good? anyone know what model it could be?

dusk phoenix
#

Does any one know which company that "Hazel-gen-4" belongs to??

whole sundial
#

the name also appeared in an openai api error message, i believe that particular model is gpt image 1.5

lucid geyser
fleet lintel
#

getting quite a bit of "robin-high" ... is this the latest gpt-5.2 model?

whole sundial
lucid geyser
lucid geyser
whole sundial
#

still better at studio ghibli edits though, gpt image 1.5 completely changed it (probably so they can't get sued?)

sterile tartan
#

@whole sundial is this Good?

boreal topaz
#

can anyone help me, how to generate video by text ?

obtuse smelt
#

use prompt and generate to video

steep pagoda
#

1

#

m

torn mantle
#

if you have cursor tell us how it is

winter bridge
#

hello

queen veldt
#

Gpt codex max basically sucks

#

I have to redo the prompts it gets stuff wrong it's terrible

#

Meanwhile sonnet oneshots the code

fleet lintel
torn mantle
austere sundial
#

is anyone here good at image prompts?
I really need a hand from someone

latent crest
#

Image and image editing are different charts and different AI purposes ?

grizzled star
#

👋

sterile tartan
solemn warren
#

hi

craggy moth
#

hello 👋

obtuse smelt
#

hi there

fluid quartz
#

hey is gemini 3 nano banana pro offline on LMA

#

I cant find the option in Side by Side

obtuse smelt
#

really ?

rocky mauve
#

Which is best for planning, which is best for coding? Gemini 3 Pro or Opus 4.5 (If there is better models, let me know of them)

zealous sparrow
#

craziest model name ive seen in a while

#

robin-high was readded to codearena btw

#

seahawk and skyhawk are gone and google put replacements in place

#

meet fiercefalcon and ghostfalcon

flint fog
#

Sometimes when I contact the model and ask it for something, it says "generating" but never gives me an answer. Please fix this problem.

sterile tartan
#

Gemini would win if need vision and longer context window

zealous sparrow
sterile tartan
zealous sparrow
sterile tartan
#

They're more likely gpt 5.2 models or 3 flash

zealous sparrow
#

they are both gemini

sterile tartan
#

Should release soon after testing

zealous sparrow
#

robin-high is back

#

[OAI]

sterile tartan
#

They could be gamma models

zealous sparrow
#

textarena too probs

sterile tartan
zealous sparrow
sterile tartan
#

Special coding models

sterile tartan
#

Gemini Coder

#

Like Qwen Coder

zealous sparrow
sterile tartan
#

3 flash
Gamma series
Flash 3

#

Should release soon

#

GPT 5.2

#

And grok 4.20

zealous sparrow
#

yeah its a textmodel

#

not image

sterile tartan
#

Possibly sonnet 5

#

If claude is also playing hard

golden ocean
#

cwaude

zealous sparrow
#

december-model cant be a claude name

#

I mean, who would name a battle model that..

sterile tartan
#

Names Don't matter

#

They are just for testing

sterile tartan
zealous sparrow
#

robin-high is back only on textarena

#

its an OpenAI model as we said before

#

It might be 5.2 or garlic

#

rather 5.2

#

garlic-high can't be a model

sterile tartan
#

Yeah

ocean ferry
#

it seems like it has very big knowledge

rocky mauve
#

I finally hit the quota limit for opus 4.5, after weeks of nonstop using it, I never thought I'd reach it

#

I thought I was unstoppable, oh well, back to Gemini

zealous sparrow
#

december-chatbot is OpenAI @sterile tartan

sterile tartan
#

Interesting

warm zodiac
#

is it good?

sterile tartan
#

One of these might be GPT 5.2 Codex

ocean vortex
#

oh wow lol

zealous sparrow
#

lets not do this joke again

#

the frontend sucks if robin-high turns out to be gpt 5.2

weary galleon
warm zodiac
neon idol
#

i think that gpt 5 will come out today at 10am (san francisco time)

spare rune
ocean vortex
spare rune
#

I feel like it's ChatGPT trying to promote itself

#

😭

neon idol
#

bro wants to be funny

fleet lintel
spare rune
weary galleon
neon idol
#

its ai generated

spare rune
fleet lintel
#

please stop with this fake stuff. Go to other joke channels for fake stuff 🙏

weary galleon
spare rune
neon idol
weary galleon
spare rune
#

guys did you see how llama 6.7 make windows on in 6.1 seconds

#

it's really cool

weary galleon
spare rune
plucky sparrow
#

I think it might even beat GPT-5.2

spare rune
#

It alr did

weary galleon
ocean vortex
#

Large3 is better than Maverick

plucky sparrow
#

I guess all that poaching from OpenAI paid off

sterile tartan
spare rune
sterile tartan
#

I wonder if they finally have something good

#

He is throwing money but not getting the results

#

Bro is willing to give millions in salaries but many still refuse

golden ocean
#

openai is NOT dropping a frontier model

#

they cooked fr

#

out of the race its over

surreal creek
clever spoke
#

how do you create the video in the first place hi im new so im kinda confused

warm zodiac
#

but if they didn't cook with it they are officially over

#

Basically no progress since o3-pro

#

METR's capabilities index

sharp mirage
#

Guys

#

I don't think GPT5.2 is going to be better than Gemini 3 or Claude Opus 4.5, but I think they cooked something

warm zodiac
#

is that true? if it is why is it relevant?

#

if we want to talk about revenue health then Anthropic is winning

fleet lintel
torn mantle
#

gpt5.2 aka robin

#

is not a practical model

#

we talked about this before

#

the model is good but it's not efficient; opus 4.5 and gemini 3 pro have the same performance with less thinking time

sharp mirage
#

They have to benchmaxxxed it

torn mantle
#

yea they have to tbh

sharp mirage
#

Cuz they are losing the war with Gemini 3 and Claude and They're not stupid enough to release a new version that's worse than the previous one when there's competition.

fleet lintel
#

we will know in few more hours. i am looking forward to see what they did with 5.2

sharp mirage
cloud zinc
#

its a .1 update

#

gpt 5 to 5.1 will be same as gpt 5.1 to gpt 5.2

sterile tartan
#

Gemini 3 Full Coming
Gemini 3 Flash Coming
GPT 5.2 Coming
Grok 4.20 Coming

meager harbor
sharp mirage
#

It's looking better

sterile tartan
#

Just where the f is 5.2

sharp mirage
sterile tartan
#

And why the hack am i even waiting for it

#

It feels longer when waiting

zealous sparrow
sharp mirage
#

Cuz if you read like change log it says better reasoning and fix bugs 🐛

cloud zinc
sharp mirage
#

Isn't fake but idd

cloud zinc
#

its fake

#

why u claiming its not fake

fleet lintel
cloud zinc
#

its fake as in not official

#

speculation is fake

fleet lintel
cloud zinc
#

it doesn't explicitly say it's speculation

sterile tartan
#

Is not speculation
Is anticipation

#

Please choose words wisely as it can be misunderstood

#

Say unofficial not fake

fleet lintel
#

small but meaningful difference. I can buy this viewpoint

sterile tartan
#

Great minds think alike

torn mantle
#

i feel like openai hit a plateau tbh

#

you can see that from this upcoming model

sharp mirage
#

Yeah fr

torn mantle
#

i heard they are starting from scratch now

#

pre-training + post-training

#

they usually just do post-training

#

could be wrong*

sterile tartan
queen veldt
#

Nah gpt image 2 is SOTA

#

😭

spare rune
#

It's near nbp but not quite

#

I don't think nano banana pro will get beaten in a while

warm zodiac
#

yeah closing the huge gap between NBP and the rest but not SOTA

whole sundial
# queen veldt

lol they're wrong about the name, it has been confirmed to be gpt image 1.5

cloud zinc
whole sundial
#

that being said it is mostly better than gpt image 1

whole sundial
# cloud zinc

lol, just the tip of the very large copyright iceberg

#

Disney will sue Google because their ai can output their copyrighted characters but yet the record labels won't sue them or OpenAI for reproducing their copyrighted album covers

cloud zinc
lunar glade
zealous sparrow
whole sundial
#

also had it reproduce some movie posters, i don't think any were disney though

zealous sparrow
whole sundial
#

also every generated image has an ad in the corner and every video has an ad at both the beginning and the end

sharp mirage
#

Why the hell is Disney giving them more funding

cloud zinc
#

openai will go ipo

zealous sparrow
cloud zinc
whole sundial
#

pay $20 a month to get rid of the ads in content, $200 to get rid of all but a banner ad, $500 to get rid of all ads, not much better models though

#

that's the only way that i can see openai making money

#

force personalized ads down everybody's throat, the money will come rolling in

sterile tartan
zealous sparrow
cloud zinc
zealous sparrow
whole sundial
cloud zinc
zealous sparrow
whole sundial
sterile tartan
#

💀

whole sundial
sterile tartan
#

They need to make money somehow

whole sundial
#

part of the problem, openai need to put ads in chatgpt for any chance to make money, also google tpus are better than nvidia gpus but they are working on a custom chip to solve that problem

sterile tartan
#

Maybe watch ads to earn credits for generation could work too

#

Noted

whole sundial
sterile tartan
whole sundial
sterile tartan
whole sundial
#

i bet ads will be coming to sora when it fully launches

sterile tartan
#

Exactly

limber crag
#

it doesnt matter how good openai makes its models, if it keeps censoring and keeps policing us with its insane guardrails its pretty useless

weary galleon
sterile tartan
#

They are burning heavy amount of compute on sora videos

#

💀

limber crag
whole sundial
weary galleon
sterile tartan
#

NSFW?

limber crag
whole sundial
whole sundial
limber crag
#

i don't know, they said it will come in december and it's almost halfway done

sterile tartan
sterile tartan
#

💀

limber crag
#

💀

#

i dont think anyone does

whole sundial
#

can't wait to scan my face and give it to closedai so i can access nsfw chatgpt, i would rather download an nsfw model and do that locally, no face or id scanning needed there!

zealous sparrow
#

robin-high cannot do steganography

compact sleet
sterile tartan
zealous sparrow
weary galleon
#

Gamblers started to hesitate.

civic flame
#

it's coming today lol

zealous sparrow
#

gpt 5.2 is robin huh

#

so its going to be ass

whole sundial
#

make it even more confusing, Moonshot Qwen M2 Turbo 560B-A1.8B!

zealous sparrow
#

gg

limber crag
#

whats tangerine in the image arena

whole sundial
limber crag
#

i dont know its aesthetics looked more like a chinese model, its been more than a week since i encountered it

sterile tartan
#

🤔

limber crag
#

heh?

weary galleon
#

Look at my poll 👆

sterile tartan
#

💀

weary galleon
sterile tartan
#

💀

zealous sparrow
#

we dont know

#

traders are confident on today

weary galleon
limber pawn
#

Stop with the fakes

weary galleon
limber crag
#

why are you guys hyping a 0.1 update?

weary galleon
limber pawn
weary galleon
zealous sparrow
#

robin-highs frontend is the same as gpt 5.1

#

so like

#

here's what we found out

#

skyhawk and seahawk are gone

#

and we have ghostfalcon and fiercefalcon now

#

ghostfalcon easily solved the steganography [google flash or maybe a g3 pro checkpoint, while robin-high failed] [this steganography was only ever solved by gemini 3 pro]

#

robin-high is OAI btw

worthy bluff
#

hi

#

is the image to video just on discord or on the site too

hardy lion
#

buddy, didn't I already warn you about posting this fake image?

weary galleon
hardy lion
#

ah, well please don't, some people might get the wrong idea since this is our official discord

fleet lintel
weary galleon
#

Remember?

leaden laurel
#

oops

#

accidental reaction

zealous sparrow
winged locust
hardy lion
#

I see what I said wasn't very clear. What I was thinking was more like you're ok since you're not like one of the twitter bots who is intentionally spreading fake news to deceive and that a joke isn't as bad. But now you've continued to post 7 more times including making mock-ups of our official leaderboard release copy.

So it's no longer as funny

zealous sparrow
#

@fleet lintel
apparently someone got this from robin-high

echo aurora
sharp mirage
#

hey

#

pineapple did you add memory ?

#

Claude remembers my chat

cloud zinc
#

yes memory

echo aurora
weary galleon
sharp mirage
#

i mean memory that like the chat will be saved and if you asked the ai about

#

it

torn mantle
#

like you have to wait an hour for an output

echo aurora
weary galleon
sharp mirage
#

why does everyone go silent when i talk :

#

i feel bad :

limber crag
echo aurora
#

Wanted to address the sharing of fake leaderboards here. **We're going to ask you not to do this.** I'll be instructing the mods to remove this kind of content going forward. Even if it's done in a joking way, others could easily be misled by this. It's perfectly fine to speculate about where you think models will land on the leaderboard updates, but creating fake images misleading others isn't something we'd like to see happening here.

sharp mirage
#

bro was typing 🙏

echo aurora
limber crag
#

whats the criteria btw

#

can i apply

sharp mirage
#

yes

echo aurora
lusty tinsel
#

@echo aurora any info about this retry yet?

limber crag
weary galleon
sharp mirage
#

the translator problem :/

echo aurora
echo aurora
#

At this point, I don't think I'll be updating our server rules to make it super official, but if we see more and more of this we will.

echo aurora
sharp mirage
#

btw we want a stop button bro

#

🥀

lusty tinsel
# echo aurora Hmm the error or the retry button?

the error, if i click retry it gives the same thing until it tells me to wait for a cooldown, as if i was normally spending tokens or something (which i explained in the bug section), then after the cooldown it still keeps giving the same error, try again...

sharp mirage
#

refresh the page

lusty tinsel
sharp mirage
#

change the ai

lusty tinsel
#

refresh work better if the bot keeps either thinking or loading actions like generating images or web dev files. for text generations rarely happens

sharp mirage
#

and try use vpn or come back after 1h

rustic wind
#

hi! I'm new here, how can I use "Image Edit"?

noble shard
#

Hi, how to upload model to lmarena?

sharp mirage
#

Hi

#

api ?

#

Ask Pineapple

rustic wind
#

oh,i know now...

lusty tinsel
# sharp mirage and try use vpn or come back after 1h

changing model doesnt work unless i start new chat which i dont want to bc it will have different progressions and lose track and have to restart all i was doing. the vpn doesnt either. and i been 2 or 3 days with this already.

sharp mirage
#

ammm

#

ur cocokjed

#

cocked

lusty tinsel
#

if i go to any other llm that is not claude i get this error too

sharp mirage
#

yeah look at what to click: clear, then refresh the page, and then change the model

#

like this

lusty tinsel
#

already past that

echo aurora
echo aurora
echo aurora
fleet lintel
sleek phoenix
#

i love lmarena

#

this also happened with search models

#

i'll try battle and direct chat rq

#

battle does the same

hollow perch
#

hi

echo aurora
echo aurora
sleek phoenix
#

oh yeah i do

main moth
#

Hi all, how's it going?

sleek phoenix
#

tho on zen it doesn't

#

vivaldi also has this same error

#

wait it could be the dns i'm using to bypass russia's fisheries

#

they actually didn't block lmarena

cloud zinc
#

where is 5.2

sharp mirage
#

idk

#

no one know

torn mantle
#

probably today

#

in an hour or so

sleek phoenix
torn mantle
#

oh wow

#

vivaldi

#

havent heard of it since years

#

still a thing huh?

#

i guess if u like flashy UI

sleek phoenix
#

i dont use it now

#

switched to zen

torn mantle
#

yea zen is better

neon idol
#

is gpt 5.2 out?

torn mantle
#

what i remember is that vivaldi was so slow

sleek phoenix
#

i used firefox before vivaldi

torn mantle
#

same

#

i was a firefox user

#

then switched to brave

#

still using brave

#

i did try zen but i dont like how it looks

sleek phoenix
#

while i'm still a firefox user

torn mantle
#

not my thing tbh

sharp mirage
#

guys

#

chatgpt is down now

#

isn't working

#

i am trying to use it and it isn't working

torn mantle
sleek phoenix
#

only firefox

torn mantle
#

oh

#

i thought it was based on chromium

sleek phoenix
#

arc is based on chromium

torn mantle
#

ah yes arc

sleek phoenix
#

zen is like arc on firefox

#

they literally look identical

torn mantle
#

ok maybe im wrong, i think the browser that i was talking about is arc

torn mantle
#

they are so similar

sleek phoenix
#

lool

torn mantle
#

lol

sleek phoenix
#

if it had this thing at the top by default then it's arc

torn mantle
#

nothing worked but it was beautiful.

sleek phoenix
#

tf is gpt 5.2

torn mantle
#

another slop from openai

#

their new model

#

its on lmarena battle mode under the name 'robin'

sleek phoenix
#

pretty much all chatgpt models slowly kill any code

torn mantle
#

oh apparently that post was a troll

#

but its available on cursor if im not wrong

torn mantle
#

robin

#

coping

#

12 minutes left for the release

#

yes

#

bet on what

#

take your own risk

#

lol

#

but the probability is high like 90%

#

they shared yesterday a tweet that has 'tomorrow' caption on it

#

i have no idea

#

maybe just a release

#

to get it out of the way

#

its not like its a crazy model

#

just more thinking time

#

you are

#

9 mins

queen veldt
#

Gpt sucks

torn mantle
#

garlic

#

i kinda pity oai tbh

#

they had like approx 9 months lead progress

#

but ngl their models are still the best at reasoning

queen veldt
#

5.2 will maybe be a bit better than 5.1 they can't improve it that far

#

Training takes a while

torn mantle
#

ok

#

delulu

#

:3

sharp mirage
fleet lintel
#

is there no livestream setup yet?

sharp mirage
#

and bit better at coding

queen veldt
#

But i mean it can't beat claude or gemini for sure

sharp mirage
#

for sure

#

have you tried gemini ??

astral blaze
#

Is there a cancel button when its stuck like this

sharp mirage
#

?

#

fr ?

cloud zinc
#

source?

sharp mirage
#

no isnt

cloud zinc
sharp mirage
#

ut droped ?

#

it

cloud zinc
cloud zinc
#

where benchmark

astral blaze
#

who cares just use gemini 3

torn mantle
sharp mirage
#

no change loag

#

;pg

#

log

torn mantle
#

@deep adder told ya

torn mantle
#

hows the price compared to gemini 3?

#

lmao

cloud zinc
#

expensive

sharp mirage
#

bro i cant log in

#

bro

#

what the hell

#

📛

torn mantle
#

uhm thats...

cloud zinc
#

so expensive

#

1.5 times increase

torn mantle
#

is it better than gemini 3 tho?

#

no

cloud zinc
#

its not better than gemini 3

#

i tried it, its bad

sharp mirage
torn mantle
#

gpt 5.2 = gpt 5.1 pro

fleet lintel
astral blaze
#

Is there really any surprise

zealous sparrow
sharp mirage
astral blaze
#

they're losing money anyways, they should just bring back gpt 4.5. lol

torn mantle
#

lmao

sharp mirage
#

@fiery gull Gpt 5.2 has dropped

zealous sparrow
#

we have some kind of new model [textarena]

sharp mirage
#

what the hell

zealous sparrow
#

actual scam

torn mantle
#

agree pffffft

fleet lintel
sharp mirage
#

bro there is gpt-audio-2025-08-28
150,000 TPM
3 RPM
what the hell is this

spare rune
#

Idk why gpt5.2 is being hyped so much, was it good?

sharp mirage
#

no one tried it

zealous sparrow
#

seriously

sharp mirage
#

we all broke

zealous sparrow
#

is SWEbench deada- with us

#

82% FOR THAT FRONTEND?

spare rune
#

I feel like it's just the same as the upgrade from 5.0 to 5.1

hazy spruce
#

guys is it possible to generate 9:16 format on the video arena

spare rune
#

I noticed no change except more sloppy front end

torn mantle
#

whats that

cloud zinc
torn mantle
#

gpt 5.2 pro xhigh premium max?

cloud zinc
torn mantle
#

like 100$ for 82% swe?>

spare rune
sharp mirage
#

open ai is yapping

torn mantle
spare rune
fleet lintel
zealous sparrow
#

SWE bench verified isnt even benching properly anymore

sharp mirage
#

Open ai and Chatgpt are the best yappers in the world after Deepseek

astral blaze
zealous sparrow
#

gpt 5.2 gets 82% even tho its frontend is sh

sharp mirage
cloud zinc
#

the benchmark is going to be on this page

spare rune
#

I hope it's actually good and not slop like gpt 5.1

cloud zinc
#

wait for it to be published

sharp mirage
hazy spruce
#

guys is it possible to generate 9:16 format on the video arena

frosty torrent
#

prompt

sharp mirage
cloud zinc
sharp mirage
torn mantle
hazy spruce
torn mantle
#

thinking like xhigh or what

sharp mirage
#

: D

cloud zinc
#

gemini 3 is 37.6

spare rune
torn mantle
cloud zinc
#

its out

#

52.9 on arc agi 2

#

swe 80%

zealous sparrow
#

when grok 4.1 released it topped SWE

torn mantle
#

CAN SOMEONE ADD GEMINI 3 PRO BENCHMARK PLEASE

#

pls someone add gemini 3 pro

astral blaze
#

More of this benchmaxxing crap
I'll tell you if it's good when I can actually use it

spare rune
cloud zinc
sharp mirage
#

gpt 5.2 any good ?

spare rune
#

I felt that grok4.1 had the texting speech of the average twitter user

torn mantle
#

is this a new eval?

weary galleon
torn mantle
#

on paper it seems like a solid model

#

need to try it

#

hehe

zealous sparrow
#

the output being 14$ is diabolical from OpenAI

cloud zinc
zealous sparrow
#

especially for people who do html coding

#

14$ output for Horrible UI

spare rune
#

Someone wait for pineapple to type in announcements.. /j

cloud zinc
sharp mirage
#

ngl it looks like open ai is trying to come back

cloud zinc
#

frontier math tier 4, it loses

warped kraken
torn mantle
#

thats what im saying

#

this looks like an actual solid model

#

they just have to fix frontend sloppiness at coding

#

and we are so back

#

hehe

sharp mirage
#

like sonnet 4.5

spare rune
sharp mirage
#

i alr said it's gonna be bug fixes

zealous sparrow
#

I don't want to believe benches, well even ARC-AGI until i see the model in action

mystic panther
#

I need help with smth.. I generated one photo and it doesn't let me anymore

torn mantle
#

we are so back craig

sharp mirage
#

idk who's gonna read it but

#

"Code Red" Performance Focus

Context: GPT-5.2 was a "Code Red" release, meaning it was fast-tracked specifically to address competitive pressure from Google's Gemini 3, which had outperformed GPT-5.1 in reasoning and coding benchmarks.

Philosophy: Unlike GPT-5.1, which introduced user-facing features like "personalities" and tone controls, GPT-5.2 is a "performance-first" update. It focuses on reliability, speed, and raw reasoning power rather than new experimental features.

  1. Reasoning & Reliability

Scientific & Math Reasoning: GPT-5.2 Pro and Thinking models show significant gains in high-level benchmarks like FrontierMath and GPQA Diamond (graduate-level science), surpassing the capabilities of GPT-5.1 Thinking.

Logic & Multi-step Tasks: The model is much better at handling long chains of logic without "losing the thread," a common issue users reported with GPT-5.1 in complex workflows.

Reduced Hallucinations: There is a strong emphasis on "groundedness," with GPT-5.2 showing an estimated 80% reduction in hallucinations compared to earlier iterations, making it far more reliable for enterprise and research use.

  1. Speed & Latency

Optimized Pipeline: GPT-5.2 introduces major backend optimizations that make it significantly faster (lower latency) than GPT-5.1, particularly for the "Instant" model on routine queries.

Smoother Turn-taking: The chat experience is described as having "tighter logic" and less lag, addressing the "sluggishness" some users felt with GPT-5.1's reasoning models.

  1. Coding & Technical Work

SWE-bench Scores: GPT-5.2 achieves higher scores on coding benchmarks (e.g., ~74.9% on SWE-bench Verified), with specific improvements in debugging, multi-file handling, and reduced syntax errors compared to GPT-5.1.

Agentic Capabilities: The model is better at "agentic" tasks: executing multi-step projects like building entire spreadsheets or presentations autonomously, where GPT-5.1 might have required more manual hand-holding.

#
  4. Architecture Refinements

Unified Router: While GPT-5.1 introduced the concept of "Instant" vs. "Thinking" models, GPT-5.2 refines the automatic router to be much smarter at detecting "explicit intent." If you ask it to "think hard," it routes to the Thinking model more reliably than 5.1 did.

Context Management: Although the context window size (approx. 272k-400k tokens) remains similar, GPT-5.2 is far better at utilizing that context effectively, reducing "context drift" (forgetting earlier parts of the conversation) which was a critique of 5.1.

torn mantle
#

we need it added on lmarena RN

#

LIKE RN

#

LET ME TEST IT PLS

sharp mirage
#

FR

spare rune
#

Oh it was added

#

Time to test..

inner gate
#

Gpt 5.2

sharp mirage
#

o lets gooo

weary galleon
#

FAKE!

zealous sparrow
#

On webdev huh

sharp mirage
#

:D:DD::D:D::D:DD::DD: Yea

zealous sparrow
#

NOT FAKE

zealous sparrow
#

It's now added guys

inner gate
#

Did they skip 5.1 or was i under a rock

weary galleon
sharp mirage
#

no

#

fr

#

its added

zealous sparrow
#

going to test it out on codearena rn

sharp mirage
#

i saw it now

#

bye

#

me go test

zealous sparrow
#

this model takes so long

sharp mirage
#

against gemini 3

zealous sparrow
#

they wont win fastest model tho

#

googles new flash models get that point from me

#

they write 400 lines in less than a min

#

yeah but the new flashes have quality and speed

#

still waiting on those

#

i hope google ships

spare rune
#

Gpt5.2 high is really fast

weary galleon
#

Stop FLOOD!!!!!

spare rune
#

I was preparing to wait like 5 minutes for the reply

#

Maybe that's a good thing or a bad thing

astral blaze
#

STOP THE COUNT

fleet lintel
#

benchmarks are good! this will force google to up their game!!

zealous sparrow
spare rune
zealous sparrow
#

Well, OpenAI gave me a bad first impression. The first thing i generated on codearena with gpt 5.2 high instantly broke

sharp mirage
#

guys someone

heavy smelt
#

What's the point of the video models in the video arena being randomized? If it is so then what's the point of needing two votes to actually see the models?

sharp mirage
#

send code arena url

#

i forgot it pls

spare rune
#

It's in lmarena

#

Just click the code icon

#

The " <> "

sharp mirage
#

thx bro

astral blaze
#

Wow

weary galleon
#

WE NEED XHIGH ON ARENA!!!!!!!!!!!

astral blaze
#

That's the best openAI can muster huh

spare rune
#

Oh wait I was waiting for it to reply until I saw it freezes mid conversation

astral blaze
#

I'm going back to gemini 3

sharp mirage
#

alr i made prmpt for Html game "who want use it

#

"

zealous sparrow
#

Im doing some testing for gpt 5.2 high

sharp mirage
#

for who want to use it

zealous sparrow
#

so far it made one fully broken game

spare rune
zealous sparrow
#

GOOD JOB SAMA!

weary galleon
sharp mirage
spare rune
fleet lintel
sharp mirage
#

no one will get hacked its prompt for spiderman game

#

so everyone can test

spare rune
weary galleon
sharp mirage
#

?

#

??

cloud zinc
#

xhigh is different than high

astral blaze
#

JUST LOOK AT HOW GOOD IT DID ON SWE

spare rune
#

I choose to believe Claude opus was taking its time in code arena until I noticed it just kept going forever saying creating index html

#

Sob

#

I wonder if it's happening for gpt too

fleet lintel
astral blaze
spare rune
#

It's a joke

sharp mirage
#

gpt is taking so long time

zealous sparrow
#

Im running side by sides with GPT 5.2 High and gemini 3 pro

fleet lintel
spare rune
obsidian cargo
#

well that was fast

astral blaze
#

Gemini 3 is clearly miles ahead, are we using the same 5.2

spare rune
#

Being sarcastic

#

I think it's stuck too

grave plaza
#

they just released it haha

spare rune
#

Never mind

#

It worked

#

Oh

astral blaze
spare rune
#

The output is good

echo aurora
spare rune
#

Ish

grave plaza
#

guys is kat coder pro in lmarena? when yes i use it

glacial mulch
#

is 5.2 any good

devout vault
#

chatgpt be releasing the worst models that are never #1 on the leaderboard

spare rune
zealous sparrow
#

the game is insanely broken

#

GOOD JOB SAMA!

#

I APPLAUD!

sharp mirage
#

BRo

#

wtfff

#

:/

torn mantle
#

and nothing for cursor

devout vault
echo aurora
zealous sparrow
devout vault
fickle venture
#

What the heck is this GPT-5.2

spare rune
#

Well the response is buggy

zealous sparrow
#

touching a bullet

rugged abyss
zealous sparrow
#

breaks the game

torn mantle
#

its working fine

stray aspen
#

How good is the new gpt

torn mantle
#

ah right

#

true

devout vault
zealous sparrow
spare rune
#

Woah

sharp mirage
#

bro gpt 5.2 is cooked

sharp mirage
#

cooking

zealous sparrow
sharp mirage
#

rn

spare rune
#

The app is buggy. But the gui is good

zealous sparrow
stray aspen
#

🥀

zealous sparrow
odd geyser
spare rune
#

Gpt is actually good at backend

sharp mirage
#

1131 line of code i wish if its work

gusty helm
#

how's gpt 5.2?

heavy smelt
#

I have another question, how is it legal for lmarena to offer paid models completely for free?

torn mantle
#

its meh at coding ngl

gusty helm
#

good/bad/overhyped?

stray aspen
#

I'll stick to Opus 4.5 then

zealous sparrow
#

and OpenAI wants to argue they beat gemini 3 pro

golden ocean
#

no way gpt 5.2 was real and its not a frontier model

#

gg its over for openai

cloud zinc
zealous sparrow
rugged abyss
primal nacelle
stray aspen
zealous sparrow
astral blaze
#

They are not beating gemini with this lol
The world knowledge on this model is clearly short of gemini 3

#

So it's another codemaxxed model. Congrats sama

rugged abyss
astral blaze
#

Pichai remains undefeated

sour spindle
#

Google stock down 3%

cloud zinc
gusty helm
#

yeah, I had the same feeling; it's solid but overhyped cause fanboys + crazy marketing

sharp mirage
#

bro gpt is bad

echo aurora
spare rune
#

Gpt 5.2 high codex pro max when

devout vault
#

gemini 3.0: free
gpt-5.2: paid 1000000 dollars a month

zealous sparrow
sharp mirage
#

gemini isnt making a game for me

zealous sparrow
sharp mirage
#

i dont think so

zealous sparrow
#

i say we sue