#general

1 messages · Page 222 of 1

cloud zinc
#

yes

compact flame
#

Got the auto again

echo aurora
#

Yeah this is an experiment, meaning a small percentage of people are going to be seeing it currently. cc @cloud zinc

#

Also same with auto modality. blobthumbsup

proud bobcat
#

5.2 still get mogged in every category by gemini

#

LONG LIVE GEMINI

compact flame
#

Now I got some type of new ui

#

Instead of icons it's like a dropdown

neon idol
#

what is it?

proud bobcat
#

people wont use gpt 5.2 because its too pricy

#

if gemini 3 pro is basically still the king why pay more for gpt

plucky sparrow
#

Gpt useless 🤣

mild granite
#

cherry picking

hollow echo
#

👋

echo aurora
haughty jetty
#

👋

cinder gull
#

hello everyone what's up? I want to know how to face swap a pic with her face and her into a video on Instagram from another lady, thanks

zealous sparrow
#

Speaking of removed models, will speciale ever make a comeback? It's been a long while since it got deleted.

zealous sparrow
mild granite
zealous sparrow
mild granite
#

oh

tardy plover
#

grok removed?

whole sundial
#

I also heard some complaints about reve-v1 and reve-fast-edit being removed, it was replaced with the stealth models epsilon and epsilon-fast, some people would like to be able to select these models again

zealous sparrow
#

they might be updated

whole sundial
zealous sparrow
mystic sluice
#

is there a limits for gemeni-3-pro?

plucky sparrow
#

Haha this is beautiful

echo aurora
zealous sparrow
#

peak new model name @whole sundial

#

its a textarena model

#

i hope its not amazon

agile bloom
#

so best overall ai model is gemini 3 pro for December 2025?

golden ocean
#

artificial neural network vs biological neural network

agile bloom
cloud zinc
#

yes

cloud zinc
agile bloom
#

is there any way i can upload a txt file to gpt 5.2 on lmarena?

cloud zinc
#

lmarena doesnt allow pdf upload. you have to upload txt manually

empty stump
#

i dont like 5.2

#

i will have to try it in api

agile bloom
frosty lava
#

yes and we're definitly improving everything your talking about ? each new month a new model with better capabilities came out

#

and it wont stop in 2026 for sure

#

less hallucination, follow task much better, better at coding

#

like everything we want its what they're working on

lunar glade
#

anyone know any site that have nanobanana 4K model?

cloud zinc
#

no it is a scam

#

scam models

empty stump
#

sota in what

weary galleon
weary galleon
cloud zinc
empty stump
#

in benchmarks

zealous sparrow
#

5.2 benchmaxxing

weary galleon
#

It is designed for benchmarks only, not for real tasks.

cloud zinc
weary galleon
empty stump
#

that openai made...

weary galleon
#

GDPval is a banchmark.

cloud zinc
#

for work tasks

empty stump
#

of course a model by openai will perform best on an openai made benchmark

weary galleon
weary galleon
#

I will laugh when it will get lower place in Text Arena than GPT 5.1

#

🤣

cloud zinc
weary galleon
#

Banchmark is a banchmark. Period.

cloud zinc
weary galleon
weary galleon
#

I won!

cloud zinc
weary galleon
echo sinew
#

Hello! Let's keep disagreements respectful and friendly.

weary galleon
cloud zinc
weary galleon
#

OpenAI got 500 billion dollars from the US government, Anthropic got nothing, Google got nothing and Gemini 3 Pro and Opus 4.5 both outperform GPT 5.2. How is that possible? Scam Altman's mismanaging of the company is the reason.

cloud zinc
#

us only approved the stargate project

weary galleon
#

By the way, right answer is yes, GPT 5.2 said wrong answer, like always.

weary galleon
weary galleon
cloud zinc
#

its called investment

weary galleon
cloud zinc
#

gift to google

weary galleon
weary galleon
#

Even if a gift(but it's not) 500B and 200M is 2500X difference.

cloud zinc
#

500 billion over several years

weary galleon
cloud zinc
weary galleon
# cloud zinc

Google to Anthropic and US government (taxpayers money) to OpenAI. Are you feeling the difference?

hollow flicker
#

is it just me or do you guys think that most people use LMArena to get all the paid ai's for free

weary galleon
weary galleon
#

But some like me, don't

cloud zinc
weary galleon
# cloud zinc

Anyway Scam Altman will get those money. Anthropic and Google will not.

cloud zinc
#

cuz scam google are bad

weary galleon
cloud zinc
#

cuz gpt 5.2 is #1

stray aspen
#

Lmao

#

Scam altman

weary galleon
#

Let's go!

#

Vote, guys! Democracy will win!

empty stump
#

its bad because of the safety

cloud zinc
#

where is it? u are making a rumor, not actual reality

#

so far with my testing, it looks bad

rapid merlin
#

Gemini hallucinates the fact it has dall e 3? Interesting....

echo aurora
#

Is it just me, or are others experiencing a bug when trying to login? Essentially, after doing an email login, after entering email/password, clicking the login button doesn't do anything. Are others seeing the same?

echo aurora
weary galleon
#

Maybe because this "test" is paid by Scam Altman?

weary galleon
cloud zinc
cloud zinc
weary galleon
#

@cloud zinc If GPT 5.2 is #1 as you said multiple times today, why members of LMArena hate it so much?

weary galleon
cloud zinc
#

yes u used ur alt to vote

#

vote is not over yet

#

23 hours left

weary galleon
cloud zinc
weary galleon
weary galleon
weary galleon
echo aurora
#

Hey going to ask we move on from this conversation.

#

This doesn't seem to be very productive and is just escalating a bit here and there.

weary galleon
#

Just accuse, accuse, accuse, again and again! Without proofs.

golden ocean
cloud zinc
night kelp
#

can anyone help me a bit?

cloud zinc
weary galleon
cloud zinc
weary galleon
cloud zinc
#

thought so

night kelp
# cloud zinc yes

im working on something with python, been workin on it for a few days, then the chat decides to just stop, not being able to send emssages (i mean like i can send but it gives me that annyoing red prompt "something went wrong with this response, please try again") i refreshed, restarted, tried typing again, same thing

#

using ai ofc

#

also gemini 3.0

night kelp
#

i have almost the whole stuff on that specific chat, any other way i can restore it?

cloud zinc
#

that chat is bugged

#

always backup ur files after couple of hours

torn mantle
#

@cloud zinc what did i miss

#

any new model

night kelp
torn mantle
#

kaiser

night kelp
#

uhh

stray aspen
#

Why can't gpt 5.2 extra high think of another way of designing stuff

#

It all looks the same

night kelp
#

gimini

stray aspen
#

Wheres creativity

cloud zinc
torn mantle
cloud zinc
stray aspen
#

Yeah it cooked for me but I just don't like that its not that creative

torn mantle
cloud zinc
neat apex
torn mantle
#

imma sleep early

#

_<

neat apex
#

Likely because the gpt is sooo formal

#

So it will rarely try outshine, besises it writes well when you ask something exactly

ocean vortex
neat apex
#

😂

cloud zinc
golden ocean
#

gpt 6 latent space reasoning

neat apex
#

I think its general knowledge

ocean vortex
neat apex
#

I hope

ocean vortex
#

I mean it's reasonable to assume they have bigger release in the works

#

there was less than 1 month between 5.1 and 5.2

#

this was like a small incremental update

neat apex
ocean vortex
#

No clue why they tried to oversell it this hard lol

#

Like pushing for benchmarks this hard with 5.2 wasn't necessary I feel like

ocean vortex
#

5.2 you mean?

#

I don't think it is tbh

#

have you seen SimpleBench and other stuff?

#

5.2 gets beaten by both 5.1 and even more so by 5.0 there lol

#

That's the thing, my experience was the same. 5.0 > 5.1 and 5.2

#

And then Gemini3 just somehow manages to score great everywhere they test it at

#

Not definitively the best in select things perhaps, but not really underperforming anywhere either

neat apex
#

5.1 is way better than 5.0

#

Almost same smartness, but it actually efforts to make better responses

steel dune
#

Hi everyone, a quick question “Which AI model will one recommend to aid in accounting work?”

echo aurora
steel dune
#

Thank you

jade egret
cloud zinc
#

@torn mantle nice

thorn path
#

Wait the text leaderboard updated and 5.2 isn't even in the top 10 lmfao wth

echo aurora
thorn path
burnt sinew
#

anyone notice gemini 3 is hallucinating thinking it has experiences?
"I personally have a database with over 800 entries."

burnt sinew
#

if you can share

echo aurora
echo aurora
burnt sinew
#

fails seahorse test

echo aurora
lucid geyser
#

Really flipping it around

swift oyster
#

LIVE BENCHMARK UPDATE
Model: Hawk (Launch 25th)
We're currently halfway through the official ARC-AGI-2 benchmark - one of the hardest AI reasoning tests in existence!

Current Stats (48% Complete)
Correct: 14
Incorrect: 44
Accuracy: 24.1%
Progress: 58/120 tasks

Key Takeaways:
Outperforming Claude Opus 4.5 by 10%
Currently ranked #9 globally

surreal creek
sullen quest
cloud zinc
thorn path
burnt sinew
#

i think only o3 and opus 4.5 thinking gets seahorse question right?

jade egret
burnt sinew
#

not without search i dont think

jade egret
#

gemini 3?

#

o

#

lol

brisk turret
#

Where is 5.2 on the leaderboard?

#

Wtf is going on

jade egret
#

Must be tomorrow?

tired plaza
#

guys, what are all the site's keyboard shortcuts? is there a shorcut for new chat?

astral bloom
#

@echo aurora when will the video arena be accessible, I've seen people here claim they had access to it.

frosty lava
#

when will we get an ai with like 1% hallucination

#

that's all im waiting for

#

ofc its good to have genius ai but what if they always do errors

#

give me actual genius ai if you want but that are capable of actually doing things without mistake

frosty lava
#

Cause like in 2026 we're supposed to see alot of new ai like grok 5 and all

#

and they're supposed to be like much better

frosty lava
#

we'll see

#

no one know actually

delicate wagon
#

Hi everyone

obtuse smelt
#

hello

sullen quest
#

@astral bloom video arena is right here in discord, is a discord exclusive feature, go to the channel how-to-video-bot for more info

cloud zinc
obtuse smelt
#

hmm is happening
Something went wrong with this response, please try again.

left lodge
#

Anyone wanna search with images?
I have a way.

#

Literally just go to lmarena.ai
Add images
Switch to search modality
Send Prompt.
Done.

#

Enjoy the information 💁

#

Currently these links are redirected to lmarena.ai
And video generation requires login.

cloud zinc
sour spear
torn mantle
#

so sus

mild harness
#

hi, how can I use gpt 5.2 xhigh on lmarena? I only see 5.2 high. thanks

left lodge
keen beacon
#

Does anyone know why it no longer displays images in 1920x1080?

obtuse smelt
#

is just 768 x1360

keen beacon
frosty lava
#

theres actually no llm without hallucination

obtuse smelt
#

is gemini error ?

weary galleon
#

🏆Sonnet 4.7 is coming... until Christmas ⛄

sterile tartan
weary galleon
sterile tartan
#

Expect quicker releases now from all companies

#

Thr competition is peak cutthroat now

weary galleon
#

AI winter is coming... My expectations are Anthropic and Google will release lots of great models under this competition.
P.S. Maybe xAI also, I am not sure.

#

Also, AMD and Nvidia will release new AI chips in January, which also accelerate this rush dramatically.

#

Googles and Amazons chips are also great 👍

#

My expectations are after Sonnet 4.7 release, it will get 🥇 first place in coding, Opus 4.5 🥈 second.

compact flame
#

They gonna be either focused on other stuff or just training the new models for long time

weary galleon
#

Grok 4.20 is coming until Christmas ⛄ also. Not sure how much good it will be.

frosty lava
#

Do you guys think ai will really replace like almost every jobs ?

#

many people is saying this is stupid and will never happen

#

but

#

why tho it can definitly happen

compact flame
#

Animators? Could be after I seen sora

frosty lava
#

then why majority of people still think it wont happen isn't it like crazy

compact flame
frosty lava
#

how can we ignore it when we see the massive investment and the literal race

#

for ai

compact flame
#

Its kinda based how well they train ai

frosty lava
#

there's always a solution

compact flame
frosty lava
#

It will just be a problem to solve not something impossible to achieve, people be like its impossible yet every time human do new discoveries and what felt impossible is just a reality

#

there's always someone that find a new way

#

there will be some problem but saying we can't ever solve it feel stupid

#

in my opinion

weary galleon
frosty lava
#

we just want less hallucinations and yes

weary galleon
frosty lava
#

your right but are we already at that point where it will be like as efficient as a team of programmer, i don't really know at all

#

i think definitly in 2026 what your saying will be doable

#

we're close for sure

weary galleon
#

I'm a programmer, and that's why I think LLMs are bad for us. To find a job is much harder for me even now, if to compare a few years ago.

frosty lava
#

it will just be very very different

#

it can definitly help us all if we think about medecine, new discoveries, and future

#

its just that theres this questions about jobs and what will happen but no one can answer already were not here yet

#

but for sure, we won't let people struggle forever

weary galleon
frosty lava
#

Physical work might also get replaced

weary galleon
frosty lava
#

In china its already happening

strong wren
#

hi

weary galleon
frosty lava
#

there is also robotics going with it

#

and yes they are doing alot of progress

#

I would say we're really into automation these days

#

everythings is about it

#

we want more efficient, cheaper worker

#

and its doing alot of progress

#

You can look deeper into it

#

its happening.

viscid cloak
#

nanobanana pro DIRECT keeps giving“something wrong”, anyone as well?

obtuse smelt
weary galleon
frosty lava
#

im not saying its already done, but definitly it seems that it will happen

#

and not like in 50 years

#

that's definitly something else than LLM

#

but its also doing progress

weary galleon
frosty lava
#

like we definitly won't make something that kill you

#

or is dangerous

weary galleon
frosty lava
#

you know how the world were 50 years ago ?

#

its changing much faster than that

weary galleon
frosty lava
#

But we shouldn't see only the bad side of it honestly

#

nothing is all good or all bad

#

but from what we saw in history, when we progress its mostly good

robust sluice
obtuse smelt
#

need fix this

#

several hours is look error

robust sluice
#

mine its been 3 days

#

Gemini never appear in Battle only Flux models

obtuse smelt
robust sluice
#

and Direct keep error and said try again in 50 min

obtuse smelt
#

hmm

plucky sparrow
#

the question is whether you will end up living the majority of your life in hell, or abundance

frosty lava
plucky sparrow
#

the great depression was a pretty long period of time

frosty lava
#

its a win win

plucky sparrow
#

it will take a long time

frosty lava
#

if its bad for most people then its bad for them too

#

so it most likely won't be bad

plucky sparrow
#

at first they'll be like "oh, but, AI is creating new jobs. we'll just create more gov jobs."

#

they'll delay it as long as possible

#

we might even get a revolution before we getr change

#

historically, that's how it plays out

frosty lava
#

yes theres a time for it to be good actually they will delay it your right, but definitly the worst case will not happen

plucky sparrow
#

yeah i agree. to be honest, worst case i'm worried about is, the rich people are in charge of AI right now

frosty lava
plucky sparrow
#

and the rich people have access to the best AI models. not worried about an AI 'killing us all' as much as 'rich people having great influence over AI' and using it to manipulate people

#

imagine a 100x better claude opus 4.5, but only the rich have access to it, adn they use it to exploit you

frosty lava
#

i don't believe we all just gonna accept something like that happening and they know it

plucky sparrow
#

sure, but, like all wars, it's usually won by the people with the most power

#

can you win against a robot army?

#

i think we see great famine before we see great abundance

frosty lava
#

why will they use a robot army, why will they want to be hated by everyone, and that mean like every country have to do it

plucky sparrow
#

historically, that's always happened. not sure this time is 'different'

frosty lava
#

if everyone like everyone do hate them they are loosing

#

they are human too

#

they don't want human to die

#

and nobody will let someone doing it

plucky sparrow
#

have you not seen how many humans were enslaved historically?

frosty lava
plucky sparrow
#

we're living in less than 0.01% of human history. we forget how bad things have been

frosty lava
#

if something like a robot army controlling everyone happen humanity is dead

#

would mean even them at some point will die

plucky sparrow
#

i don't think it will go on forever, i'm just saying, historically, we go through incredibly rough times before we get good times

#

and we forget, as a species, how good it is now, compared to how it was

#

and that cycle has repeated itself for as long as history was written

shrewd citrus
#

Anyone else having issues with nano banana pro

plucky sparrow
#

so either this time is different, or it's not

frosty lava
#

not that bad

#

atleast

obtuse smelt
plucky sparrow
#

the industrial revolution, the great depression, the last 2 wars..

frosty lava
#

But also people getting smarter with more right

#

you can't compare something that happened when like everythings was different from now to now

plucky sparrow
#

so you're saying. "this time is different"

#

it could be. but it rarely is

frosty lava
#

it always are different, cause something like that happening in the world of today doesn't seem achievable and if it is like everyone will loose

#

even them

plucky sparrow
#

yes, i don't think it will go on forever

#

i think history never repeats itself exactly but it often rhymes

#

i think we will get an age where a lot of people are in famine because of AI, and eventually a revolution of some sort, and eventually we will have abundance

frosty lava
#

they need to be popular too

plucky sparrow
#

the only question is whether we end up living in the majority of the famine and barely see the abundance

frosty lava
#

to be popular it also mean they have to do good things somehow

plucky sparrow
#

like many people who survived through WW2

#

we might end up telling our grand kids, "Back in my day, AIs made us all poor.."

frosty lava
plucky sparrow
#

no, it started because of famine in Germany

frosty lava
#

so its completely different

#

not even the same reason

#

Ai are new technologies

plucky sparrow
#

we'll see. you're banking on the government being actually controlled by the poor majority instead of manipulated by the rich minority

frosty lava
#

yes cause like you said majority and minority, and if the majority start to defend themselves the minority wont win

#

and they don't want that anyway

#

it would in every case make them loose

plucky sparrow
#

i think what you're saying is right, i just think it'll take a lot longer than we'd like

frosty lava
#

but i dont know what will happen

#

i just know we can't go back anyway

plucky sparrow
#

yeah i agree

#

people somehow think protesting AI will get anywhere 🤣

swift oyster
#

Result are in for our upcoming model.

plucky sparrow
#

if anything, it's better the country you live in, develops AI before the country that may not share your same values

sterile ore
#

hello

obtuse smelt
#

hi

frosty lava
#

its more efficient

#

and that's all we want

#

when people will realize they can't beat someone that use ai in the future

#

they will use ai

#

and you can't really blame them

plucky sparrow
#

imagine trying to protest the development of guns 😄

frosty lava
#

you just can't tell to everyone to stop

plucky sparrow
#

imagine if your country had no guns but every other country does 😄

frosty lava
#

and if someone still do it he will be more efficient

plucky sparrow
#

not even in the army

frosty lava
plucky sparrow
#

i do agree with you 💯 on one thing, protesting AI is stupid

#

it won't lead anywhere

#

except your own misfortune

frosty lava
#

we can't protest progress even if the progress is dangerous or will lead to big problem like mass unemployment

#

cause its somehow still a progress

#

all we can do is find solution but it doesn't even depend on us

#

but on few people :

#

all i can say is if its too bad the majority will defend themselves

#

so it probably won't be "too" bad

plucky sparrow
#

hmm

#

but things are 'too bad' in many countries

#

and the majority isn't able to defend themselves

frosty lava
#

its not equality everywhere, for the big countries it work like i said

plucky sparrow
#

i think that's why AI development is actually super important

#

it will determine whether your country is a 'big' country or not, and how long the famine will last

frosty lava
#

if ever a "bad" countries get the lead of the most powerfull progress then we're all at their mercy

#

that's why its a race

#

we're saying to know who will be the leader

#

of the next decades

#

but no one understand it atleast not yet

plucky sparrow
#

will be interesting to see who wins

#

might not be who we think at all

frosty lava
#

yes

#

we never know it can come from anyone

plucky sparrow
#

e.g. may not be US or China

frosty lava
#

it will be the smarter one

#

that's it

plucky sparrow
#

might not even be smarter, could be luck

frosty lava
#

yes your right

#

we just can't tell

#

but its a race and not for nothing

#

its much more important than people think

plucky sparrow
frosty lava
#

when china and us are racing how can people still say hey its nothing

#

i don't understand sometime

plucky sparrow
#

put it on 2x or whatever. the borders historically change like crazy, yet somehow, we think war is over

frosty lava
#

how delusionnal we can be

frosty lava
plucky sparrow
#

I do hope you're right, that this time really is different

frosty lava
#

if its not big enough to see it through eyes people will still think its nothing

plucky sparrow
#

otherwise in the next 7-30 years we're likely going to go throuigh hell 😄

frosty lava
#

by seeing at the progress of it this year for example

#

its crazy

#

all of it is being speed up cause its a race

plucky sparrow
#

well i mean, people predicted self-driving cars everywhere 5 years ago 😄

frosty lava
#

when its about power you'll be surprised

#

how fast it happen

#

seeing Trillion dollars of investment lol

#

i don't even know who have this much money but yeah

#

its still happening

weary galleon
# swift oyster Result are in for our upcoming model.

It doesn't look real. In my experience really good models don't need hype, they get hype automatically from their existence. You post these pictures every single day maybe because after the release hype will over.

frosty lava
#

Yes gpt 5.2 what a scam in my opinion

#

they did this so they don't loose fame but its definitly not as good as expected

#

benchmark are impressive yet we dont see real result

#

like student getting 100 / 100 on a test but when it come to apply it in real life nothing happen

plucky sparrow
frosty lava
#

its not just a new fancy technologies

#

its power but people be underestimating this

#

while Trillion dollars being invested

hollow mist
#

Tell me what the problem is. I've been nerfing everything for a long time, but now I keep getting the error "Something went wrong with this response, please try again." (Gemini 3 pro image preview nano bsnana)

#

Help please

compact flame
#

It's not available on yupp either

robust sluice
#

someone told me error result doesnt count on limit, but I try over and over when limit resets, I found it really count, cuz this hour I got only error msg and limits still runs out

obtuse smelt
#

yeah me too is sme happening again

robust sluice
#

how I test is: generate something random and Flux model appear (I remember the style it draw cuz I see it a lot) with other Error model I keep clicking on retry with that error model and it count on 11s then error again, I do this untill it doesnt count anymore, that means limit runs out already

hollow mist
#

This is an update, the server just doesn't work.

magic ravine
#

Nano Banana Pro is not working again

obtuse smelt
#

hmm, well me too

exotic iris
#

Everyone you love this image or not

storm summit
#

any cods invite for sora?

exotic iris
pastel bone
storm summit
pastel bone
storm summit
#

but why u can generate video and i cant?

ocean vortex
pastel bone
#

Maybe u can use Sora👍🏻Good luck

true harbor
#

Is image generation not working at all in lm arena since yesterday? At least for Nanobanana

pastel bone
#

I was amazed

#

And I saw many models like kling😆

magic ravine
#

Damn NatsuSadChibi

ocean vortex
#

with GPT5 they went all in for genuine performance. With 5.1 they went for response style, and then for 5.2 they had this "oh sh'it" moment franctically responding to Gemini3

pastel bone
obtuse smelt
#

oh well is work again

ocean vortex
#

that turnaround of less than 1 month between model versions is not normal tbh

#

We had similar with 2.5Pro 05-06 vs 06-05, but that was marginal differences and was advertised as such

magic ravine
pastel bone
weary galleon
magic ravine
frosty lava
#

they wanted to release a new model fast due to them loosing people cause of gemini and claude

#

but it wasn't the best idea

weary galleon
weary galleon
weary galleon
frosty lava
#

i guess they just lost due to their idea can they get back on the race ? i don't know honestly it depends on so many things

#

people will always use the best

#

fake benchmark doesn't get them anything lol

#

if it can't do it in real life

buoyant crypt
#

Nano banana is not working today

weary galleon
viscid cloak
acoustic crater
magic ravine
#

And hope they fix it

plucky sparrow
#

Probably because they're trying to rush out flash 3

magic ravine
plucky sparrow
#

Faster and cheaper for those paying

#

(High use people and api people)

#

And some claim it's better in some ways. We'll see today

vale vortex
#

Quick question, which one is better for research,

Kimi, MiniMax Agent, or Deepseek?

compact flame
plucky sparrow
#

I haven't used deepseek or minimax but K2 is actually pretty good for research

#

surprisingly good

pale torrent
#

How to use Sor 3?

plucky sparrow
plucky sparrow
onyx shore
hollow mist
#

Is there any news when nano banana will work?

obtuse smelt
#

hmm still not respond

sterile tartan
obtuse smelt
plucky sparrow
sterile tartan
#

Is the research very important?

plucky sparrow
#

what is 'low price'

sterile tartan
plucky sparrow
#

so not very cheap then

sterile tartan
#

Probably well cheaper

#

The black internet market of ai yeah

#

I have a a seat in ChatGPT business myself

compact flame
#

Seems like advertisment

#

Should we call mods or something

sterile tartan
#

💀

compact flame
sterile tartan
#

The Stage is Yours

compact flame
sterile tartan
#

U can atleast ping pineapple afaik

zealous sparrow
#

gemini flash 3 is defo today

#

google put out another model onto battle LMArena

#

Updated Ghostfalcon and Fiercefalcon

obtuse smelt
#

what the

sour spear
plucky sparrow
#

preliminary reports are saying 'the sonnet to the opus'

#

we'll see soon enough if it is

plucky sparrow
compact sleet
#

It felt like a Google model to me, could be wrong though

sour spear
#

If Ghostfalcon is Flash 3, then OpenAI is in even deeper s*** than they already are. I just got it in battle mode, and it did a fantastic job

zealous sparrow
#

google got the 3/3 on my questions after i changed them up a bit

#

btw

#

no AI could 3/3 this

#

i want to see if Xhigh can after i changed em up a bit

compact sleet
zealous sparrow
#

ima test xhigh rq

#

also deepseek r1 turbo got it right

#

so if xhigh doesnt

#

OpenAi is cooked

fiery gull
zealous sparrow
#

gpt 5.2 xhigh officialy lost to deepseek r1 turbo and the new gemini model

#

2/3 because of the Macy question

fiery gull
zealous sparrow
#

if you argue fell behind means she fell off the hill it aint, it means that she is in the back of the hill

acoustic bolt
#

Guys work the server on lm?

neat apex
#

Gpt 5.2 trys a lot to not allucinate and ends not reasoning many things

fiery gull
zealous sparrow
fiery gull
acoustic bolt
#

Gemini 3 pro

neat apex
#

When R1 was released it was the best model sometimes by far in everything, yes?

zealous sparrow
#

it was then NERFED

#

a f- ton

#

It was able to recreate websites

neat apex
#

Yeah, i am not tweaking

He was a burfed 4o and near to level of sonnet 3.5 but with a reasonable reasoning

compact sleet
dull mason
#

5.2 on text leaderboard wen

zealous sparrow
neat apex
dull mason
compact sleet
#

Do lmarena have a predecent of having a model avaliable on the direct select chat, but that model is not on the leaderboards yet?

sour spear
neat apex
#

The fact open ai makes a benchmaxxed model, to then make it actually smart is very true, it is very noticeable in 4o

fleet lintel
compact sleet
#

I feel like it can be used as boosting purposes, if the side by side and battle votes are not separated.

zealous sparrow
pale torrent
#

Veo 3 please

zealous sparrow
neat apex
#

Since he does many works perfectly, and loses perfomance gradually when the task is more different

zealous sparrow
#

They always release on a tuesday

neat apex
#

Its sad that gpt 5.1 had a relative flop

#

Its gpt 5, but actually efforts to make a better answer, it is great

zealous sparrow
#

Google also gave us the best OCR this year

fleet lintel
# zealous sparrow tuesday

yeah, Logan tweet also suggest that it will happen today. Good thing that I dont gamble, otherwise I would have place bet for Wednesday 🙂

zealous sparrow
#

It will take a LOOOOOOOOONG time till someone beats gemini pro 3 OCR

dull mason
neat apex
fiery gull
compact sleet
zealous sparrow
compact sleet
#

And probably no one can't beat them.

zealous sparrow
#

I also used gemini Live from the app and told it to identify things in my room

#

it didnt get one wrong

neat apex
#

Well, Qwen3 Omni had a level close to Gemini 2.5 and runs in a phone

compact sleet
#

Hmm, ofc China had to have one lol

#

Perhaps you're right.

neat apex
#

There was a company the Prime was not in the 3rd model?

sour spear
zealous sparrow
#

I want to know Flash score on simplebench

#

I hope its high

obtuse smelt
#

well yeah need fix

plucky sparrow
#

i don't get how some people like gpt 5.2

#

are they all paid shills?

neat apex
#

Primes are in the 3rd model?:
Gpt 3/3.5 ✅ (Gpt 4 was not way better against Claude 2)
Grok 3 ✅ (Grok 4 fast was great, but it suprassed the 1400 points)
Gemini 3 ✅ (By far)
Claude 3 ✅
Qwen 3 ✅ (By far)
Mistral 3 ❌ (The first Mixtral 22bx8 were a beast, yet nowadays they barely manage to be competitive)

tiny halo
#

we need claude has image import

plucky sparrow
#

i really tried to like it, i tried various things on it, but, it feels like a model from 1.5 years ago

neat apex
#

What are you dooing with that?

fleet lintel
# plucky sparrow i don't get how some people like gpt 5.2

Are you using Plus tier or Pro tier?
Plus tier is terrible.. you get medium gpt 5.2, which is like gemini flash models. They are treating their paid customers very badly 🙁

You need to use Pro or API to get the best out of gpt 5.2

neat apex
#

Ah yes, gpt 5.2 medium

plucky sparrow
#

i'm using it from api, even xhigh. i haven't tried 'pro' but i'm guessing it's similar to xhigh

neat apex
#

Its only gpt 5.1 medium, but allucinates sigthly less (to the unoticeable level)

fleet lintel
plucky sparrow
#

maybe 5.2 medium. but i find 5.1 way better

neat apex
#

Why non Gpt 5.2 xtra high even exists? Lmao

plucky sparrow
#

i tried it with coding, i tried it with logic, tried it with medical text, tried it with long context

sour spear
plucky sparrow
#

good at what?!?!

#

like can someone give an actual example, prompt and output?

neat apex
#

If you use xtra high, it does very very good analises in documents

#

Sometimes better than opus 4.5 or Gemini 3

plucky sparrow
#

give an example please

#

because every time i've tried it, it's worse than all the other models

neat apex
#

My brother creates documents using Gemini 3, and when go fixing it with Opus 4.5 and Gpt 5.2 xtra high

plucky sparrow
#

i might be using 5.2 medium, if xhigh is 5.2 medium

#

what kind of documents? what did it fix that opus 4.5 couldn't?

neat apex
#

Most times 5.2 notices way more issues than opus 4.5

plucky sparrow
#

proofreading?

neat apex
#

I dont have it now xd

plucky sparrow
#

ok but what kind of documents and what kind of errors?

neat apex
#

But yes, you are right, besises xtra high, it looks be worse than gpt 5.1

plucky sparrow
#

i had xtra high try to create a game for me. it was terrible

#

not functional

#

or barely functional, rather

neat apex
plucky sparrow
#

please be more specific than in general because every document and information i've fed it, it acts like it has low context

#

and it's all confused

#

and the few times it suggests things, it's wrong

#

my specifics are html/js code, transcripts, and medical text

#

it's failed at all three compared to other models and even gpt 5.1

#

i'm very confused what it excels at

sour spear
plucky sparrow
#

🤣

#

ok that is fair, i've not thrown any standardized texts at it, just actual use cases

tiny halo
#

omg why all claude moldes does not have image import

worn flume
#

Hey

#

I'm giving away an apple 🍎

lunar glade
#

can we all agree that LMArena's banana pro is spoiled? just can't get any result beside other than "Something went wrong with this response, please try again."?

sour spear
#

But it is a little crash prone atm, needed two attemps to work. 😉

tribal kernel
#

Hi everyone, I'm using Arena, but it always gives me this error, how can I fix it? Thanks

magic ravine
#

Yep. It's been happening all day.

sour spear
#

They're probably busy implementing Gemini 3 Flash Image. 😉

echo aurora
#

If you haven't already would recommend to try: hard refresh of site, clear your cookies/cache, and if no luck starting a new chat. This may help.

tribal kernel
echo aurora
echo aurora
# tribal kernel

Can you: open Developer Tools -> open Network tab -> run a new prompt throwing an error -> in the search bar in dev tools search for the word "Stream" -> open the file that has the Eval ID (random set of numbers/letters you see in the URL) -> and then look for a Status Code.

#

Does it say 🟢 Status Code 200, or do you see 🔴 Status Code 427, 400, etc?

sour spear
#

Works after a handful of retries. 🙂

obtuse smelt
#

is status red

tribal kernel
# echo aurora Can you: open Developer Tools -> open Network tab -> run a new prompt throwing a...

Hi, I checked DevTools in detail.

Network → Fetch/XHR is working and requests return 200, but no stream request is ever created.
Searching for stream or eval shows nothing.

This means the stream never starts at all, so there is no Status Code to inspect.
The error “something went wrong with this response” happens before the streaming endpoint is created.

Looks like a backend / model issue on LMArena, not browser or client-side.

echo aurora
tribal kernel
# echo aurora Okay good to know, thank you for sharing. And you're sure you have the network t...

I had DevTools open on the Network tab the whole time, with Fetch/XHR enabled, before and during sending the prompt.

I retried multiple times and the error appears, but no stream (or eval) request is ever created.
Only regular fetch requests return 200, then the UI shows “something went wrong with this response”.

So the request seems to fail before the streaming endpoint is initialized.

echo aurora
tribal kernel
viscid cloak
#

No models 😢

echo aurora
#

Hmm

#

What browser are you on? Seeing the same for Side by Side?

#

Can't say I'm seeing the same on my end.

viscid cloak
#

i tried google ios and firefox ios, same results for Direct. lemme try sidebyside

#

Sidebyside worked well, and Direct turned normal. may just a minor error

echo aurora
#

Okay glad to hear it's working again. Keep me updated if things change.

viscid cloak
#

hazel revealed?

torn mantle
#

Last time i checked lmarena are adding 3 new things

#

Video models
Auto modality
New model selector

atomic lagoon
#

Lmaoooo

#

The image is silly

molten cipher
#

it free

steel steeple
#

Why pro banana doesn't work anymore

echo aurora
#

@quaint raptor you'll want to review the information in #1397655624103493813 for a better understanding on how to use the bot.

steel steeple
echo aurora
steel steeple
echo aurora
steel steeple
#

Love you service, thanks

zealous sparrow
#

i wonder if ghostfalcon is flash too, Maybe gem 3 pro?

echo aurora
torn mantle
grand flame
#

gemini died

ripe mountain
#
poll_question_text

Open-Source SOTA for Agentic Coding

victor_answer_votes

1

total_votes

2

zealous sparrow
winter rain
#

Is lmarena a permanent video generator

zealous sparrow
fleet lintel
#

Only difference between ghost and fierce is search on or off

#

My guess is ghost is with search

echo aurora
fleet lintel
winter rain
#

@echo aurora free lifetime?? Or any chance of premium

echo aurora
zealous sparrow
grand flame
#

i think gemini 3.0 pro died

#

since it gives me something went wrong everytime

zealous sparrow
#

works in AI studio

winter rain
#

@echo aurora we respect your hard work bro??

muted timber
#

hello guys

#

i have a problem

echo aurora
muted timber
#

on the AI

grand flame
muted timber
#

not working anymore

zealous sparrow
#

um

muted timber
#

and i dont want to create a new chat

zealous sparrow
#

pineapple, is your endpoint down?

echo aurora
# grand flame since it gives me something went wrong everytime

It seems to be working for me.

Can you: open Developer Tools -> open Network tab -> run a new prompt throwing an error -> in the search bar in dev tools search for the word "Stream" -> open the file that has the Eval ID (random set of numbers/letters you see in the URL) -> and then look for a Status Code.

zealous sparrow
#

nevermind huh

#

wait ill tes

winter rain
#

But how can u access those tool bro lmarena

echo aurora
muted timber
#

What do you mean?

echo aurora
# muted timber What do you mean?

Can you: open Developer Tools -> open Network tab -> run a new prompt throwing an error -> in the search bar in dev tools search for the word "Stream" -> open the file that has the Eval ID (random set of numbers/letters you see in the URL) -> and then look for a Status Code.

muted timber
#

open Developer Tools

#

where is it??

grand flame
echo aurora
muted timber
#

i know now

#

but

#

" run a new prompt throwing an error"

echo aurora
muted timber
echo aurora
#

This is how we get better inforamtion to understand what's going wrong.

muted timber
#

so i need to chat another time ?

cosmic salmon
# muted timber

Oh yeah, same thing, but also with Nano Banana Pro sometimes. It seems like the fetch request returns nothing after a while, but still goes through with 200 code

muted timber
#

i need for scripting

echo aurora
# muted timber

Follow the rest of the steps: in the search bar in dev tools search for the word "Stream" -> open the file that has the Eval ID (random set of numbers/letters you see in the URL) -> and then look for a Status Code.

zealous sparrow
echo aurora
echo aurora
#

Need to wait to use again

muted timber
#

i waited one day

#

i text for fisrt time this day

#

15 minutes ago

echo aurora
#

I'm not sure whatyour Status Code is

cosmic salmon
muted timber
zealous sparrow
fossil fable
#

3pig broken smh

echo aurora
# muted timber

Okay yeah it's rate limit for why you're hitting this error

muted timber
#

Yes, but when I chat with him, he lets me use it, and I don't want to because I have all my information on that chat, and he already has all the information from all the files, because I have over 20+.

muted timber
fossil fable
#

can ppl stop calling chatbots he smh

cosmic salmon
echo aurora
fossil fable
#

where can i smooch off 3pig access whilst it's not working on lmarena

muted timber
#

That's why yesterday he told me to wait 49 minutes (last night), and now I have to work continuously, it will be much harder for me this way... but what can you do...

cosmic salmon
#

I'll try to log out and change my browser to see if this issue still persists or nah

zealous sparrow
echo aurora
cosmic salmon
cosmic salmon
long jackal
#

Is there any way to upload files other than images for the AIs in LMaren?

half mist
#

ChatGPT’s new image model just released!

zealous sparrow
#

or well

#

gpt image 2

half mist
cloud zinc
#

nano banana pro is better

half mist
#

Also is way faster

#

than gpt image 1

fossil fable
#

wait did it

half mist
zealous sparrow
half mist
#

It’s out right now

cosmic salmon
fossil fable
#

holy sh t it's releasing

but it's not out right now is it

openai discord just says this though

half mist
half mist
#

It’s out

#

I tested it myself

fossil fable
half mist
grand flame
#

bruh broke again

zealous sparrow
half mist
#

Anyways, here is an image made with gpt image 2

zealous sparrow
fossil fable
#

so openai launches it on the one day of the year i need image edit

#

...

half mist
compact flame
zealous sparrow
#

who has spongebob rights

fossil fable
fossil fable
half mist
fossil fable
#

at least now we know to rush before it's locked up

#

we didn't get to rush sora 2 because we thought it'd be normal

cloud zinc
#

just use nano banana pro

zealous sparrow
magic ravine
half mist
#

Here is an image of Donald Trump with Sam Altman using GPT Image 2

zealous sparrow
#

paramount is already planning to sue if they see this

compact flame
grand flame
fossil fable
cloud zinc
zealous sparrow
#

trust

half mist
fossil fable
#

let me try and get it to expand an image like nbp can do

grand flame
fossil fable
#

nbp lets me seamlessly expand images to 17:6 for use as a profile banner

i think i'm finally gonna update my banner after like 2-3 years

magic ravine
fossil fable
magic ravine
half mist
fossil fable
#

i dont use twitter so i have no reason to generate 500 political images but enjoy the gold rush people

fossil fable
half mist
fossil fable
#

that's why i said 2.5fig

zealous sparrow
#

you know why

#

google is dropping nano banana 3 flash today

#

lol

fossil fable
fossil fable
#

are you sure

zealous sparrow
half mist
zealous sparrow
#

tuesday is new day

zealous sparrow
#

it hasnt dropped yet

#

google always drops stuff on a tuesday

torn mantle
#

image 2 model is already out for some people

#

also gemini 3 flash prob tomorrow?

zealous sparrow
#

and there is already an entry on vertex for 3 flash/fiercefalcon

half mist
void elm
#

???

#

how is that a question

zealous sparrow
neat apex
#

the Flash 2.5 09 have near 2.5 Pro level (since always when they release a new flash model)

#

and looks nothing changed, 3.0 Flash must have a very near level to 3.0 Pro, but without overreasoning issue now

magic ravine
#

Oh, image 2 has just released?

zealous sparrow
#

yea

magic ravine
#

How is it? Still nerfed with fictional characters?

neat apex
#

nope, they fixed that issue

zealous sparrow
magic ravine
zealous sparrow
#

this is neat too

torn mantle
#

flash wen 😖

zealous sparrow
#

this could be images v1 im not sure

#

i dont know if i have v2 yet

grand flame
#

hell nah

#

status 200 btw

zealous sparrow
#

also flowith tweeted this

#

so flash might be today

grand flame
cloud zinc
torn mantle
echo aurora
# grand flame hell nah

Thank you for sharing. I'm not surprising the infinite generation bug wouldn't have an error status code associated with it

zealous sparrow
#

they are already prepping

#

its this week sure

#

but when

neat apex
#

i am hoping they launch Gemma 4 (what is very likely soon) and someone finetune it to be responsive like the new generation at all

#

it would have same level of Opus 4.5 lmao

zealous sparrow
half mist
zealous sparrow
#

I give em 4h to release it

#

if it aint released

#

oh well another day

autumn snow
#

hwo to use promt htere

empty stump
#

8% chance it releases today

zealous sparrow
neat apex
#

i would say 50%

zealous sparrow
#

logan tweeted 3 thunders today

neat apex
#

they only delayed Gemini 3 pro a lot to make it 100% consistent

grand flame
#

my chat died

#

sad

grand flame