#general

1 messages Β· Page 215 of 1

sterile tartan
#

Like u?

zealous sparrow
#

its a flash model so obv its worse than 3 pro

sterile tartan
#

Just kidding

zealous sparrow
#

Here's the thing

#

we are testing the EARLY versions

#

the final flash could be worse or better

whole sundial
torn mantle
#

but expected better tbh

zealous sparrow
#

OpenAI already put hazel as a new image model into LMArena

torn mantle
#

lets see the cost first

zealous sparrow
#

its OpenAIs so its prob gpt img 2

whole sundial
#

officially called gpt image 1.5

sterile tartan
#

@torn mantle do you remember we were used to chat in pplx server?

whole sundial
#

there are two editing and two image gen-only checkpoints on lmarena, but more exist on OpenAI's side

sterile tartan
zealous sparrow
#

Here's the thing with it

#

It had a style that identified it as gpt img 1

#

When i saw it i knew what model it is

whole sundial
#

it's good at making studio ghibli edits, reproducing album covers, putting a yellow filter on images, and not much else

sterile tartan
#

@whole sundial is Nova Actually Good?

whole sundial
#

nova canvas image gen is terrible, editing is even worse, stable diffusion img2img would do better

zealous sparrow
#

hazel-gen-2 looks like a solid model ngl

sterile tartan
#

The Benchmarks are kinda good on artificial analysis

whole sundial
sterile tartan
zealous sparrow
#

seahawk and skyhawk models are experiencing deployment failed errors, so we wait for pineapple to fix

zealous sparrow
sterile tartan
quartz light
zealous sparrow
#

i think you meant silentnova @quartz light

sterile tartan
#

@whole sundial did u test Tensor 1.5 More?

whole sundial
zealous sparrow
#

and we suspect its amazon

#

because of nova in the name

whole sundial
#

sounds like google or grok to me

zealous sparrow
#

Why would we need 4 google models

#

on battlemode

sterile tartan
zealous sparrow
#

google usually drops one or two stealth models for battle

#

never 4

whole sundial
atomic lagoon
#

Is movementlabs good? idk who its done by and what its best at

torn mantle
sterile tartan
torn mantle
#

i see

#

if its from amazon then its not that good

sterile tartan
#

I thought is good because of Benchmarks

#

But seems like it might be not after all

zealous sparrow
#

im tryna get silentnova to get the info of who made it

torn mantle
#

the issue is that they are training for the benchmarks

#

not for making a good model

sterile tartan
#

Is a paradoxical situation

torn mantle
#

:/

sterile tartan
#

They can't afford to lose on Benchmarks

zealous sparrow
#

@hardy swallow

sterile tartan
#

The world basically judges by it without experience

whole sundial
#

<@&1349916362595635286>

main nexus
#

Dude every time I ask claud 4.5 opus what model it is it says they don't have a model πŸ™

zealous sparrow
#

W mod

#

pineapple thanks

sterile tartan
#

When i ask GLM it says im claude

#

πŸ’€

zealous sparrow
#

wait a sec

main nexus
zealous sparrow
#

nvm

#

wanted to check if silentnova is an img model

#

it aint

zealous sparrow
#

lmao sonnet what

pale rain
zealous sparrow
#

why is silentnova so rare

#

Ok, be honest. Anyone got silentnova in textarena yet?

zealous sparrow
#

its a textarena model

old garden
#

i haven’t used it

echo aurora
zealous sparrow
#

I still refuse to believe holo-scope is google..

#

Why would google make 3 models for Battle??

whole sundial
#

baidu? minimax? some other chinese ai company?

zealous sparrow
#

So like

#

frame-flow used to say its google right

#

later changed to be ERNIE

#

this migh be the same case

sterile tartan
#

U can't predict by asking them

#

The Ai Identity Crisis

weary galleon
zealous sparrow
#

Their coding results are what reveals them

hardy lion
austere sundial
#

Metter btw

sterile tartan
hardy lion
weary galleon
weary galleon
weary galleon
fickle venture
#

Looks real to me

hardy lion
#

ok, sorry if it's a joke that's ok, just I've seen a lot of people spreading fake LMArena scores around recently and it got on my nerves

fickle venture
weary galleon
fickle venture
weary galleon
hardy lion
# fickle venture I mean who would believe it

sadly people do 🀦
Posts like this got a bunch of comments and retweets https://x.com/teslaownersSV/status/1997008559469650261

🚨 BREAKING: As of December 4, 2025
Grok 4.1 has taken the #1 spot on the LMArena Leaderboard
Highest Elo ever recorded.
Thinking mode dominating. Fast mode already breathing down its neck.
The rest are playing catch-up.
xAI just dropped the mic.

quartz light
fickle venture
#

Is it that good?

quartz light
quartz light
fickle venture
#

I remember I went to their official website and there was no model

weary galleon
quartz light
#

they revamped ui i hate it

#

bloated

fickle venture
fickle venture
#

Some times they just write it on output

weary galleon
obsidian spire
#

I accidentally deleted a chat. Does anyone know how to fix it? Thanks.

whole sundial
#

<@&1349916362595635286>

obsidian spire
echo aurora
quartz light
#

@zealous sparrow

#

☹️

sterile tartan
zealous sparrow
quartz light
zealous sparrow
#

nickname

#

use the note steg on silentnova

lusty tinsel
#

when will be able to upload files other than images in lmarena for coding etc...? is that in the plans?

quartz light
#

@zealous sparrow

#

@zealous sparrow

#

SAME EXACT RESPONSE AS SEAHAWK

zealous sparrow
#

okay jee-

sterile tartan
#

Flash 3 is definitely coming

zealous sparrow
#

ok but like

#

what can holoscope be

#

seahawk and skyhawk are flash models

#

then what is holo-scope??

compact sleet
#

what do you expect on X anyways

#

it's full of bots and elon praisers/haters

sterile tartan
#

Grok 4.2

#

Possibly GLM 5

compact sleet
#

Grok itself aint a bad model from the pure perspective of a casual user, Iunno about programming

weary galleon
sterile tartan
#

It could be a new qwen model too

hardy lion
sterile tartan
#

πŸ’€

craggy cloak
#

hello does this site contains subscription ?

zealous sparrow
weary galleon
compact sleet
sterile tartan
compact sleet
#

it is what it is

sterile tartan
#

πŸ’€

weary galleon
craggy cloak
#

so if i have an account do i have unlimitied chat?

weary galleon
#

There are plenty of limitations

sterile tartan
weary galleon
#

Expacialy for opuses, and nano bananas.

quartz light
#

new lmarena feature

#

in testin

craggy cloak
#

gotcha so the limits in chating wise is a bit low?

sterile tartan
sterile tartan
weary galleon
sterile tartan
#

Mainly for Opus 4.5 and Nano Banana Pro and such expensive models

craggy cloak
#

ah yes

#

many thanks guys

weary galleon
sterile tartan
#

I thought they count together

weary galleon
quartz light
# quartz light new lmarena feature

@echo aurora this should not override the last used setting if set manually, please tell team to fix this before they push this update ☹️

sterile tartan
golden ocean
#

grok imagine 5

#

prompt: "turn his frown upside down to turn it into a smile and make him purple and make his team green"

burnt sinew
torn mantle
#

idk whats the prompt here, but if its google then we can assume its flash lite

quartz pike
#

yall. how good is ernie 5.0?

#

is it ass or nah ass?

celest lark
#

Idk if you notice, but does it seem like that gemini nano banana pro on lmarena is down almost every day?

quartz pike
fleet lintel
#

Any news on new gpt model?

stray aspen
#

what new gpt model

stray aspen
#

why are you even asking

quartz pike
#

idk

#

just wanted to know the opinion of someone who tried it

fleet lintel
cloud zinc
#

no its thursday

fleet lintel
slim spire
#

What is this new gpt garlic model thats being talked about

zealous sparrow
slim spire
supple vector
#

LMARENA API WHENNNNNNNNNNNNNNNNNNN

#

and free (pls)

zealous sparrow
#

LMARENA pays for other apis already

sweet topaz
#

πŸ‘‹

zealous sparrow
#

it will never get an API

supple vector
#

i gotta stick to openrouter then

#

n

#

also

shell crypt
#

πŸ‘‹

supple vector
#

anyone know any free unlimted video gen apis

#

or do i have to selfhost on comfyui

#

why are there so many bots

zenith scarab
#

πŸ‘‹

upper locust
#

πŸ‘‹

pale reef
#

πŸ‘‹

supple vector
#

πŸ‘‹

#

why everyone spam ts emoji

quartz light
#

its gemini 3 flash

half pelican
#

πŸ‘‹

quartz light
#

???

surreal creek
#

what was the prompt? is holo-scope another ERNIE then?

quartz light
quartz light
quartz light
#

@zenith scarab @proud sail @shell crypt

warm zodiac
#

<@&1349916362595635286> the hand wavy guys are bots It seems? at least some of them

zealous sparrow
quartz light
torn herald
#

πŸ‘‹

quartz light
warm zodiac
#

really weird

quartz light
#

they're all bots

ornate ginkgo
#

there are three video arena groups how i know they use which ai model ai tool

quiet spire
#

Fast is fine but accuracy is everything

custom chatbots for support, onboarding and communities
automation agents for daily tasks and workflow operations
RAG search tools for documents, knowledge and internal data
speech-to-text pipelines for meetings, summaries and reports
content automation tools for writing, planning and rewriting
AI integrations with major platforms and APIs
small custom AI tools for everyday use

I am an AI Developer focused on delivering stable, high quality results.
feel free to reach out anytime if you need support on your AI project.

quartz light
#

what the #### <@&1349916362595635286>

stray aspen
#

what

neon idol
quartz light
#

and bots

stray aspen
#

lmao

neon idol
knotty fable
stray aspen
#

thats messed up

#

@echo aurora

neon idol
#

@torn herald imagine not knowing how to use a self bot. Looser πŸ₯€πŸ˜­πŸ€‘

zealous sparrow
knotty fable
#

If anyoine don't read spanish....."you have a transfer of $100,000 at Banco Pichincha and it is completely secure." Typical scam phrasiology.

zealous sparrow
knotty fable
neon idol
#

U forgot a thing

#

Ip grabber?

compact flame
#

Guys did chatgpt 5.2 release yet?

neon idol
knotty fable
#

Well I pointed out that multi account guy to Pineapple and the other mod, what I know they did not do nuffin.

compact flame
zealous sparrow
compact flame
#

Instead of this December

knotty fable
compact flame
#

But anyways I hope 5.2 is gonna be good

#

Though it seems kinda rushed so idk if it's gonna be better than gemini

proud bobcat
#

theyll probably say "oh its delayed because we need to refine it"

#

or theyll release it last minute

zealous sparrow
#

im sharing bc

#

of cool design and 3d

#

also there is no victory

#

you just die

proud bobcat
#

no way

zealous sparrow
#

do you know who dis

#

tho

#

walmart on the left
original on the right
admit its identical kind of

proud bobcat
#

it is

#

in a way

fickle venture
zealous sparrow
#

4.20 will flop

#

i can tell

#

grok 5..

#

If it does good somehow.

short ermine
#

whats up fam

torn mantle
#

grok 4.2 will be sooooooooooo bad

#

we can already tell

#

i mean he usually hype slop models but i havent seen him sayin anything like 'IT WILL BE THE BEST MODEL'

zealous sparrow
#

seahawk can have a stroke

#

i just saw it

stray aspen
#

elon musk yapping again

proud bobcat
#

please dont be ass

stray aspen
#

only thing xAI is good is at is UI design for ai websites

zealous sparrow
#

skyhawk is my beloved

fleet lintel
queen veldt
#

Gpt 5.2 just broke lmarena elo rating leaderboards

#

And grok 5 imagine is #2 in image editing

#

And also the gpt image 2 is in top 5

neon idol
#

@queen veldt wth are u talking about?

queen veldt
#

People spreading misinformation in the last few days

#

X is full of posts

#

This grokipedia is terrible

knotty fable
#

'He' considered wikipedia to be too left leaning [wiki as quite bad but not for that reason.]- so he launched an alternative.

weary galleon
#

Grokipedia is a piece of crap

torn mantle
#

agree

#

its so bad :3

neon idol
#

Never tried

#

And idc

#

:3

echo aurora
#

Hey everyone - sorry I was in a meeting and couldn't respond to the πŸ‘‹ reports. I've gone ahead and let our mods know to start removing the "hellos/waves" content from the #leaderboards channel, so we'll start to be more active in keeping the discussion in #leaderboards about leaderboards. In terms of scams they're making with Video Bot we also want to be removing this content, but would note this stuff is a bit more difficult to spot (compared to the other stuff we're modding out), but yeah if you do come across content that has bad intentions don't hesitate to ping our mods.

For those that are a little suspicious we won't be moderating them out, unless they start breaking server rules; being disruptive, etc. What this looks like can be a bit subjective, so don't hesitate to let us know if you think that's happening. But overall I don't want to start booting people from the server just because they're sending a πŸ‘‹ emoji (unless it continues to escalte into something that we find annoying). Hope this helps. Let me know if you have any thoughts or feedback on this. At the end of the day we want to build a server that benefits the community, and you all sharing your thoughts will contribute to that.

neon idol
#

Elon musk stuff = low quality stuff

viral cedar
neon idol
#

They were spamming with self bot

#

I tracked all of their messages and never sent a message different from the wave

weary galleon
neon idol
#

Also the member doesn't have any role that increase the % of bot user

#

And also no pfp

#

And we are at 78% of a possible ai user

#

Then, do whatever u want. U are the leader here not me :>

echo aurora
#

Now if these bots were causing harm, that's a different story.

#

But at the moment it seems like a lot of πŸ‘‹ in leaderboards (which we're now going to be actioning).

#

This can change though, I'm very very much open to changing how we do things here if it's what the community watns to see.

neon idol
weary galleon
echo aurora
echo aurora
weary galleon
echo aurora
#

Would note too these server rules can bend a bit depending on the context. If two people are having an in-depth convo about leaderboards, and the convo starts to go into a new direction, for the most part we're not going to step in. However, if two people immediately hop in there and start discussing something unrelated, that'll be treated differently.

A lot of the moderation stuff we do is going to boil down to ✨ ameowsparkle it depends ameowsparkle✨ .

echo aurora
#

But again want to reiterate - let me know what you all think. I'm happy to make changes.

weary galleon
compact sleet
#

This is a bit ironic innit?

#

I just noticed

#

And the top one was also the same

weary galleon
neon idol
#

It come out???

#

Oh

compact sleet
#

Why would you do that?

weary galleon
neon idol
#

Naaaaah

burnt sinew
golden ocean
neon idol
weary galleon
golden ocean
compact sleet
misty vault
hollow flicker
#

why does LMArena ai have rate limits?

viral cedar
#

think thats common sense

hollow flicker
#

the rate limits are insanly high

viral cedar
hollow flicker
#

idk

#

honestly

viral cedar
hollow flicker
#

oppsie

#

i kinda use it so i dont have to pay for claude

hollow flicker
weary galleon
hollow flicker
weary galleon
whole sundial
#

<@&1349916362595635286>

#

<@&1349916362595635286>

queen veldt
#

It's not an ad πŸ’€

#

It's spintronic memory

cloud zinc
#

what do u know about spintronic memory?

queen veldt
#

But it's expensive and impossible for regular humans to achieve

cloud zinc
#

sounds like scam

queen veldt
#

I've just read about it

cloud zinc
#

its been since 2004

queen veldt
#

It's super expensive tho so i don't see it in near future

#

But we got the ibm quantum computers

#

You can get 10 mins of ibm quantum computer for free πŸ˜‚

#

Datacenters aren't the future

#

They must find some other way to make ai power-efficient

#

What would i do for 10 mins with quantum pc

surreal creek
#

voted for ChatGPT-4o over it though, lol

golden ocean
# queen veldt What would i do for 10 mins with quantum pc

Ten minutes on a fault-tolerant, large-scale quantum computer capable of running Shor’s algorithm at meaningful key sizes would be enough to collapse most classical public-key systems if the attacker had already captured encrypted traffic. RSA and ECC would fall. Any stored handshake, any archived encrypted session, any intercepted key exchange becomes readable.

Such a device does not exist. Current quantum machines cannot factor RSA-2048 or break ECC; they are orders of magnitude too small, too noisy, and too slow.

If a future machine reached the required qubit count and error-corrected stability, preparation by pre-recording traffic would make the ten-minute window sufficient to run the needed quantum circuits and extract private keys, which cascades into broad compromise of internet security for any system not migrated to post-quantum cryptography.

sand trail
#

For how long does LMArena host a website that was generated at my behest?

echo aurora
undone ravine
#

How can I create a one-minute video using AI for free?

#

Please πŸ™πŸ™

unreal hatch
#

What year do yall think OpenAi will release Garlic?

pearl drum
#

...This has happened to me ever since today, every model says "something went wrong" even for test messages. Did anything change overnight?

whole sundial
fickle venture
#

This one has free models kinda

#

They might have a video generator

fickle venture
#

Maybe refreshing could fix it or there might be a bad word in the prompt, So lmarena refused to send it.

fickle venture
astral blaze
#

holy
What did they do to make the site even more unusable now

native yarrow
#

what does NB pro on open router even do?

#

its obv not uncensored right?

stiff nymph
#

hello

obtuse smelt
#

hi

lapis imp
#

So this might be a stuuipd question someone mentioned token limit? After you get the retry how lognn do you agve to wait 1 hour or 2hr or? Is the toke limit aka the rate limit?

#

Hope you'll understand what I'm trying to ask lol.

#

Also forgive my spelling mistakes

burnt flax
#

Hello Andre and I'm all here cuz I want to learn how to edit in create videos and edit photos and images I'm trying to create a podcast and I want to be able to edit my own material as well as create some material myself for skits

keen beacon
#

Told u

jovial sapphire
#

hi guys

#

have u ever seen an edit like this?

keen beacon
jovial sapphire
#

like this style

keen beacon
#

No

jovial sapphire
#

like tracking style

jovial sapphire
#

is it like beautiful

keen beacon
#

Ya

proud bobcat
#

But

#

It has a little less guardrails

native yarrow
#

ah

timid yew
#

how to generate video in LM arena?

desert abyss
keen beacon
#

I figured out how to unlock PokΓ©mon

jovial solar
#

Why grok don't have images button

cloud zinc
plucky sparrow
#

did gpt 5.3 come out or was it fakenews as expected?

#

*5.2

modest prism
#
poll_question_text

Will Gemini 3 flash be better than Gemini 2.5 pro?

victor_answer_votes

3

total_votes

4

victor_answer_id

2

victor_answer_text

No

vivid coral
#

Has anyone found any anons in the search arena? I haven't. Weird there's 1000 anons in text arena but not search arena

hot pebble
#

Hello guys, is anyone else experiencing β€œ Something went wrong with the response, please try again later β€œ error mid chat ?

digital forge
#

Has a 50-minute waiting time been added to the regular Claude Opus 4-5 session?

hot pebble
hot pebble
vivid coral
#

Claude has the worst rate limits on this app, by far. They are literally run by communists. That's why they are going for regulatory capture. Government approved monopoly and they will be able to charge insane rates for unlimited or increased limits.

digital forge
# hot pebble Waiting time ? After how many prompts?

I don't know exactly how many requests, but I created a new chat with Claude 4-5 and, after a few messages, I was blocked and can only use the chat again after 50 minutes. Until an hour ago, this waiting time didn't exist in this version, but it did exist in the Claude 4-5 (Optus Thinking) version.

vivid coral
#

We need a full on boycott of Claude to send those commies a message we won't bow down to their crap

hot pebble
bleak lake
#

Then log out and log in back

hot pebble
hot pebble
bleak lake
#

moreover log out and log in back

#

usually it works for a ton

hot pebble
digital forge
hot pebble
bleak lake
bleak lake
#

and clear cache

hot pebble
bleak lake
#

you should try both

hot pebble
#

I cleared my cache memory, logged in again…. Typed a new prompt….

#

@bleak lake any other suggestions? πŸ˜Άβ€πŸŒ«οΈ

#

I also forgot to mention, switching to different models yet pops up same error…. So is this a problem with the Lmarena itself ?

hot pebble
chilly nexus
#

Number of captcha is insane, i have to fill a captcha for each new message

hot pebble
#

@quick jackal bro you here?

weary galleon
bleak lake
#

I did it for you

hot pebble
vivid coral
#

He's probably asleep, its 2am, lol

hot pebble
thorny schooner
#

You know a glitch is bad when even after a full-on reset it does not go away ( was turning on developer mode)

grave plaza
#

hii

hot pebble
#

Now this is seriously bothering me alot….. after every few prompts i am getting same errors….

If the limit is reached, it should mention that.
If i switch to another model, it should atleast respond and not show same error….

compact flame
#

Idk if he fixed it or not

#

Probably could be due to bad internet maybe

valid dust
#

hello, came here for the fun video making

hot pebble
#

But every time i open a new chat i have to load the data from previous chat again, and if there is a limit of prompts, then i am already using multiple prompts just by typing old data

hardy lion
coarse cradle
#

How can I create a video with sound?

hardy lion
valid cobalt
#

The overlapping brand icons is a chart crime and LLMarena should be ashamed of themselves. I can promise you, nobody is going to be fooled. You should fix it or lose all credibility

fringe fern
#

hello

valid cobalt
#

grok is second or it isn't. if it's second, it's brand icon should overlap anthropic. Simple as that.

hardy lion
#

it was generated using https://app.flourish.studio/ we didn't set that deliberately it's probably like alphabetical or based on the placement at the beginning. The video has the names in order the ranking is clear at all times.

atomic lagoon
#

oH SORA

whole sundial
queen veldt
obtuse smelt
nova pivot
#

Please why is it I can't choose my video model

brisk turret
#

when flash and flash lite?

#

is flash lite being tested atm?

queen veldt
#

Why are y'all so hyped for flash?

#

It's a smaller model

golden ocean
#

fr

#

last time too when gemini 2.5 flash released

#

everyone scking it off with youtube thumbnails like "the new best model" like tf its a smaller model

#

i think its the poor people community celebrating

flint fog
#

Hello, I'd like to report a problem with the LM Arena software, specifically with the Nano Banana Pro model. Sometimes the model stops responding and displays an error message. This has happened quite frequently recently. I hope this issue can be fixed. The same problem also exists, but to a lesser extent, in Cluade 4.5 Opus Thinking.

elder merlin
#

general

queen veldt
#

It might be the problem with nano banana itself

sterile tartan
flint fog
sterile tartan
flint fog
sterile tartan
quartz light
quartz light
#

"its a wrapper for glm 4.6"
"its a wrapper for gemini diffusion"

1 million-token context window

MANGOMANGOMANGOMANGOMANGOMANGO67

torn mantle
#

its a wrapper

#

for gemini flash

fickle venture
#

In this video, I'll be discussing the recent launch of Gemini 3.0 Pro and the appearance of new models called Skyhawk and Seahawk on LM Arena, which are likely early checkpoints for Gemini 3.0 Flash. I put Skyhawk to the test with my King Bench questions to see how it stacks up against competitors like GPT-5.1 and Sonnet.

--
Key Takeaways:

...

β–Ά Play video
fickle venture
#

But wait what is tensor

#

I have never heard this name before

torn mantle
#

there are tons of good free TTS and small models and hes still using this

golden ocean
#

and intro

torn mantle
#

yea

#

that intro is annoying as hell too

#

I know he thinks that the voice now is kinda his signature but he needs to c hange it

fickle venture
torn mantle
#

how are you 100k and you cant afford a good TTS

#

even if you dont want to there are gazillions of free TTS

sterile tartan
waxen fern
#

Is rate limits still there in Lmarena???

sterile tartan
#

Unless Ai becomes absolutely cheap

waxen fern
sterile tartan
#

U can try Yupp Ai if you want

waxen fern
#

@echo aurora please remove lmarena rate limits

sterile tartan
#

Compute is expensive bro

#

πŸ’€

torn mantle
#

like so many

#

you can even run them locally

sterile tartan
torn mantle
#

brotha

sterile tartan
torn mantle
#

i dont have a long list

#

i gave you two

sterile tartan
torn mantle
#

should be enough

vivid coral
#

Happy Gemini 3 Flash day everyone. Is it on the arena yet?

sterile tartan
sterile tartan
torn mantle
#

there are better but i dont have a list

sterile tartan
torn mantle
#

these are more lightweight

sterile tartan
torn mantle
sterile tartan
#

No worries I don't need for now anyways

sterile tartan
vivid coral
#

It's already been on there as an anon. It should be coming out today, according to sources

torn mantle
#

its an api wrapper based on microsoft api

#

its good

sterile tartan
#

Gotcha

#

Tysm again

#

May you be more useful in future

#

As well

torn mantle
#

you can try it here

#

ava is a good voice actor + andrew

#

there are like some 4 decent onces

sterile tartan
sterile tartan
torn mantle
#

ava + andrew are good tho

torn mantle
sterile tartan
#

Now you are more useful

torn mantle
#

i kinda like andrew better ngl

sterile tartan
sterile tartan
quartz light
torn mantle
torn mantle
sterile tartan
#

Not bad at all

#

I could sleep on this

#

There is elevenlabs and minimax speech too

quartz light
torn mantle
sterile tartan
torn mantle
#

lol since when

sterile tartan
sterile tartan
#

Are these yours?

sterile tartan
#

πŸ’€

fickle venture
#

It was sooo goood

sterile tartan
#

Maybe that's why it got removed

#

Alot of people would had been using it

torn mantle
fickle venture
sterile tartan
fickle venture
#

Hugging face is just so good they provide free working open source models

quartz light
#

but they're good

quartz light
sterile tartan
fickle venture
sterile tartan
fickle venture
torn mantle
#

i think its 12gb RAM

fickle venture
sterile tartan
#

There is also colabs i guess

#

Kaggle

torn mantle
#

its handy

sterile tartan
#

But idk if they allow to host ai

sterile tartan
fickle venture
sterile tartan
torn mantle
trim portal
#

Hey sorry to bother does the model in direct mode also hide their identities? I asked gpt 5.1 from lmarena and it claims to be gpt 4.

I used real gpt5 to evaluate the response from the model, real gpt5 said the response is clearly gpt4

#

Am confused

sterile tartan
sterile tartan
#

The ai identity crisis

#

The models are as stated

trim portal
#

I asked gpt5 to make some reli hard reasoning problems

#

give that to the model and fetch the response back to gpt5

#

Gpt5 claims that's clearly gpt4

#

Maybe that's not good validation?

sterile tartan
#

Any ai will say that

#

Is not good validation

#

And asking a model for its identity is just not Efficient

fickle venture
#

Has this released to Lmarena?

echo aurora
sterile tartan
fickle venture
keen beacon
#

why now in nano banana have limit for make images?

keen beacon
fickle venture
sterile tartan
keen beacon
fickle venture
#

So it won't be helpful

sterile tartan
#

Pro users get 25 minutes

echo sinew
floral quarry
#

Why I got error when I upload image made from lm arena and I want to edit it with ai?

neon idol
#

Not here lil bro 1emojipat_pat

bright shard
#

@echo aurora Nano Banana Pro is having many errors again

echo aurora
bright shard
glass zealot
#

this one little error is so annoying… i think it depends on time when i use lmarena. sometimes my generations are consistent successful and sometimes i get this error 9 times per 10 requests

twilit delta
#

what fix of this

torn mantle
#

antigravity gotta be the worst IDE ever

#

omg

balmy mist
#

did it get worse?

#

cause it was doing crazy stuff that I could not even do with other IDEs and ai clis

torn mantle
#

The file on disk STILL shows the broken syntax! The multi_replace_file_content tool is reporting success but the changes are not persisting to disk. This is a serious issue with the editing tool.

#

even opus 4.5 is furious lol

#

ive been trying to fix this issue last 2 hours

#

IDE tool call is so messed up

#

"This is a serious issue with the editing tool." lol

#

finally opus fixed it

#

took me like 40% of opus usage

#

holy

proud bobcat
#

they put gpt here and not deepseek??

#

you gotta be kidding me

torn mantle
#

cursor new model

#

probably released today

hardy lion
#

hi @tulip tusk please read #1397655624103493813 and use the video-arena channels for video requests, thanks!

fleet lintel
iron glen
#

is there a way to upload 2 images to generate one video?

echo aurora
wispy cloak
#

HI everyone, hope to learn new stuffs here.

echo aurora
proud bobcat
proud bobcat
hardy lion
echo aurora
# bright shard For now, I've only used the regular version; I haven't tested the 2K version muc...

Gotcha. Unfortunately, we have been experiencing high error rates with this model. The team has been made aware and are looking into solutions to lower this.

cc @glass zealot

sometimes my generations are consistent successful and sometimes i get this error 9 times per 10 requests
It's difficult to say what's causing this error as the Something went wrong is a generic error message which can be caused for different reasons. However, what you mentioned here points to rate limit being the culprit.

neon idol
#

<@&1349916362595635286>

queen veldt
#

I've just tried it

dusk phoenix
#

Hello!
Can someone tell me what this model "Hazel-gen-4"?

#

and how to select it in direct chat in the image generator? i found it only in the battle mode

heady pine
#

gotta follow the rules πŸ˜ƒ

zealous sparrow
#

if this beats gemini 3 pro the bench is rigged

#

also new battle model that has a search output [searcharena only]

#

first battle model for searcharena in a while

fossil fable
#

burn openai

zealous sparrow
twilit delta
#

what fix of this

zealous sparrow
#

Can't

#

this is to prevent people from spamming to get models they want on battle

neon idol
echo aurora
echo aurora
echo aurora
# neon idol Hru?

Doing well. We had our company holiday party last night so a little tired, but overall doing well.

#

Yourself?

zealous sparrow
neon idol
#

Nothing special (⁠≧⁠▽⁠≦⁠)

echo aurora
neon idol
#

For nothing

torn mantle
#

but its in the metadata

#

so its probably coming tomorrow

whole sundial
#

in fact, if you use that model right now it still identifies itself as menlo by big sur ai

zealous sparrow
#

why would it be on searcharea

#

arena if its grok 4

#

I meant battle.

whole sundial
#

...because they want to evaluate their new search model before release? like any other model on lmarena?

zealous sparrow
whole sundial
zealous sparrow
#

Will Menlo always mean grok?

whole sundial
#

it's an lmarena-only designation anyways

zealous sparrow
#

Alright then

#

they removed a raptor model from textarena

hushed gyro
#

Uhh chat did chatgpt release adult mode?

proud bobcat
#

5.1 is just a finetune of 5

#

theres no way 5.2 will be eons better magically

#

if 5.2 is a flop i think its safe to say openai will take a backseat in the market

zealous sparrow
#

than be google and make people wait

#

google took months to make a model and people were happy

#

openAI took a few days to make one and people were unhappy

sterile tartan
#

@whole sundial is this good?

cloud zinc
hushed gyro
cloud zinc
hushed gyro
cloud zinc
#

its gonna be behind ID

zealous sparrow
cloud zinc
#

good, we dont need little kids using it

zealous sparrow
#

OpenAI takes the risky route for operation, gemini 3 pro releases, Oh no! Panic panic, we need a model that beats it so we don't get forgotten.
Instead of, oh congrats, now we take time to work on a model to beat it.

meager harbor
#

Does anyone else feel that LM Arena doesn't punish lying enough?
People are more likely to vote for an AI that makes stuff up than for one that admits it doesn’t know!
There should be an option to justify your vote by saying the other model lied, so LM Arena can add a β€œtruth” category where the highest score goes to the model that lies the least.

proud bobcat
#

that would be cool

#

you could prob add it to feedback

proud bobcat
#

id give you a thumbs up

proud bobcat
#

openai were the PIONEERS

cloud zinc
#

openai needs those datacenter up asap

proud bobcat
#

datacenters wont fix their ass architecture

meager harbor
proud bobcat
#

gpt 5's architecture is plain ol ASS

#

ineffecient

#

bulky

#

too dense

cloud zinc
#

how u know

echo aurora
proud bobcat
#

4o was perfection

#

sure it wasnt a benchmark king, but it was a reliable model

#

4o consistently hallucinated less for me than 5

#

5 came out and despite them saying that it hallucinated like 50% less or whatever

formal dagger
proud bobcat
#

for me it hallucinated around 80% of the time

#

and that is not an exaggeration

zealous sparrow
#

he legit said it

proud bobcat
#

openai had such a good oppurtunity with 5 and they blew it

#

now deepseek is basically taking over their spot as the reliable workhorse

zealous sparrow
#

it wont beat anything

#

i would argue grok beats it

proud bobcat
#

its not even the frontend

#

the model just sucks

#

they need to do a good rerouting of their entire company

zealous sparrow
#

frontend still sucked

#

but

zealous sparrow
#

also btw

#

me and nickname figured out why google put 4 models into battle

#

2 are flash-lite [one thinks, one doesn't]
2 are flash [one thinks, one doesn't]

torn spear
#

Is there anyone here looking for developer?

torn mantle
#

πŸ˜–

zealous sparrow
hot pebble
torn mantle
zealous sparrow
#

wait

#

2.5 flash tts is being uh

#

removed

#

so people think its today

rain drum
#

Can anyone tell me how to fix this

torn mantle
#

when do they usually release their products

zealous sparrow
#

when did gem 3 pro release

torn mantle
#

no i mean time

#

is it still possible to be released today or nah

zealous sparrow
#

wait let me uh

#

compare with g3 pro release

#

it was a tuesday @torn mantle

#

Wouldn't get your hopes up

fleet lintel
frosty lava
#

Why do cursor have access to new model before anything else ?

zealous sparrow
fleet lintel
#

tweet is confusing between tts and flash model

zealous sparrow
#

because integrated-info and holo-scope might be flash-lite models

proud bobcat
#

gemini 3 flash is gonna be so good

zealous sparrow
#

yeah but the thing is

#

tts models are seperated from the model arent they

fleet lintel
queen veldt
#

Why are people hyped about the flash model

#

????

#

It's a flash model??? Nothing special

fleet lintel
zealous sparrow
#

guys did skyhawk and seahawk get removed from codearena or are they just rare

#

i haven't gotten it for a few gens now

fleet lintel
#

very good chance (in fact willing to wager) that gemini 3 flash is going to be better than 2.5 pro

queen veldt
#

Or just the flash-lite one

#

Doesn't have thinking

fleet lintel
#

Lmarena needs to remove these useless Nova models. Total waste of tokens and energy. 😠

fiery vault
#

i want to generate ai videos

compact flame
#

Any news on chatgpt 5.2?

#

Cuz I think it was supposed to be released yesterday

sweet topaz
#

πŸ‘‹

zenith scarab
#

πŸ‘‹

upper locust
#

πŸ‘‹

pale reef
#

πŸ‘‹

gleaming roost
#

πŸ‘‹

sterile tartan
#

Let me break this chain

#

And make it rain

#

Gemini 3 flash will be SOTA 2

half pelican
#

πŸ‘‹

frosty lava
#

or this time it will be something else

sterile tartan
#

My prediction says

#

Google is Cooking

#

Like they cooked Gemini 3

neat apex
#

Ok i will stop now

hollow ivy
#

-# still 3 weeks of 45Β² left..

neon idol
#

<@&1349916362595635286>

frosty lava
#

honestly they should all focus on hallucinations

neat apex
#

Overreacth final boss

neon idol
#

@echo aurora Come immediately here

compact flame
#

Why did the chat just suddenly revive

sterile tartan
neat apex
#

Someone did a chain, but we stopped naturally

torn herald
#

πŸ‘‹

neat apex
#

Nahh, lets stop

compact flame
neat apex
#

We are giving slop to the new vicuna model they must be training

compact flame
#

So guys any news on chatgpt 5.2?

echo aurora
frosty lava
#

I don't think we actually need very capable ai if they hallucinate like that i want an ai that WON'T hallucinate cause its time wasting and risky

neon idol
#

Mass ban? 1YouSureDog

neat apex
echo aurora
neat apex
#

Since fine turning does that

sterile tartan
#

πŸ’€

#

What's going on

echo aurora
#

But yeah I don't want to go on a mass ban for people that πŸ‘‹

frosty lava
fleet lintel
#

i recommend 24 hours ban

sterile tartan
neon idol
weary galleon
neat apex
sterile tartan
#

#Pen + Apple =

sterile tartan
compact flame
#

Well maybe

sterile tartan
sterile tartan
compact flame
#

It had Gemini watermark

sterile tartan
neat apex
#

Yeah, i edged too much here, stopping now

sterile tartan
#

πŸ€”

echo aurora
weary galleon
compact flame
zealous sparrow
sterile tartan
#

Maybe is alright

#

Idk tbh

neon idol
sterile tartan
#

πŸ’€

weary galleon
neat apex
#

I already know pineapple, he is also joking a little too (most times at least)

#

I think Garlic is Gpt 5.5 not Gpt 5.2

echo aurora
# neon idol Fine

I'll start removing folks that do πŸ‘‹ multiple times + if it's their only contribution to the server

neat apex
#

Since they need do a lot to suprass Gemini 3 pro confortably

slim bobcat
#

Does anyone know what is holo-scope?

echo aurora
#

But for folks that are just πŸ‘‹ they can stay

sterile tartan
weary galleon
sterile tartan
#

And it will be renamed as Closedai

neat apex
#

Garlic Pasta Temperoni

compact flame
sterile tartan
#

A ai can't stink

#

πŸ’€

weary galleon
#

Garlic has wonderful smell

compact flame