#general

1 messages · Page 311 of 1

native flame
#

I remember it used to be kinda that way before the rebranding

plucky sparrow
#

but why would they do that if the answer is actually good?

atomic lagoon
honest verge
#

You don't lose them because they disappear when you open this model

red chasm
#

And how much longer do we have to wait for at least GPT 5.4 to return?

storm dust
#

ban

plucky sparrow
icy ice
#

is gpt-5.4 better than sonnet-4.6?

atomic lagoon
#

Oh well yea but then it moves to another model that could be better

red chasm
plucky sparrow
#

or worse

atomic lagoon
#

People will still abuse it just to abuse it

plucky sparrow
#

so you have an incentive to vote for the better answer. there's no easy way to abuse it

icy ice
#

when I do a side by side/battle its usually both are good. there's very few cases where one is clearly better. i dont know if it actually helps arena if you do that

plucky sparrow
#

sure, you can say 'both suck' and keep cycling through hoping to get a better model, but it may never com e and youi're wasting a lot of time

icy ice
#

i try to evaluate the answer and pick one is better but there are many times I dont know enough to say that

plucky sparrow
#

and you won 't know if you cycled away opus 4.6 thinking, until you say both suck, but then you already lost it

icy ice
#

yeah i dont do the cycling with both are bad. it seems like a waste of their compute resources

plucky sparrow
icy ice
#

this is already a free service thats hurting and losing money

plucky sparrow
#

different tasks

echo aurora
plucky sparrow
#

research questions, generate a webpage, generate some code, what does this error code mean, etc.

icy ice
#

also a suggestion for people - grok isnt that bad at a lot of research/code tasks, i find it produces concise answers that work

echo aurora
red chasm
icy ice
#

people dont use grok but it has much higher usage limits, either here or on grok.com

plucky sparrow
#

i liked grok 4.0/4.1 better

#

4.2 seems nerfed for search

icy ice
#

it has a very different way of thinking and sees less over engineered

echo aurora
plucky sparrow
red chasm
#

Well, we'll wait until they tell us at least approximately about the timing.

icy ice
plucky sparrow
#

what topic? what language apps ?

#

and how simple are these quick apps?

icy ice
#

app - web/python. eg i am working on a catalog app in which i can store stuff i own, files on disk etc

plucky sparrow
#

and it's one-shotting with success? interesting

icy ice
#

im not a vibecoder so i know the tech involved, so i can ask it detailed prompts

plucky sparrow
#

which grok model are you using?

icy ice
#

im bad at ux and design, so i will probably end up using some of the coding tools. arena is just for quick design research

plucky sparrow
#

in that case, that explains it, python is probably the easiest model for LLMs but it still makes mistakes in architecture, but since you're asking it very specific architecture and using python, basically any model with sufficient training data will work for you

icy ice
#

i also want to use ai to ask about my taxes but i dont think its a good idea because of the privacy issues, but local llm arent possible for me to run

echo arch
#

The problem is, by the time you fix the problems, people will already be gone...

#

Please include at least 2/3 important models, not that you remove them all and leave the worst ones to the Users...

icy ice
plucky sparrow
#

design?

#

also which grok 4.2 are you using?

echo aurora
icy ice
#

i mean design spec/architecture. eg with using python as language and sql backend

#

i am using grok-4.2, either thinking or normal

honest verge
#

Imagine using ai when a Socrates comes up to you and asks "If Ai is your power what are you without it?"

plucky sparrow
#

i mean the same can be said about most modern living

#

if your job is social media, what are you without the internet?

glacial swan
#

Hi everyone, I’d really like to hear your thoughts on this. I’ve done quite a deep analysis myself, and I have a lot to say and ask.

Given the recent changes and restrictions on the Arena, I’ve had to switch to other services over the past few days because using the Arena has become almost unbearable due to all the limits and cuts.

However, I’ve noticed something interesting. When I use the same Nano Banana 2K on other platforms, even on the original and most official Gemini, the image quality and accuracy are noticeably worse compared to the Arena.

This makes me wonder: is it possible that models on LM Arena are somehow enhanced or fine-tuned to perform better? Because I genuinely don’t understand how the same model, used in the same way across three different sites, produces consistently poor results everywhere else, but significantly better results on the Arena.

So my question is: do LM Arenas apply any additional modifications or optimizations to the models? And if so, why does this difference in quality happen?

plucky sparrow
#

the gemini app harness is sort of terrible

#

you'd be likely better off using one of the third parties that offer it

#

cause they have to use it direct api, most likely

glacial swan
# plucky sparrow you'd be likely better off using one of the third parties that offer it

But which ones exactly? I'm ready to pay, as long as the result is identical to what I get on LM Arena. So far, I haven't found a single service that matches it. Take something as simple as text within an image: on LM Arena, it comes out without distortions or defects, but on other sites, everything looks 'dumb' and 'plasticky' somehow, full of artifacts and 'AI-slop'

glacial swan
#

Flow
Google AI Studio
Poe
Telegram bots

plucky sparrow
#

poe i'm a bit suprised at, flow is basically google harness

calm lagoon
glacial swan
#

Yes , But why does it work without any distortions there [on LM Arena]? To be honest, I haven't tried the paid version in AI Studio yet, but I have a feeling the result will be exactly the same

edgy magnet
#

yo um wheres the channel where i can turn my photo into a video clip im not seeing it can anyone help me

gentle crag
#

I feel strange among so many people who speak English.

echo aurora
astral blaze
#

wow what happened to the direct models

#

Might as well just cut the whole mode

gentle crag
honest verge
#

Imagine using ai when a Socrates comes up to you and asks "If Ai is your power what are you without it?"

astral blaze
#

At least image look fine

desert pendant
#

dude

digital eagle
#

Oh my god, there's already a reference to Socrates here.

#

He is everywhere.

desert pendant
#

i think that arena isn't gonna last until 2027 rn

plucky sparrow
#

depends what youi mean by last

gentle crag
desert pendant
digital eagle
desert pendant
#

they could've keep Gemini atleast

#

wait

digital eagle
desert pendant
#

why don't they remove the worst models?

#

???

plucky sparrow
#

i'd im agine it won't have half as many users unless they bring back some of the top models or better ones

desert pendant
plucky sparrow
#

but, i don't know about dead

desert pendant
#

dude

#

there is no reason to keep paying for those bad models

plucky sparrow
#

and it helps them with their stats on their leaderboard

digital eagle
#

I don't think Grok is as good as the ones that were removed.

desert pendant
digital eagle
#

But that's what we have.

plucky sparrow
#

they can be like "we've tested over 500+ models! and we host 100+ models on our site!"

echo aurora
digital eagle
#

What are the chances of me getting the opus in the battle? 😥

gentle crag
#

I was making up a story with Claude Opus in direct mode, well, one is an understatement, 300 minimum XD attention deficit

plucky sparrow
digital eagle
desert pendant
#

and we are back to 2023 - 2024

echo aurora
rough heath
#

Direct mode has removed a lot of top-tier models.

desert pendant
#

how arena feels right now

storm dust
digital eagle
icy ice
#

guys remember this is a free service that gives you access to a ton of premium models. there is really no reason to complain about removal of models. try to complain to openai or anthropic about how they keep reducing limits

storm dust
#

how pineapple feels right now

#

🧃🍍

digital eagle
#

I hope Opus returns before stage 2 sbr

plucky sparrow
#

what's stage 2 sbr?

pseudo hemlock
#

can max route to opus?

digital eagle
#

Steel Ball Run dude

storm dust
pseudo hemlock
#

bruh

gentle crag
digital eagle
gentle crag
indigo knoll
#
poll_question_text

Best AI/platform overall for free users rn?

victor_answer_votes

12

total_votes

13

victor_answer_id

2

victor_answer_text

Gemini

storm dust
#

definitely gemini

honest verge
digital eagle
storm dust
#

hell nah

digital eagle
#

Hell yeah

gentle crag
#

I can't use Gemini XD, damn country!

storm dust
digital eagle
#

What place do you live in that doesn't have Gemini available? Excuse my language.

honest verge
digital eagle
#

Okay...

gentle crag
#

XD

honest verge
digital eagle
#

:⁠-⁠|

digital eagle
storm dust
#

i call my dad to ban you

digital eagle
#

Oh, sorry man.

gentle crag
#

I need a VPN for almost everything, even Lmarena needs a VPN.

storm dust
#

you hurt my feeling

honest verge
# storm dust ban

Anthropic should make Max ultra subscription 1000$ WITH 2X ACCESS TO MYTHOS

storm dust
#

you over

digital eagle
#

My apologies.

storm dust
honest verge
storm dust
#

over

celest kettle
#

hello guys .. where can i chat a support team ?

gentle crag
honest verge
#

He will help you

digital eagle
agile wharf
#

why is arena removing models like cluade 4.6 and gemini

storm dust
#

you can also ping the moderator role for support but only if it is important

digital eagle
agile wharf
#

yh they removed cluade opus 4.6

digital eagle
storm dust
#

i got my own clouds

celest kettle
# storm dust <#1466486650170245435>

i tried , My chat hit the max length and I can’t continue it. It’s important for my project 😅

Any way to keep using the same chat or fix this? delete the first half of it or something ?

storm dust
#

fare im not mod but i can answer that

#

no you cant

digital eagle
storm dust
#

at least yet if i am correct

honest verge
#

Is this new anthropic model

#

Cluade 4.6

echo aurora
honest verge
storm dust
#

pineapple fare needs your help

honest verge
#

Do you need any help?

digital eagle
#

Calm down, Leha.

storm dust
storm dust
feral kernel
#

this might be the worse day of Arena 💔

storm dust
#

i like being near with my beloved clouds

desert pendant
digital eagle
desert pendant
#

this is kinda funny

storm dust
#

yes thats exactly where they are

storm dust
#

on here

digital eagle
desert pendant
#

wait THERE IS

#

A V4 FOR DEEP SEEK?

#

IM TESTING THIS THING OUT

quartz light
#

new arena model!! ernie-image

desert pendant
digital eagle
#

Still without an opus...

storm dust
quartz light
#
fast dew
desert pendant
quartz light
#

the minute i said it did

fast dew
#

O

digital eagle
#

Huh

storm dust
#

ai week 🥀

honest verge
honest verge
#

DDDDDDDDDDD

desert pendant
honest verge
#

Deepseek v4?

#

Is it finally here?

digital eagle
#

Ernie

storm dust
honest verge
#

Oh

desert pendant
#

deep seek v4 early acess

digital eagle
#

Ernie...

quartz light
#

ernie-image is in direct chat

#

so test it

honest verge
agile wharf
#

are there alternative to arena ai

honest verge
#

When you can use Gemini flash image 2.0

celest kettle
#

sorry guys its my first time using it .. so there is no way to continue in the same chat ? should i give up ?

digital eagle
quartz light
agile wharf
#

are there

quartz light
#

it just released

agile wharf
#

are there alternative to arena ai

honest verge
echo aurora
pseudo hemlock
#

hi

dense sphinx
#

Opus again gone 😕

storm dust
#

it was better if i didnt talk about clouds

agile wharf
#

are there alternative to arena ai

quartz light
#

i asked ernie-image for a rubiks cube with a mirror reflecting it kek

pseudo hemlock
storm dust
#

what a nonsense i made up lol

desert pendant
#

ok so

#

guys

pseudo hemlock
#

whats up kenny

gentle crag
digital eagle
#

What????

desert pendant
#

i will test deep seek v4 right here

pseudo hemlock
quartz light
honest verge
agile wharf
#

are there alternative to arena ai

pseudo hemlock
desert pendant
digital eagle
agile wharf
storm dust
quartz light
#

I prompted "roblox" not really roblox but looks cool i guess lol

pseudo hemlock
#

are the top 5

#

i think

agile wharf
#

that are free

pseudo hemlock
#

well

desert pendant
#

im gonna use Roblox cuz it's the easiet option

pseudo hemlock
#

they're all sorta free

#

nothing is 100% free

desert pendant
#

-# too lazy to install godot rn

digital eagle
pseudo hemlock
agile wharf
digital eagle
#

Oh...

#

I don't know either...

agile wharf
#

bruh

digital eagle
#

😥

quartz light
desert pendant
#

wish me luck

pseudo hemlock
echo aurora
pseudo hemlock
desert pendant
quartz light
desert pendant
#

-# his text is normal too

#

ernie doesn't look so boring

quartz light
pseudo hemlock
quartz light
#

prompt was just "donald trump"

digital eagle
#

LoL

desert pendant
#

with glasses

quartz light
pseudo hemlock
#

ask it to make this

digital eagle
#

All that was missing was for them to put a demonic horn on it, since it's from China XD

pseudo hemlock
#

dont ask where i found this

desert pendant
#

WHY THEY DID TURN PINEAPPLE INTO JUICE

#

NO

quartz light
pseudo hemlock
#

iykyk

quartz light
quartz light
desert pendant
#

not a whole background

#

holy

pseudo hemlock
#

has anyone tried realistic image edits?

echo aurora
pseudo hemlock
#

i want to edit a pic to make it look good for my linkedin lol

#

wondering which model to use

desert pendant
#

try this new model nick is showing to us and gemini

pseudo hemlock
#

gemini > openai's image edit?

desert pendant
#

yeah

pseudo hemlock
#

i will try

desert pendant
#

(for what i used definitly)

pseudo hemlock
#

thank u frend

desert pendant
gentle crag
#

I usually use image AI more for editing than creating images; I'm lazy.

fast dew
#

Idk this model kind of sucks bro

pseudo hemlock
fast dew
#

The ernie one

pseudo hemlock
#

i mean its their first image model

fast dew
desert pendant
pseudo hemlock
#

ill do better

quartz light
#

@echo aurora i tried crine

desert pendant
#

in a room

#

with nothing

#

and i will do nothing

#
  • not me
pseudo hemlock
#

pineapple

quartz light
#

is this ernie image?

pseudo hemlock
#

gemini and openai

desert pendant
pseudo hemlock
#

ok

quartz light
#

ernie image only allows text input so i had to specify every detail

#

clear glass filled with yellow liquid, floating green pineapple stem emerging from top of glass. lime slide on the rim, yellow and white striped straw sticking out. two black paper circle eyes and a pink 90 degree rotated 'D'-shaped paper smile decal on the front. cartoony 3d render style.

pseudo hemlock
#

ernie keeps giving me errors

quartz light
#

same

#

but i just retry

#

ernie-image!!!

pseudo hemlock
#

W

#

thats good

desert pendant
#

peak

#

guys which fruit do yall like?

somber pagoda
#

we should have a shareable link for chat

pseudo hemlock
somber pagoda
#

^

desert pendant
pseudo hemlock
#

i dont like

desert pendant
quartz light
pseudo hemlock
#

NOOOOOO

#

thats so sad

desert pendant
#

damn gpt cooked

#

peak edit

quartz light
desert pendant
#

let me test ernie here

#

oh ernie can't edit

quartz light
#

yea 😔

desert pendant
#

for a first model it's actually good

pseudo hemlock
#

why am i getting "Which is better" with max and it gives me deepseek v3.2 and 2.5-flash

#

😭

#

im using max for a reason bro

desert pendant
#

🥹

pseudo hemlock
#

pineapple

#

fix pls good sir

desert pendant
#

ok gemini actually cooked too?

pseudo hemlock
#

why B&W

desert pendant
pseudo hemlock
#

o

quartz light
pseudo hemlock
#

same

#

i tried like 3 times and didnt work once

quartz light
#

yall ernie image is gone

#

@pseudo hemlock @desert pendant

#

RIP ernie image

desert pendant
#

THEY TOOK MY "why they don't remove the worst models" TOO LITERAL

#

NO

quartz light
#

nvm its back

desert pendant
#

oh

#

jeez

quartz light
#

🎉 lololol

desert pendant
#

wait

#

they added a new option to deep?

#

THEY READ WHAT I SAID?

#

NO WAY

#

🥹

radiant hollow
#

Anyone experiencing this also?

quartz light
pseudo hemlock
#

nanogpt, arena, claude, chatgpt

quartz light
#

i asked ernie image for gta 6 gameplay footage

pseudo hemlock
#

DAAAAMN

#

ernie cooking

radiant hollow
pseudo hemlock
desert pendant
pseudo hemlock
#

supposedly expert is their new model

#

MAYBE v4

quartz light
#

not 4.5

desert pendant
#

-# expected not true tho

quartz light
pseudo hemlock
#

hi

radiant hollow
barren burrow
#

Nns

desert pendant
#

caps lock mb

#

deep couldn't even do something like that in just 2 prompt

#

🥹 kinda liked the result

indigo knoll
desert pendant
#

(original website)

viral notch
#

hard to believe that deepseek is totally free. been using it and i dont see any paid options

#

is it being ran for competition reasons or what?

storm dust
#

free?

viral notch
storm dust
#

interesting

pallid crypt
storm dust
#

hm but arena's leaderboard shows that it does waste money

radiant hollow
#

I just started a new chat and usage limit has been exceeded after a prompt now I'm being asked to start a new chat, wtf is going on with arena

echo aurora
radiant hollow
echo aurora
#

Unclear what you mean by "before" though

faint epoch
#

finally realized making almost every model available to be accessed by everyone with almost no guards is unsustainable

echo aurora
faint epoch
#

think they used to be substantially higher though

radiant hollow
floral canyon
#

i think most of us suspected it's not gonna last forever once the site grows

radiant hollow
echo aurora
floral canyon
#

but this site is like 3 years old

radiant hollow
#

Anyhow, it's free to use.

floral canyon
#

I' m happy gemini 2.5 pro is still here

fathom basin
#

arena does not seem to be able to create AI videos that is vertical. anyone knows how to solve this? or any other software/programs that can do this?

radiant hollow
#

But I'm frustrated with everything

radiant hollow
#

Lol, maybe worse

#

I really have projects to work on and I'm just sitting here seeing different problems

echo aurora
floral canyon
fathom basin
radiant hollow
radiant hollow
echo aurora
echo aurora
fathom basin
sterile dust
#

What is april26-chatbot2?

#

Is it Claude Opus 5 Pro/Ultra or Claude Mythos Mini?

#

It outputs very slow, maybe it's a huge model

whole sundial
#

i believe it's nvidia, probably the 400B Nemotron 3 Ultra model

median ember
#

I want to buy "claude-opus-4-5-20251101-thinking-32k" from somewhere. The responses I received while using it on arena.ai were excellent. Is there an API service I can use similarly? I'm thinking of buying it with cryptocurrency. I couldn't find "claude-opus-4-5-20251101-thinking-32k" on OpenRouter. OpenRouter has "Opus 4.5" but no Thinking mode. Can you recommend a place where I can buy it?

desert pendant
#

you can use open router or original claude

median ember
# desert pendant you can use open router or original claude

I was using "claude-opus-4-5-20251101-thinking-32k" in arena.ai. Would Claude Opus 4.5 or Claude Opus 4.6 in OpenRouter give similar output?
Have you ever bought anything from OpenRouter?
I really like the web interface on "arena.ai". Is there a place where I can use the OpenRouter API in a similar way? Does the API have a feature to remember previous messages?

whole sundial
whole sundial
#

openrouter has its own chat interface that you can use your credits in

median ember
stray aspen
#

guys is this the downfall of lmarena

#

all the good stuff is gone

whole sundial
whole sundial
median ember
#

Opus 4.6 on arena.ai was legendary and great.
Opus 4.6 on antigravity is terrible.

Could you please check if there's a reasoning feature for Opus 4.5 in OpenRouter?

median ember
# whole sundial it should, i don't see any reason why it wouldn't

I considered buying it from Claude's own website, but people have left very negative reviews. Even for $25, they only allow you to ask 3 questions and get answers within 5 hours. They say it's bad.

I think what they're offering isn't the API, but the Pro membership.
I think the API is better than the membership.

#

I don't know much about it either, I'm new and don't understand much about these things.
Thank you all for giving me the opportunity to speak on this platform.

plucky sparrow
novel geode
#

yeah anthropic doesnt have a good reputation on rate limits xd

whole sundial
median ember
ocean venture
#

Well This issue is getting very complicated, if the Arena team makes a wrong move in this situation, I don't know what will happen next... I hope the Arena development team will soon find a mutually beneficial solution with users that use Arena.ai xd thanks to @echo aurora and other's developer for the hard work onto Arena.ai

median ember
median ember
whole sundial
#

idk if this counts much, but it's the closest arena ever had to its own ai model

gloomy onyx
whole sundial
#

also the "demo" link i think just takes you to arena now

whole sundial
#

it's based on llama 2, but it's a finetune

gloomy onyx
#

it's not a 100% fully custom model

whole sundial
gloomy onyx
whole sundial
median ember
stiff goblet
#

best AI for editing and writing draft assignments?

marsh horizon
stiff goblet
#

any specific models

marsh horizon
#

cant use opus or 3.1 anymore so just use claude sonnet 4-6 or gemini 3 flash

gloomy onyx
#

I'm not rich, I can't afford to buy an H100

#

maybe I could rent it, but still..

thick spade
#

is there a paid version of this platform?

thick spade
#

damn

ocean venture
carmine shard
mild spade
carmine shard
heady whale
pseudo hemlock
#

Hi

gray isle
#

doing hard prompts again and again 😭

sullen creek
#

/image to vide o

ivory schooner
#

oh,claude opus models
gpt 5.4, gpt-5.4-high
gemini-3.1-pro-preview
oh,claude opus models
gpt 5.4, gpt-5.4-high
gemini-3.1-pro-preview,
I'll wait for you to come back!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!
!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!

hollow mulch
#

The heck why im got empty response?

barren burrow
#

claude opus

#

claude opus

#

claude opus

plain gazelle
#

it's kinda glitchy recently

vernal meadow
hearty spire
#

最近是不是把gemini 3 pro和opu 4.6都移除了

civic plaza
#

Find a way to make them more sustainable

echo aurora
short sluice
#

i wish that there would be an eta though because it just seems like it wont be available forever without one.

i still expect it to be here again by the second half of the year

echo aurora
short sluice
#

even if it doesnt come this month i still feel like if it was to be announced it would be announced here and probably next month or june

echo aurora
#

Overall we’re going to avoid giving expected dates unless we’re very confident in that date. Shifting priorities, unexpected variables, etc just make it too difficult to say at this point

short sluice
#

true. the credit system though needs some adjustments definitely, but with it i think frontier models will come back faster. again people were saying that ads could help in monetization of this website

echo aurora
gray isle
#

whoever is this Assistant B. it's good

#

somehow the model just didn't appear, but that's fine to me 😊

echo aurora
echo aurora
gray isle
echo aurora
#

Is that all of it?

#

Ty ty

gray isle
#

it's just one request for my favourite prompts of all time (which is asking for LaTex) or creating PolSci Scenarios (What If)

stone forum
#

@echo aurora when is the arena gonna have an ai music arena session?

dreamy lynx
#

are the models back or are they gone for good?

velvet furnace
#

It is said that the GLM-5.1 AI model has surpassed Sonnet 4.6 — is that true? Has anyone used it? Please share your thoughts.

gray isle
#

but not in Direct or neither Side by Side

short sluice
#

basically every chinese llm is benchmaxxed

#

no exceptions

fossil lark
#

uh oh

#

I've been out for days

#

back in arena and found claude opus 4.6 and gemini 3 are both gone

#

but they are still on the leaderboards

#

is there an document arena update, so they removed them temporarily

#

or just permanently

dreamy lynx
fossil lark
stone forum
gray isle
#

like its treatment of LaTex is non existent. it just blank

empty sky
#

Whar did he say

#

You say slur?

#

Oh very ofansive

austere kiln
deft spruce
#

..what the i used max 1 time but the
{"error":"Chosen Model(s) are no longer available"}
WHAT THE HECK???

keen moth
#

Wheres is opus and opus thinkking and gpt 5.4 with files?

olive spruce
keen moth
#

why?

patent bane
gray isle
balmy shadow
#

Hi, I’m experiencing an issue with a long conversation on Google Gemini. Today, when I send a message, it keeps loading for more than 30 minutes without any response. I’ve tried refreshing the page, logging out and back in, using a VPN, and even switching to another device, but the problem still persists. Could you please check and help resolve this issue? Thank you.

hollow mulch
#

Yeah abd fix rate limit and the empty response in searching model

hollow mulch
#

Is make me so mad 🙂

viral notch
#

we dont even have gemini 3 pro

gray isle
placid wyvern
balmy shadow
viral notch
hollow mulch
balmy shadow
#

ok np thnx

supple scaffold
viral notch
supple scaffold
#

I mean, Google removing it from AI Studio and Google disabling the API are different things after all

odd geyser
onyx shore
#

any got imagine v2 acces in arena?

stone forum
#

CAN SOMEONE RECOMMEND ME AN FREE AI WEB THAT HAS A COMMON SENSE, GOOD MEMORIES AND CREATIVE WRITING AND RESEARCH 😭😭😭😭

whole sundial
opaque maple
#

GPT 5.4 in Codex is very good!

#

but it talks too robotic

halcyon ingot
opaque maple
ocean venture
# opaque maple

What was that wbsite or interface? are you use local? base on that photo?

halcyon ingot
gray isle
#

whoever is that Assistant B. i would love to know it and say thank you

scenic echo
#

reklam

narrow copper
#

Guys where are the opus gemini gpt 5.4 models? Why they got deleted

narrow copper
hollow mulch
gray isle
hollow mulch
narrow copper
dreamy lynx
#

they were too expensive, so they took em away

#

you can use gemini 3 flash if you want

gray isle
dreamy lynx
#

i use llms for making up stories for me, that are enterntaining, so flash works well for me

bleak lake
#

well it's not fault of arena, these models are expensive to provide. Moreover the purpose of arena was to test these models, and provide accurate benchmarks without any bias.

But people started abusing it by creating new accounts and using same model in direct chat over and over again

#

it was eventually gonna happen

gray isle
#

there's seven ais (reduced to two right now) who can do my request (a very long hard prompts that i do a lot)

hollow mulch
#

Well

thorn hornet
light sleet
#

gemini-4.5-flash is great tho

#

use 4.5 flash

bleak lake
# thorn hornet Can flash translate in perfect Albanian for an Srt file I don't think so

honestly none of the current models can perfectly translate a language to another, imo, it's very close tho, if we remove the gaurdrails it's gonna get almost perfect. There's a reason why experts say gaurdrails are holding a model back.

If the language you are translating consists some swearing words or words that might get flagged by gaurdrails then Ai is not good at translating them hence resulting in a not perfect translation.

I'd advice if you know the material you are translating contains some words that might get flagged then its better train an agent yourself or eh grok works better than most models at this sadly

light sleet
bleak lake
light sleet
#

4.5

#

4.5 flash

thorn hornet
light sleet
#

pineapple got 4.5 million dollars by Gemini 4.5 cuz he is now a pineapple juice

#

3 days left till pineapple juice goes

light sleet
#

Claude mythos Thinking

#

This one

#

or Claude Pineapple Ultra could work too.

silk dew
#

is that real

light sleet
#

In parallel universe, yes.

silk dew
#

wow!

echo dome
thorny schooner
#

I am actually starting to get so annoyed why is verification happening every time I tried to do a retry😭

zenith steppe
echo dome
deft spruce
desert pendant
subtle rose
echo dome
desert pendant
desert pendant
#

in arena website then edit the text

#

and doneee

deft spruce
#

wait....arena can edit with CHROME DEV TOOLS?

desert pendant
echo dome
deft spruce
desert pendant
#

website stuff

#

blah blah blah

desert pendant
#

💔

zenith steppe
#

wait what is F12 ?

desert pendant
#

developer tools

zenith steppe
#

i know about f1

#

🏎️

desert pendant
#

i can do like

echo dome
grave hatch
#

How to unlimited video create

desert pendant
subtle rose
desert pendant
#

f12 changes aren't that hard tho, here is an example

#

-# he didn't said this ANY time

echo dome
#

recaptcha is sinful

deft spruce
desert pendant
#

gang

#

they just added some new feature to deep seek

ivory latch
#

i haev a doutbt where i can see archieve chats

desert pendant
echo dome
#

pressed f12 to see sources

ivory latch
#

and one more thing is claude opaus is removed ?

desert pendant
#

yes

echo dome
#

wait what does tools uses for ai models (lmarena)

solid scaffold
honest verge
#

What...

#

1970?

gusty zephyr
#

Just say you're broke and follow openrouter 50rpd.

desert pendant
honest verge
echo dome
#

here's recaptcha one:

they collecting:
IP addresses - tracks your location
Mouse movements and clicks - behavioral profiling
Device information - OS, browser type, screen resolution
Cookies - persistent tracking across sessions
Time spent on pages
Browser fingerprints - screen size, resolution, language, plugins, JavaScript objects
Keystroke dynamics - timing and patterns of keyboard input
Screenshots of browser windows - Google literally photographs your browser
All data transferred to U.S. servers

and reCAPTCHA Violates GDPR

echo dome
honest verge
#

I don't see anything bad in it

echo dome
#

they got fined how is this not bad?

honest verge
echo dome
honest verge
#

Except cucumber on the table

#

It watches me

#

Straight in my head

echo dome
gray isle
echo dome
gray isle
#

RuNet and China ROM for example. does the same stuff like you said

gray isle
echo dome
honest verge
#

It's just a contact with the government

echo dome
honest verge
echo dome
honest verge
#

You can contact russian government with it

gray isle
#

my colleagues doesn't even know Yandex, like i'm probably the only one in my class (that is dumb for the most part (Mathematics/Physics) that knows random stuff like that

echo dome
honest verge
#

It's basically everywhere

gray isle
echo dome
gray isle
#

not PH the site, the country

echo dome
gray isle
#

treating itself like a douchebag

honest verge
gray isle
#

if S.D wins, well that's good for the dumb ones

echo dome
#

when teams picked google login where recaptcha: problems

elfin sail
#

It's me or Flux 2 never actually work XD

gray isle
gray isle
elfin sail
#

GLM was pretty "decent" for the text to wait for claude to come back

gray isle
#

if somebody says that it's better than sonnet 4.6, then think about it, why it can't PRODUCE a good LaTex

elfin sail
#

I wasn't able to resist and took a month subscription on Anthropic

modern flame
#

Arena might lose their large community if they don't bring top models back(at least Gemini 3.1 pro or gpt 5.4)

sly cedar
#

Nah, id learn code than burning my brain with ai enforced code 💀

gray isle
elfin sail
#

it's to create stories

#

so it's okay for me :3

gray isle
echo dome
#

id prefer not to learn code than burning my brain with learning code for years
like i already want to be composer

elfin sail
elfin sail
#

I mean, I'm paying and I can't use it as much as I was using it on Lmarena when it was there

gray isle
#

oh hell nah, not my stool erupting again

desert pendant
#

time to continue my test with deep seek

gray isle
#

if only Gemini 3.1 pro preview have File and Video File Upload in Arena, i would have been transcribing for a long time

#

sadly it's only file upload

#

for it's existence

#

except for AIstudio

echo dome
#

when arena turns into arena

desert pendant
#

of you want an good AI for coding

rigid copper
#

i asked ai to suggest an idea to keep arena.ai sustainable, lol

#

not sure is it a good idea, or a dumb idea

elfin sail
#

some good ideas though

#

adds/donations

rigid copper
#

it was provided by claude :)

lavish ibex
#

are there any good alternatives to arena? where opus 4.6 is available with the same limits as here 😖

lavish ibex
#

yupp is stopping

desert pendant
obsidian cargo
#

probably because they were offering free opus 4.6 lmao

lavish ibex
#

still wondering how they can offering all the (top) models for free

elfin sail
#

I guess

rigid copper
amber marten
#

Yupp is already stopped.

golden ocean
#

how does lmaerena benefit from direct chat

lavish ibex
#

they may contact me too haha

golden ocean
#

are people suppose to use the 👍 and 👎 buttons for each message

#

does anyone even do that

desert pendant
fossil lark
#

I did

desert pendant
#

i do

#

for help them

fossil lark
#

ppl also provide the original data to train

desert pendant
#

max can't handle this more bro

#

💔

lavish ibex
#

Whats the reason opus is gone?

golden ocean
#

is jt gone again

fossil lark
#

opus is toooo expensive

rigid copper
lavish ibex
#

get it

rigid copper
#

they have to cope the cost

fast dew
#

Yh thats why they doing a credits system

mild spade
mild spade
lavish ibex
#

it was a monster model

golden ocean
#

it returned for like a day

#

a few days ago

fast dew
golden ocean
#

i thought

fast dew
#

In like the future

golden ocean
#

now it gone for me again

mild spade
desert pendant
#

im using deep seek rn

mild spade
#

im so scared,,,

desert pendant
#

he is saving the day

mild spade
#

deepseek what

#

official site?

desert pendant
#

ye

mild spade
#

cool

abstract hinge
#

I felt like Cooper from Interstellar watching you guys argue yesterday about AI, and I couldn’t say anything because I got a 24-hour punishment for speaking Portuguese.

rigid copper
#

then we need to work together to figure it out what to do to ease it

lavish ibex
#

deepseek is also a monster

desert pendant
#

they added this new option and im using it

rigid copper
#

my own advice is to impose stricter limit, stopping bots.

mild spade
#

What option

desert pendant
golden ocean
#

can't u just intercept lmarena api request and replace the modelAId with claude opus's id?

mild spade
golden ocean
#

if anyone saved claude opus model id on lmarena

desert pendant
lavish ibex
#

free

desert pendant
#

💔

mild spade
#

what it good at

desert pendant
#

wait

lavish ibex
#

I asked deepseek to splits a CSS 4000+ lines file into 10 seperate css files for each function, did it perfectly

rigid copper
#

that's why arena.ai art and coding contest/event is gone, because they are dealing with budget

desert pendant
fast dew
#

🙏

desert pendant
abstract hinge
#

The best way to make this viable is to assign a point cost to every model. These points would be earned by using battle mode.

Instead of just choosing which model is better, users would also need to provide a reason for their choice. That reason could then be verified by a smaller model.

For example, something like a 7B Mistral could check whether the reasoning is actually valid and reflects what the user experienced with the model’s response. This would make the system more reliable and reduce meaningless votes just to farm points.

desert pendant
#

battle should be a bit more tho

abstract hinge
#

The problem the admin mentioned earlier was how to validate the votes. By requiring a reason, and having that reason verified by a model, this issue would already be addressed.

desert pendant
fossil lark
#

i gave him a 120p pdf, 5MB to analyze

outer flicker
#

still obessive with opus hmm guy

fossil lark
#

how much time would it cost with expert mode

#

pretty quick, almost instantly reply

#

well it's done in 1 min

#

this is a thesis i've been reviewed for days

#

compared to my paid claude opus 4.6, deepseek expert is more merciful

abstract hinge
#

Has anyone managed to use the Meta Spark model? I tried it yesterday, but it would start and then crash midway with an error.

hollow ivy
abstract hinge
willow sleet
#

@abstract hinge signed up for yup then they closed

still musk
#

Guys, GPT IMAGE 2.0 will be released tonight after GPT image 1.5

honest verge
willow sleet
#

opus 4.6 understandable, then gpt 5.4 high, and now Gemini 3.1 pro all top models disappearing.. how much message per day is their limit before they disappeared guys?

still musk
willow sleet
#

why models are disappearing guys ..

civic plaza
#

Most likely price. And to find out how to make them work better

gray isle
dark valve
#

when will top model(gpt 5.4, gemini 3.1 and opus 4.6) comeback again?

gray isle
#

although it looks like GPT 5.4 High

#

since 5.4 High gave me that kind before in the past

outer flicker
#

please we need them

bleak lake
#

well it's not fault of arena, these models are expensive to provide. Moreover the purpose of arena was to test these models, and provide accurate benchmarks without any bias.

But people started abusing it by creating new accounts and using same model in direct chat over and over again

#

It was eventually gonna happen

pine raptor
#

Hi

obsidian cargo
#

but yeah gpt image 1.5 is already out

#

and is kinda ass ngl

balmy mist
#

whats the best cheap or free model? i feel like that is the meta now, all paid models are pretty much the same now

mild spade
polar horizon
#

when it comes to being realistic af it's ahh

#

it's so obvious to spot a gpt generated image

obsidian cargo
#

yeah its realism is bad but also when it does digital art style it has this wonky weirdly vibrant art style that I don't like

still musk
#

Many users on X are trying it, just do a search

obsidian cargo
#

thanks for the source :D
I know some people have access to it in chatgpt right now but I don't

#

and I don't use twitter, not even before elon musk bought it, and now I'm even less likely to

eternal thicket
#

im switching to claude bro

#

gemini so ahh

still musk
polar horizon
polar horizon
obsidian cargo
#

can any model do good pixel art yet?

polar horizon
still musk
#

Here are some images leaked on Twitter of GPT IMAGE 2

polar horizon
#

not good enough but really good

polar horizon
still musk
obsidian cargo
still musk
balmy mist
#

i thought we solved image gen, why should we care about a new image model lol?

still musk
obsidian cargo
balmy mist
#

but by a small amount

obsidian cargo
#

hell even gpt image 2 screws up background details a bit

still musk
distant bison
#

How can you generate videos now ?

still musk
gray isle
#

and then i just ask, a very hard prompt, and what's the specific structure of the thing

night moat
#

because detect ip. ykwim

mild spade
vernal zodiac
#

damn why disable multiaccount usage?

mild spade
#

Claude 4.6 models? Aren't there only one (4.6) and then 4.5 of sonnet

vernal zodiac
#

sonnet and opus are 4.6

gray isle
#

if you're gonna do a LaTex reviewer, don't do it under LuaLatex or Xelatex, because it's gonna crash

#

you're gonna do PDFLaTEX

#

if a credit system should be hourly. (in the new usage system feedback eme eme) i would disagree, but let's say 12 hours.

#

or 16 hours lol

eternal thicket
gray isle
devout shale
#

Why don't I have claude-opus-4-6-thinking...do you know other alternatives where there is claude-opus-4-6-thinking?

gray isle
half yew
#

This what I generated with image v2 before it got took off arena, my prompt was simple

grim cliff
#

Hey why cant y'all add a donation system?

obsidian cargo
#

cant believe image v2 recommended dr disrespect

paper island
half yew
#

Chatgpt and it’s only available to select users

#

But they saying it’s coming out today at 1 pm

magic imp
#

What is image V2 is it available at arena??

paper island
#

comeback of the century if true, google's nanobannana has been the best by far for so long

half yew
# magic imp Really??? Share prompt

A twitch screenshot of a semi popular dude, he just got a trickshot in modded cod server
Be super detailed and very specific and very realistic

paper island
#

god damn

obsidian cargo
#

in 2 hours?

half yew
obsidian cargo
#

nice

obsidian cargo
paper island
#

can't wait for it to get lobotomised 2hr after launch

half yew
paper island
#

Yeah but this is openAI we're talking about

#

They always just let it fly with no guardrails for a while to build hype then restrict it so much it's nearly unusable

half yew
#

Basically what they did with sora

half yew
#

Also here’s what I got with nb2

paper island
obsidian cargo
#

also they actually dialed back the copyright restricions on gpt image 1 after a while

paper island
#

good point

grim cliff
#

hi guys

#

is there any AI that has like a coding similarity to Claude Opus?

still musk
half yew
magic imp
still musk
paper island
grim cliff
#

Nah i do like desktop apps and stuff

paper island
half yew
magic imp
#

Bro what you guys think is gpt iv2 will be lauch on arena ??

paper island
grim cliff
paper island
grim cliff
#

I know that qwen 3.6 plus is reallyy gosh darn good

half yew
magic imp
#

Bro 3 pro just did a mistake 😭😭😭

grim cliff
#

beautiful

magic imp
#

Yaah it's beautiful

#

And did you notice the gamer background wall

paper island
obsidian cargo
#

TOABS

magic imp
#

It's having detail

paper island
half yew
#

This image v2 nd I tried to create a aesthetic photo

paper island
magic imp
#

Let me give one more try

paper island
magic imp
#

Personally I love it not bad

paper island
#

yep i think NB has officially been dethroned

echo aurora
#

chefkiss perfection

still musk
#

I have to make a fake girlfriend and make people believe I have one hahahah

half yew