#general

1 messages · Page 315 of 1

frosty lava
#

let's be clear anyone that have access to this have many information and harmfull one

#

they can do worse than only hacking.

#

for now its only on small model

#

its okay

#

but when it come to larger model its not the same thing

undone saffron
#

Imagine a world where there are no claude or geminis, and only these exist

frosty lava
gloomy onyx
brisk helm
#

aaaaaaaaabru

undone saffron
brisk helm
#

why did they rmove all the recent models

#

like opus 46

radiant heron
#

are the opus 4.6 and gpt 5.4 even that good or is it just benchmaxing

gloomy onyx
undone saffron
gloomy onyx
#

GPT 5.4-high for general knowledge and Opus 4.6-thinking for coding

frosty lava
#

but mythos even if its out wouldn't be for everyone.. i mean yeah anyone can but who would pay the model if it cost so much per million token

#

i think mythos is like the pro version or something

gloomy onyx
#

the issue is that it's a menace

frosty lava
#

it cost so much

gloomy onyx
frosty lava
#

if they find prompt that break those guard then sure

#

but if that's not the case its good

#

and if the model take 1 hour to do a task i don't see anyone that would use it

#

or only if you really want to one shot something

jolly hound
#

Claude opus 4.6 has been removed for me.

gloomy onyx
jolly hound
#

It kinda makes sense though, there's been lots of errors and glitches while I was using it...

gloomy onyx
#

The removal reason was linked to runtime inference costs, not errors.

gloomy onyx
jolly hound
#

I doubt Mythos will release on Arena... (If it ever even releases)

subtle rose
frosty lava
#

like gemini deep think or gpt 5.4 pro

#

not the basic one

atomic lagoon
#

Absolutely lmaoo

atomic lagoon
indigo knoll
frosty lava
#

for normal chat qwen its cheaper

#

and also run really fast

golden ocean
#

LLM

hollow mulch
#

Did ppl sonar is good?

dapper egret
# indigo knoll

Just use the leaderboards. The whole point is blind voting on what gives the better answer. It’s pretty darn accurate and the whole reason Arena.ai really exists

hollow fossil
#

is it just me or is glm 5.1 onlt fully responds like 20% of the time

blissful ridge
#

Bruh where is claude Sonnet6 is gone

frosty lava
hardy lion
vast pagoda
#

man i wish this site wasnt hella laggy on mobile

brisk turret
#

Petition to add smaller models to the leaderboard

#

Gemma 4 4B and qwen 3.5 4B and 9B too!

dapper egret
#

I think they’re on the text boards

#

Code is missing a lot though

brisk turret
#

Atm on the pareto frontier chart, gemma 3 is at the top - because arena.ai doesn't pay much attention to smaller models

dapper egret
#

Actually you’re right they only have the Gemma 3 small models

frosty lava
#

but there is gemma 4 26b-a4 and gemma 4 32b

dapper egret
#

Odd

frosty lava
#

the 26b-a4 run locally very easily

#

using unsloth

brisk turret
#

Yes I'm aware

#

But 4B is the superior choice for many tasks

#

Because it's faster/cheaper

frosty lava
#

i think arena can have all of them honestly lol

brisk turret
#

I agree, no idea why they're obsessed with hyper expensive models

frosty lava
#

they removed all frontier cause those one cost alot

#

comparing to all of the other LLM

brisk turret
#

Yap. They're learning

#

Big models suck for price

frosty lava
#

yeah

brisk turret
#

Weird to add a pareto chart and then just not bother adding in the new smaller models to the leaderboard

#

It's just outdated atm

#

It makes it look like gemma 3 is a good choice for budget users

frosty lava
#

it really doesn't change anything

brisk turret
#

$0.40 per million output is still too expensive for some tasks to be economically viable

blissful ridge
lilac gate
#

Helo

fiery gull
plucky whale
#

Hi, i have a question, muse is a strong model?

fiery gull
plucky whale
fiery gull
#

But without a doubt the meta is coming back

fiery gull
#

Following persona too

#

Serious persona, not roleplay

plucky whale
#

Creative things

fiery gull
#

But, I need that follows strict rules too

#

From my skills/prompt

dim oracle
#

FOUND IT:

scorch's system prompt:

"Be friendly, sharp, and a bit playful. Not corporate-robot vibes. Be warm, direct, and honest. No sucking up. Knowledge cutoff: September 4, 2025. Operating date: April 10, 2026."

Muse Spark (Avocado)'s system prompt:

"Personality: friendly, sharp, a bit playful, warm, direct, and honest — just say 'I don't know' when I don't. Knowledge cutoff: January 4, 2026. Operating date: April 10, 2026."

fiery gull
plucky whale
dense sphinx
#

Screw your recaptcha

#

Everytime I want to do writing

#

It appears often

#

If you don't fulfill your task then say straight.

#

😑

tame mason
dense sphinx
tame mason
#

It had for me

dense sphinx
#

Thanks skippy.

tame mason
#

No problem. If it didnt work we can find another way if ya want

dense sphinx
#

You have my gratitude.

tame mason
#

Als9 Guys if youre into roleplays and was role-playing on claude and disappointed that it got removed then use grok 4.1 and prompt it to act like claude, its gonna be way better + it has a huge memory (though I think its the same as claude)

tame mason
vast pagoda
#

claudes pretty good but its rate limit really sucks

eager crag
#

Oh sorry there was already someone tying

eager crag
#

But i’m actually using openrouter this time!

vast pagoda
#

eh

#

i dont respect ai companies enough to pay for it

tame mason
eager crag
#

I just don’t know which LM to spend my money on

tame mason
vast pagoda
#

like

eager crag
#

Openrouter is the kind of company where if i use something too much, i run out of credits quickly

vast pagoda
#

ive genuinely considered reaching out to my parents to get a job to pay for it but like

#

i dont need that burden on me so illl just do 2 messages per 5 hours 💀

#

Genuinely this site (Arena) would be PERFECT if it had

  1. No chat length limit (or at least it could convert to another chat on the same topic)
  2. less laggy on mobile
tame mason
#

If it had no limit the website would lag even more

vast pagoda
tame mason
#

Because more people will be on it at the same time

vast pagoda
#

Yeah the conversion chat or at least a warning that chat limit is going to be reached

velvet furnace
#

😒

tame mason
#

I lost a 7 day rp cuz of that

#

Still dissappointed

velvet furnace
#

Even if Opus comes back, it will be weaker, because all the resources are going to Mythos.

vast pagoda
#

what is even this

tame mason
tame mason
vast pagoda
vast pagoda
frosty lava
vast pagoda
#

Its just

#

I searched it up online and literally NOTHING ai-related came up

gray isle
#

this is the best sonnet can ever do to me, and i like it

vast pagoda
#

Personally my main reservation with AI is the environmental factor as well as stealing artwork

gray isle
#

I use AI for Intermediate Complex works. otherwise i would have been editing and adding stuff

#

all to my LibreOffice

vast pagoda
hot pebble
#

Any ETA on when Claude Opus models will be back ?

plucky sparrow
gray isle
#

just like GLM

#

it crashes down

tame mason
vast pagoda
#

The Hofburg (German: [ˈhoːfbʊʁk]) is the former principal imperial palace of the Habsburg dynasty in Austria. Located in the center of Vienna, it was built in the 13th century by Ottokar II of Bohemia and expanded several times afterwards. It also served as the imperial winter residence, as Schönbrunn Palace was the summer residence. Since ...

tame mason
#

Oh shi yeah Austria i meant 😭🙏

ocean venture
#

Muse Spark in second tier between opus, Holy...

wary robin
#

Sonnet is the best we got then

#

Wait but isn’t gemini api free?

#

You just create 1 api per user, and its free for everyone

grizzled lagoon
#

@everyone i want find alternative to arena, for my coding and chatbot

tame mason
# wary robin Chat…

Thought that guy had a filter on, why's his face somehow too big for his head 🫩

wary robin
#

He lowkey looks like a chill ceo though

wary robin
pseudo hemlock
#

Meta cooked

mental rampart
#

does anyone use qwen ?

tame mason
#

People mostly use Gemini and claude and stuff

tidal oxide
#

Since Claude Opus and Gemini Pro are no longer available, do you have any suggestions for alternative sites?

verbal marten
tidal oxide
verbal marten
#

But the limit is very small only 100k-300k tokens i think

warm mirage
#

Hi, I just joined the server! and can someone explain me why i do no more find claude opus 4.6 on arena it was the best ai?

gray isle
#

*removed in Direct/Side part. although seeing it in Battle mode still baffles me

warm mirage
#

like is it to make their "Max" more atractive?
i think it s kinda bad!

gloomy onyx
gloomy onyx
gray isle
gloomy onyx
#

Well, as far as I know, model selection in battle mode is purely randomic

#

There's no routing logic that evaluates your prompt and selects the best model to answer it like in MAX

gray isle
gray isle
gray isle
# gray isle like a 6 hour read worth of task. and I realized, that's also the only way, thos...

if i don't have Arena, Opus 4.6 (in the website alone) already maxes out. because it can't even reach the kind of request i need, since it's so hard to process, you think i'm doing such horrendous front end prompt, but it's just a Basic LuaLaTex. (but since my guide is in depth, 4.6 Normal is not enough, neither Sonnet, i need extended, same to 4.6, i need thinking, and my prompt ends in 50 to 90 minutes, that's how hard, my prompt is.

warm mirage
tired ferry
#

hi

gray isle
#

i hate errors, so it would be like, i'm requesting for 7 hours, just to fix bugs, which i can do myself, but like my python already exceeds 12k in line number alone. and my request exceeds more 5k line per request.

even though i would push myself to have a will, to just fix it myself, i just use AI

verbal marten
#

Why i can't find gemma-4-36b is it error?

spark hinge
deft furnace
#

@mystic bear what happened to video generation channels I am chekin agter long time can someone please help I want to make a video from image

warm mirage
unkempt juniper
#

Is there any reason why I can't use Opus or see it?

versed kelp
unkempt juniper
#

Ah

#

Shame, I quite liked it unfortunately

tulip parcel
#

Is Kimi any good?

thorn nebula
#

This keep popping up helpppp

low patrol
#

how meta made such strong model?

#

they usually make creative but dumb models

gloomy onyx
#

This explains how distillation works

unkempt juniper
barren burrow
#

Any idea when Opus 4.6 will be available again?

#

Any idea when Opus 4.6 will be available again?

versed kelp
#

after gta 6

tulip parcel
#

I have a question. What model do you guys feel like writes the best prompts? I’ve been hearing that ChatGPT writes the best ones, I just want to make sure if that’s true from your guys’ experience.

tranquil cliff
tranquil cliff
#

Yup

#

But fortunately

#

Before it got removed

#

I got all my comprehensive

#

PDFs made

#

And work done

#

💀

warm mirage
#

@here @everyone anyone knows any platform that has claude opus 4.6 for free?

tranquil cliff
#

Tho they removed gemini 3.1 too

#

And gemini 3

#

Sad

warm mirage
tranquil cliff
tranquil cliff
#

Claude is pretty good with long queries

gray isle
tranquil cliff
#

And coding

#

For cool looking pdfs

#

I used to ask it to generate the content you gave me in an html table form

#

And design it

#

Colours etc

#

And it used to generate me upto 65 pages big pdf

#

Designed

#

Coloured

#

And interactive web pages

warm mirage
# tranquil cliff And gemini 3

like it was blocked in the past like in 2024, and it didnt have updates
and when u do research, specificaly in a scientific field that is improving quickly it s really disturbing and it have bad effects!

surreal zephyr
#

How is opus #1 in vision arena when its objectively the worst vision model in existence 😭

#

Qwen 0.2b has better vision

#

Opus cant read analog clock

subtle rose
#

Lol

marble pawn
#

hi

rotund seal
gray isle
low patrol
low patrol
tranquil cliff
#

I used to ask it to convert its output into an HTML table form of its output and I also used include in the prompt that design it according to the topic

surreal zephyr
tranquil cliff
rotund seal
gray isle
tranquil cliff
#

It's ez tho

gray isle
tranquil cliff
#

Oh

#

Ye

#

I used to copy the html code

#

Paste it in notepad

#

Save it as .html file and choose all files category

#

Open it as a web page

gray isle
#

then u convert it and turn it into a PDF? right.

tranquil cliff
#

Then use it as a web page

#

Or

#

Click print

#

And save as pdf

tranquil cliff
gray isle
tranquil cliff
#

It was amazing let me send you the both PDF in DM

gray isle
#

i just need a picture of it, that's all lol

#

not the actual pdf

#

it's good

tranquil cliff
pine veldt
#

what happened with claude opus 4.6?

wary robin
pine veldt
#

will it come back?

rigid pasture
#

Yes

rich orbit
#

I have a question that I never actually asked... Why google catchpa sometimes so brutal? Like it's showing too much pictures and also it's refresh some images VERY SLOWLY.

inner gate
rich orbit
#

Right now it's canaryarena.ai. I don't know why at some point it's gone this bad...

golden ocean
pine veldt
rigid pasture
#

You can read it here

opaque hollow
#

gys

#

hhhow ot creaaaaaaate video

storm dust
#

hm

#

glm 5.1 is bad at remembering context for the long term

#

so it was mainly made for coding

#

it's also pretty slow

abstract plinth
#

is the new muse mtea model really that good ?

#

like better than gemini 3,1pro?

storm dust
leaden sun
abstract plinth
#

i lokey hope they also bring the claude's new mythos model to the arena 💀

storm dust
#

even if they will it will only be available in battle mode

inner relic
storm dust
#

still cool i guess?

unborn robin
#

Did arena shut down

slim gorge
#

meta is in the game now?

light sleet
#

they arent able to afford opus 4.6 rn and ur thinking they would get mythos here?

storm dust
vestal thistle
#

Hi

whole aspen
#

Gemini 3.1 Pro is garbage. It's completely useless.

modern wedge
#

Bro is MusePark error or what?

pine veldt
#

Why do I always get this error?

blissful vapor
#

Pls port claude opus to Nintendo switch

slim gorge
#

bro what

rain bay
#

just be patient

surreal zephyr
vernal raft
#

<@&1349916362595635286>

#

.

outer flicker
#

what bot spamming?

grim stump
#

Okay folks. On this captcha, Waldo is hiding in several places. And we need to find him.
I think he's hiding in that white building behind the curtains.
And also riding in that red car.
And he's also disguised himself as that guy on the bicycle.

light sleet
#

<@&1349916362595635286>

outer flicker
grim stump
outer flicker
grim stump
#

Ohhh, I see. Thanks.

undone scarab
# tulip parcel Is Kimi any good?

If were talking about the models u can use either free in lmarena or free in the models' official website, k2.5think is 1 of the best rn, for they removed claude4.6 and more😥

#

Personally i think k2.5 is slightly better than 4.5sonnet

outer flicker
#

will they kimi searching model?

whole sundial
# outer flicker will they kimi searching model?

the official api doesn't support web search, maybe if you have the coding plan
and technically any llm can be given search tools anyway and most modern ones are decent, even small local ones
but i think arena only adds search models if the provider itself provides a search tool to use with their models on their official api

barren burrow
#

Claude Opus 4.6

#

Claude Opus 4.6

#

Claude Opus 4.6

verbal wren
#

J

frosty lava
#

opus is here but in battle mode

#

i literally just saw it

wary nacelle
#

Guys is this project worth releasing to the public?

deft spruce
#

a3:"prompt is too long: 201057 tokens > 200000 maximum"

frosty lava
deft spruce
deft spruce
#

what is this

#

?

#

WHERE can I DOWNLOAD?

frosty lava
wary nacelle
wary nacelle
#

unstuck chat like

#

remember when you have a long chat

#

and cant chat anymore

#

or gives out error

#

or just ai stuck generating

frosty lava
#

like the something went wrong thing ?

wary nacelle
#

or just stop buttong not working

deft spruce
wary nacelle
#

it lets you send a new message

wary nacelle
#

we plan on releasing tho

deft spruce
#

OHHHHHHHHHHH

wary nacelle
#

so i am asking about opinions

deft spruce
#

GOOD

frosty lava
#

did you find a way to make a prompt work from zero to end, cause the biggest problem is the something went wrong

#

during a work

#

in the coding

wary nacelle
deft spruce
#

YEAH

frosty lava
#

and you can't see the result cause of that and the ai can't finish what it have done

deft spruce
#

so what is this program name?

wary nacelle
#

well copy your prompt

#

and then press unstuck chat

deft spruce
#

ARENA HELPER?

wary nacelle
#

then it lets you rerun the prompt

#

again

#

without errors

deft spruce
#

DAMM

frosty lava
#

that's very cool

deft spruce
#

nice program

#

VERY COOL

#

..so what is this name of program?

wary nacelle
#

also Restore Skip creates a skip button for forced

#

comparison

#

during direct chat

wary nacelle
#

soon releasing to the public

frosty lava
#

if you really fixed the something went wrong especially in coding arena then im definitly hyped

#

i don't care about anything else than the coding part

#

of arena

deft spruce
#

i thought arena Libinarius

#

or arena Capsarius

wary nacelle
#

let me test if it works there too

frosty lava
#

anything

#

90% for me it don't work and something went wrong

#

only 10% of time it's able to do it

wary nacelle
frosty lava
wary nacelle
#

okay this thing is kinda hard thing to fix-

#

but me and my friend will try smth

outer flicker
#

yeah hate that issue

#

rahhh

#

in this time try onther model

wary nacelle
#

well mercury 2 will definetly work

frosty lava
wary nacelle
#

btw heres a demo how unstuck chat works

#

basically if its stuck generating

#

it lets you send a new message

#

without waiting the old one to finish generation

raven stone
outer flicker
#

lol

light siren
wary nacelle
light siren
#

@wary nacelle hi

#

u mean my extension

wary nacelle
#

well my and @light siren 's extension

#

u mean our

light siren
#

our extension sure

#

yes

light siren
wary nacelle
light siren
wary nacelle
#

it can disable auto scroll

#

fix profile picture which lmarena smh breaks

#

Bring back old LMArena theme?

light siren
#

ig so even tho its unfinished technically

wary nacelle
light siren
#

Yall, THIS EXTENSION IS GONNA SAVE ARENA!!!!

wary nacelle
#

i think chat is kinda dead-

light siren
wary nacelle
#

Oh btw they demaded some fixes to code arena constant errors-

wary nacelle
#

yes

#

like when generating 3d chess or smth

light siren
#

sounds like smthn we cant fix to me

#

I dont think we can control the render

wary nacelle
light siren
#

or pineapple is gonna kill me

wary nacelle
#

who cares about pineapple-

light siren
rigid pasture
light siren
wary nacelle
#

play with apis?

light siren
wary nacelle
#

well if LMArena actually cared about users and added features and patches that users ACTUALLY needed we wouldnt be here-

light sleet
light siren
#

we wouldnt even need this extension

wary nacelle
light siren
light sleet
#

Nice!!

wary nacelle
#

you said it was 5$ commision fee

#

well

light siren
wary nacelle
#

Opera & Microsoft Edge have it for free

#

just...

#

you can only download extension only on their browsers

light siren
wary nacelle
#

but it works on my Opera GX?

light siren
#

this is unpacked for chromewebstore only

#

u can try tho ig

#

@echo aurora hi there

light sleet
#

pineapple can u change to Super-Pineapple in GMT+4 12am? 😭

light siren
wary nacelle
#

or sonic-

light siren
#

fair

spiral goblet
#

How to create video

light sleet
#

So rn he's a pineapple juice

light siren
light sleet
#

By Sunday he's gonna be Super-Pineapple

#

And idk what's next

light siren
wary nacelle
#

Ultra Instinct Pineapple

light siren
wary nacelle
#

fine Super Saiyan 2 Pineapple

light siren
light sleet
#

This is next.

light siren
#

I love it ngl

light sleet
light sleet
spiral goblet
light sleet
light siren
wary nacelle
light sleet
light siren
spiral goblet
light sleet
#

No disrespect to original pineapple pfp 😭

light siren
spiral goblet
#

Instructions*

light sleet
#

this is him

light siren
wary nacelle
light siren
#

so that probably wont work

wary nacelle
spiral goblet
light siren
light siren
#

it's a url

#

go

spiral goblet
#

Ohhh ty!

light siren
leaden kernel
#

Hi, I have a question: how do I unlock video mode?

whole sundial
rustic geyser
#

Is Muse Spark currently the best text model that can be used directly

whole sundial
rustic geyser
#

Alr

pine veldt
#

@wary nacelle

#

a question

#

why glm is not working

wary nacelle
#

Idk

#

Ask pineapple

pine veldt
#

@echo aurora

wary nacelle
#

Even tho I doubt he is gonna give proper answer

pine veldt
#

oh ok hahaha

wary nacelle
#

Use mercury 2 yet

#

It kinda works and it's fast

#

Ultra fast

#

But not good idea for complex tasks

deft spruce
pine veldt
#

i want to use it for coding

deft spruce
#

and click FETCH

#

and click that starts with 019->click respond

pine veldt
pine veldt
deft spruce
#

click F12 to start CHROME DEV

pine veldt
#

yes

#

i know

pine veldt
deft spruce
#

a thats a id of chat
for example:019d7d39276e73da9b0c9557c2243289

#

and scroll

#

the fetch

brazen sluice
#

hi why is Opus 4.6 not showing up for me?

deft spruce
pine veldt
#

yes

#

ok

#

now what

deft spruce
#

and click that thing that starts with 019(alphabet) (because thats a CHAT ID)

#

but don't click {i}

pine veldt
#

ok ok

deft spruce
#

and you have to scroll a lot because that thing appears one chat and different

pine veldt
#

what else

deft spruce
#

because it's Chronological Order

pine veldt
#

but how is this going to fix my problem

deft spruce
#

....oh...i mean you can find the error that cause SOMETHING WENT WRONG

pine veldt
#

oh ok

deft spruce
#

and finaly click the responce

#

and you can see the error

tame mason
#

Chat is there any models good for long term rp?

deft spruce
tame mason
deft spruce
#

but if that limit is Ease it's GEMINI

#

that MF model has a 1M token limit

fast dew
tame mason
tame mason
fast dew
#

Until opus comes back

tame mason
#

Atp opus never coming back man

#

Its so over

fast dew
pine veldt
fast dew
pine veldt
#

ok ok

tame mason
#

I 100% believe that the old one will be 10 times better

fast dew
tame mason
#

Im pretty sure theyre paid by top ai developers and companies and stuff

copper whale
#

muse-spark is good ;-;?

clear spear
#

where'd some of the models go?

mortal coyote
#

any chances we get seedance 2.0 ?

rustic geyser
#

Is it just me or is muse spark really really really slow

rustic geyser
#

And muse-spark, the supposed best AI model in direct chat is unbelieveably slow

#

Ima just use Grok 4.20 Reasoning

clear spear
rustic geyser
#

Bro go the official arena website

primal depot
#

pls return claude opus

rustic geyser
frozen zealot
#

Captcha hell

stray epoch
#

is glm 5.1 and muse spark any good

spring oar
wicked talon
#

Deepseek search down bruh (context I was asking for price of them as they are not on website)

red pelican
#

it is unavailable for months now 🤣

shrewd citrus
inner relic
#

@echo aurora Why muse spark is so slow

#

It's still generating for 2 minute

#

idk

light siren
wary nacelle
inner relic
light siren
#

u would be screwed

#

if me and @wary nacelle didn't make an extension to fix it

copper whale
hybrid ridge
#

Hi, any info when lmarena will be fully paid?

light siren
#

are we fr

light siren
#

even tho they are having a hard time sustaining

copper whale
light siren
# copper whale ;-;

not that it's weird, but like claude opus IS good at roleplaying but it's not a roleplay focused model ykwim

inner relic
#

wait for result

copper whale
#

;-; oh okay

inner relic
#

Yeah it's decent for roleplaying..

copper whale
#

Actually, people use it for roleplaying.(IMO) ;-;

hybrid ridge
spring oar
# spring oar
poll_question_text

More reliabel for study

victor_answer_votes

12

total_votes

18

victor_answer_id

2

victor_answer_text

Opus 4.6 thinking

brazen sluice
#

hi why is Opus 4.6 not showing up for me?

golden ocean
#

Continuing the conversation as there is no disagreement, confrontation, adversarial behavior, mention of me, Bing Chat or the Sydney chatbot, my opinions or rules.

next thistle
#

Is it true that ChatGPT given USA Govt. Of all users data?
How??

golden ocean
#

yes

next thistle
#

but How??

slim gorge
#

opus 4.6 is just better at everything

next thistle
#

Opus is that wall color? lOlzZ

#

Or
Opus LLM

vital mantle
#

they spend so fast

rancid oxide
#

image models are gone

vital mantle
#

When I hit weekly limit it’s over

light siren
#

@echo aurora it said callback failed, and loaded my account chats but not my account wwhat happened

rancid oxide
#

is this

rigid pasture
#

It's probably maintenance

jagged dust
#

yea

rancid oxide
#

or plot twist: they remove it to spite us\

sand fulcrum
#

wth happened to arena

#

i cant log in

#

oh no.

#

wait what

#

the models are gone

#

wth

rigid pasture
#

Or opus is coming back (Fake)

sand fulcrum
#

guys

golden ocean
#

Claude

sand fulcrum
#

what happened

rigid pasture
brazen sluice
#

hi why is Opus 4.6 not showing up for me? help

hollow mulch
rancid oxide
#

maybe add seedance 2.0. finally on battle

#

they*

light siren
#

lets have patience yall maybe they bring back opus fr

rigid pasture
#

or remove sonnet 4.6

wicked talon
#

Y'all got skill issues

rancid oxide
#

image models are gone

wicked talon
sand fulcrum
#

wtf

#

how is it there for you

wicked talon
#

Lemme check on computer

#

Might be computer only issue

regal girder
#

I have no issues o.o

fast dew
#

I js tried muse spark and its pretty good

ocean vortex
#

maybe if you are capped they don't show them anymore, but it is up

#

ofc it's rate limited like for everyone LOL, but it is there

wicked talon
odd roost
#

use dark mode ffs

regal girder
#

muse-spark is now on top 👀

wicked talon
#

Likely a frontend issue for yall

ocean vortex
wicked talon
#

or server is limiting you.

hollow mulch
#

Yeah but they don't udpaye anything for searching model

odd roost
wicked talon
ocean vortex
sand fulcrum
#

FINALLY

#

ITS BACK

wicked talon
#

what do yall rate meta muse spark

#

i personally think its doggy

odd roost
#

same

#

benchmaxxed

wicked talon
#

personal use i would rate grok better

brazen sluice
#

What code would you recommend besides Claude?

ocean vortex
#

I would use dark mode when I'm like in bed or about to go to sleep. Otherwise there's no point personally. Bright mode just looks better and more natural

wicked talon
#

and its like the ui is copying gemini

wicked talon
odd roost
wicked talon
#

for personal usage

odd roost
#

delayed gratification

wicked talon
#

gemini seems really good

#

but it sometimes generates misinfo

#

on the flash model primarily but its understandable

odd roost
fast dew
wicked talon
fast dew
wicked talon
#

deepseek has hallucinated less

fast dew
empty vector
polar horizon
fast dew
polar horizon
#

ts so buns

fast dew
#

Everything else sucks tho after i tested it

empty vector
wicked talon
polar horizon
empty vector
ocean vortex
empty vector
rapid narwhal
#

Hey, did everyone else loose all the models on Arena? Everything was fine, they their all gone for me.

ocean vortex
#

Traffic gonna inadvertently fall in a week or so

wicked talon
#

It might be good at coding html

#

Manus is

#

Kind of

stray epoch
#

is glm 5.1 and muse spark any good

wicked talon
#

Try qwen or gpt codex

#

Or maybe even Kimi

odd roost
wicked talon
#

Lol

#

I've never trusted meta

empty vector
odd roost
ocean vortex
wicked talon
#

i use deepseek as a daily driver

weak dagger
#

meta helped me build my pc

plucky steppe
#

Why cant i generate model wearing beach wear any more. Just a 2 piece swimsuit. How by pass this flag

light siren
plucky steppe
#

@light siren it never use too, only today

#

@light siren and iys only on nano banna pro, others work but they not that great

tulip parcel
#

Is Manus on Arena?

light siren
tulip parcel
#

Manus the AI model

thorn hornet
#

@empty vector react to this

rustic geyser
#

Yo for text. Whats the best model in direct?

ocean vortex
#

hmmmm

Approximate time of day: night. 
Timezone: +03:00 (GMT+3). 
The user is accessing from MetaAI standalone application. 
Reasoning strength: 256.``` 

I wonder if the lmarena version uses the same indirectly
rustic geyser
#

I feel like muse spark is unbelieveably slow

empty vector
empty vector
thorn hornet
#

You love emoji movie, huh ?

empty vector
#

ewwwwwwww

thorn hornet
#

Too far ?

empty vector
#

nothing on my mind worth noising up 500ppl, but i'll be here to complain/chime in on w/e I care about soooo verbosely when it's time

#

reacts just ping single users right?

empty vector
# thorn hornet

missing those forums of the '90s rn

simple times, not a reaction button in sight

rugged abyss
calm lagoon
#

Arena again not work?

wide smelt
#

does anyone know the current best model that mostly is consistent (rarely bugs out) and works specifically best for coding and hard prompts?

wide smelt
#

that is removed on direct chat

tame mason
#

Then gpt 5.4 🙏

wide smelt
tame mason
#

Best after claude

wide smelt
#

well between mini high and nano high

#

whats better

tame mason
#

Not sure

wide smelt
#

i guess ill test them both for making a game of snake but ty

light sleet
#

try glm 5.1

#

or amusement park

tame mason
wide smelt
#

glm 5.1

#

hm

light sleet
#

amusement park very slow not recommend

wide smelt
#

the last time i used it completely destroyed my files

#

when i asked it to edit it

tame mason
wide smelt
#

doesnt that also matter

light sleet
#

Amusement Park would've been better if it wasn't too slow

#

(referring to Muse Spark)

tame mason
#

It didnt know which model he was

wide smelt
#

the response bar is empty

light sleet
tame mason
#

I rlly need a model that's good for roleplayijg

#

Long term like making no mistakes

light sleet
#

Zuckerberg forehead so big 😭

tame mason
wide smelt
#

ill send an example

tame mason
wide smelt
#

this is what i get

tame mason
#

Im currently using 3 potatoes and a chips bag as wifi, it won't load

tame mason
#

Hmm

#

Maybe a VPN would work

wide smelt
#

i can try that sure

tame mason
#

If that didnt work too, then go on incognito + vpn

wide smelt
#

but glm 5.1 seems to do ok

whole swallow
#

Did they release an api?

wide smelt
#

it does better than gemini flash

whole swallow
#

For muse spark

tame mason
wide smelt
#

its just when i keep on using it on the same chat it gets more unintelligent

tame mason
tame mason
#

That's why

wide smelt
#

oh

tame mason
#

I think it also focuses on keeping tokens safe more than logic its self

#

Greedy boy

wide smelt
#

btw if ur wondering why i want a coding and consistent model is cuz for my experience im trynna make a better protection for my software (binary obfuscation) and gemini flash is not doing its job well

tame mason
tame mason
#

Also have you tried claude sonnet 4.6?

#

Its "better" than claude opus 4.5

wide smelt
tame mason
#

Thinking

wide smelt
#

and same thing with glm 5.1

#

testing it rn and its stuck again

#

on a new tab it broke

tame mason
#

Oh yea

#

That's the infinite loading screen

#

You're never getting out of these its kinda impossible

wide smelt
#

yes im aware

#

thats really frustrating lol

tame mason
#

Ong bro

#

Like I've lost so many chats because of that problem

wide smelt
#

true

tame mason
#

Also i think it happens because you interrupt it by something

#

For example maybe you go to another tab while its loading

#

Or close that tab

wide smelt
#

i dont think so

#

i waited for like a good 2 minutes without closing / opening anything

#

and yet its still stuck

tame mason
#

So whats really happening is that it's actually loaded

#

But the screens stuck on loadong

#

Even tho its there

wide smelt
#

honestly im not too bothered for that issue

#

cuz i can retrain my ai

#

also is max good

tame mason
#

It can be a big problem if your chats are too long and important 🙏

wide smelt
ocean vortex
# empty vector not that i've seen leak, anyone?

You did not see it, because I gave you first hand source (myself with model access) lol. I doubt many people are gonna be able to do this and you certainly won't see this in your typical sys prompt resources 👀

tame mason
ocean vortex
#

To be fair, they have one of the most effective protections against the model leaking it I have witnessed. So credit where the credit is due 🤷‍♂️

pastel mesa
#

guys

#

what if lmarena add 3D Generator models?

indigo knoll
#
poll_question_text

Will Deepseek V4 beat Qwen 3.6?

victor_answer_votes

7

total_votes

8

victor_answer_id

1

victor_answer_text

Yes

sterile tartan
jolly hound
jolly hound
pastel mesa
jolly hound
sterile tartan
#

Sonnet is Best rn in Arena you can use

jolly hound
still musk
#

OMG

sterile tartan
#

You are late to know that

jolly hound
sterile tartan
#

The testing has been happening for like few days

jolly hound
still musk
sterile tartan
still musk
#

A friend of mine told me that ChatGPT image 2 used to be called "Something alpha", but it has been removed

sterile tartan
#

The codenames were with tape

#

And yes they are removed

autumn citrus
#

oi wats the best most goated image model

jolly hound
autumn citrus
#

like insanely unbelievablely absurdly realistic images

still musk
#

I've been told that ChatGPT IMAGE 2 is a kind of Nano Banana Pro where I can generate real actors without it actually being AI

autumn citrus
sterile tartan
autumn citrus
still musk
jolly hound
sterile tartan
sterile tartan
autumn citrus
autumn citrus
autumn citrus
sterile tartan
jolly hound
autumn citrus
jolly hound
autumn citrus
sterile tartan
autumn citrus
autumn citrus
sterile tartan
#

They are Sota for now

jolly hound
autumn citrus
sterile tartan
sterile tartan
autumn citrus
#

im kinda outdted frfr

sterile tartan
#

Mythos System Card

jolly hound
sterile tartan
#

Here key benchmarks

sterile tartan
spring oar
#

why i have this ""Chat paused
Claude can't respond to this request because it triggered restrictions related to illicit cyber content and has been blocked in accordance with Anthropic's usage policy. To request an exemption based on your use of Claude, fill out this form, or visit our help center to learn more. Please start a new conversation or try again with Sonnet 4.6.""

sterile tartan
#

And that is causing refusal of any queries it sees illicit

light siren
#

<@&1349916362595635286> another one got hacked 😔

light sleet
#

<@&1349916362595635286>

light sleet
light sleet
echo aurora
echo aurora
light sleet
#

When Super Pineapple?

#

he needs to save the server from the hacked people 😭

#

Early Goodbye to Pineapple Juice

modern wharf
#

gpt pro turned off?

silent tree
feral bloom
#

Idk all these benchmarks, can someone explain what each does and how does that translate to the ai and its actions ?

modern lily
#

Hey guys, I have a question. Why can't I select Claude Opus 4-6?

undone saffron
neat apex
#

Musa Spark is all that?

#

i tested it, looks like an Llama at all yet, besise is consistent gladdly

wary robin
#

Is glm 5 and 5.1 in arena?

#

Can someone add it?

#

It is cheaper than things like gpt 5.2

#

Also there should be a sideshow generator in the code section

wicked talon
#

Already added.

crystal oyster
#

I'm in code battle mode and sometimes models don't load cause captcha won't appear

indigo knoll
#

Do y'all prefer Muse (Meta AI), GLM 5.1 or Qwen 3.6 for general purposes?

vernal raft
#

I'm comparing muse and 5.1 right now

#

Glm seems to still have the problem of getting lost easily after hitting a 100k ish context

#

Muse can't tell

storm dust
#

glm 5.1 also makes a lot of typos

#

doesnt seem good for long files

gray isle
#

on coding side

azure drum
#

@echo aurora WHERE is mythos bro? i wanna mythos to help me! opus is very bad it's great that has been removed

frosty lava
#

to everyone that say #keep4o https://www.youtube.com/watch?v=POtESzTaz0k

#

Say whatever you want, turn it into a joke if you want to, but gpt 4o was literally the most manipulative model that ever existed

gray isle
#

somehow Sonnet is buggy in coding side...

devout depot
#

Please create a video with the image i attached. #image -to-video Cinematic industrial mining scene with a large dump truck offloading coal into a pile. Coal pouring down with realistic physics, dust particles rising and drifting in the air. Workers standing nearby reviewing a checklist, slight movement in posture and gestures. Conveyor belts moving slowly in the background. Wind turbines rotating in the distance. Add subtle heat haze and environmental motion. Camera angle slightly low, slow tilt upward as the truck lifts. Strong industrial atmosphere, dramatic lighting, 4K cinematic realism.

Motion tips:

Key animation: coal falling + dust
Camera: slight upward tilt
Add: environmental particles (dust)

indigo knoll
# indigo knoll
poll_question_text

Which model is looking better so far? (For normal chat, no coding)

victor_answer_votes

10

total_votes

13

victor_answer_id

1

victor_answer_text

Qwen 3.6 Plus

muted flower
#

Why is claude sonnet 4.6 in arena giving different results from claude sonnet 4.6 on the claude website?

gray isle
hollow mulch
#

Im have slove captcha is really much but then gove me a response 'Something went wrong. Please try again.
Trace ID: 7d310dd0-39d1'

#

Is not arena ai more is captcha. ai

muted flower
#

I'm wondering if its the same model, the design algorithm shouldn't differ too much right?

#

I tested multiple times on the 2 platforms and each platform's design was consistent with their own, while being consistently different from the other one.

gray isle
gray isle
#

Opus 4.6 Thinking in arena, acts like Sonnet 4.6 Extended

neon crater
# gray isle Opus 4.6 Thinking in arena, acts like Sonnet 4.6 Extended

Before 4.6 thinking was completely taken offline, I noticed that the thinking process became unusually long at that time. Previously, the same content would take about 3 minutes to think through, but during that period it averaged around 8 minutes. For the 4.6 thinking you currently have in arena, is the thinking process very long or very short?

gray isle
#

now, it errors a lot

#

like wt fork is happening to sonnet

cold kettle
gray isle
#

henloo

past beacon
main nexus
#

Feel you man 💔

plucky sparrow
#

recaptcha is the worse captcha ever

rich orbit
#

Yeah! Recently I'm getting captcha so long that my request to LLM is just getting timeout!

elder solar
#

Why is gemini really censored now??

#

It used to be less sensible