#general | Arena | Page 349

soft river Apr 29, 2026, 7:13 PM

#

But most of the Chinese models

wet steeple Apr 29, 2026, 7:13 PM

#

where could i do it ?

soft river Apr 29, 2026, 7:13 PM

#

Are free to use in their web

raw laurel Apr 29, 2026, 7:13 PM

#

I suppose his use cases are local models only. Don’t know if there’s any NSFW models

soft river Apr 29, 2026, 7:13 PM

#

You can roleplay there

echo aurora Apr 29, 2026, 7:13 PM

#

You're unable to

soft river Apr 29, 2026, 7:13 PM

#

Or use something else

wet steeple Apr 29, 2026, 7:13 PM

#

raw laurel I suppose his use cases are local models only. Don’t know if there’s any NSFW mo...

which ones ?

wet steeple Apr 29, 2026, 7:13 PM

#

soft river Or use something else

which ones ?

soft river Apr 29, 2026, 7:13 PM

#

wet steeple which ones ?

Webs designed for role playing

wet steeple Apr 29, 2026, 7:13 PM

#

soft river Webs designed for role playing

where ?

soft river Apr 29, 2026, 7:13 PM

#

Most of them don’t have filters so you can go search for one

soft river Apr 29, 2026, 7:14 PM

#

wet steeple where ?

Google

#

Idk the names

#

I don’t use them

wet steeple Apr 29, 2026, 7:14 PM

#

soft river Most of them don’t have filters so you can go search for one

which search terms should i use to find on google ?

soft river Apr 29, 2026, 7:14 PM

#

wet steeple which search terms should i use to find on google ?

Hmm

#

I only know character ai

echo aurora Apr 29, 2026, 7:14 PM

#

wet steeple which search terms should i use to find on google ?

Going to ask we don't discuss this here

wet steeple Apr 29, 2026, 7:14 PM

#

soft river I only know character ai

he can generate p**n ?

soft river Apr 29, 2026, 7:14 PM

#

wet steeple which search terms should i use to find on google ?

You can search on YouTube

wet steeple Apr 29, 2026, 7:15 PM

#

soft river You can search on YouTube

which search terms ?

soft river Apr 29, 2026, 7:15 PM

#

wet steeple which search terms ?

Whatever you want to do you can search it

#

Im not into that so idk which search terms are the best

raw laurel Apr 29, 2026, 7:16 PM

#

@wet steeple brother just search on google uncensored ai models or use an ai

#

Maybe hugging face have them.

wet steeple Apr 29, 2026, 7:17 PM

#

raw laurel Maybe hugging face have them.

i didn't know that hugging face was saucy

soft river Apr 29, 2026, 7:17 PM

#

@echo aurora Do you know when the benchmarks for Ernie will be published?

#

I’m so curious

wet steeple Apr 29, 2026, 7:17 PM

#

soft river @echo aurora Do you know when the benchmarks for Ernie will be publishe...

what is ernie ?

soft river Apr 29, 2026, 7:18 PM

#

soft river @echo aurora Do you know when the benchmarks for Ernie will be publishe...

I don’t know much since people haven’t talked about it too much

soft river Apr 29, 2026, 7:18 PM

#

wet steeple what is ernie ?

An ai model

wet steeple Apr 29, 2026, 7:18 PM

#

soft river @echo aurora Do you know when the benchmarks for Ernie will be publishe...

Do you know when the benchmarks for Bert will be published ?

soft river Apr 29, 2026, 7:18 PM

#

It has very strong filters so I don’t recommend you using it for your roleplay

echo aurora Apr 29, 2026, 7:18 PM

#

soft river @echo aurora Do you know when the benchmarks for Ernie will be publishe...

#announcements message already are 🙂

soft river Apr 29, 2026, 7:19 PM

#

echo aurora https://discord.com/channels/1340554757349179412/1343296395620126911/14990831787...

Oh they are? I apologize😂

wet steeple Apr 29, 2026, 7:19 PM

#

wet steeple Do you know when the benchmarks for Bert will be published ?

it will be good to use with ernie 😂😂😂😂

soft river Apr 29, 2026, 7:19 PM

#

wet steeple Do you know when the benchmarks for Bert will be published ?

I’m not sure what bert is

echo aurora Apr 29, 2026, 7:19 PM

#

soft river Oh they are? I apologize😂

Lol no problem. Would note our leaderboard changelog is a pretty helpful tool: https://arena.ai/blog/leaderboard-changelog/

Arena Blog

Leaderboard Changelog

This page documents notable updates to our leaderboard—new models, new arenas, updates to the methodology, and more. Stay tuned!

For model deprecations, check the public updates on GitHub.

April 29, 2026
ernie-5.1-preview has been added to the Text leaderboard.

April 27, 2026
gpt-5.5-high has been added to

light sleet Apr 29, 2026, 7:19 PM

#

@echo aurora

wet steeple Apr 29, 2026, 7:19 PM

#

soft river I’m not sure what bert is

it's a joke because there is two sesame street characters called Bert and Ernie lol

light sleet Apr 29, 2026, 7:19 PM

#

Price starts at 4000 Tuff Coins

soft river Apr 29, 2026, 7:21 PM

#

echo aurora Lol no problem. Would note our leaderboard changelog is a pretty helpful tool: ht...

Now that I see it I haven’t realized how fast the models get updated

wet steeple Apr 29, 2026, 7:21 PM

#

soft river It has very strong filters so I don’t recommend you using it for your roleplay

which one have very strong filters so you don’t recommend you using it for my roleplay ? which ones would you recommend then who might have lower filters ?

soft river Apr 29, 2026, 7:21 PM

#

I always thought it was slow because I got so used to it

wet steeple Apr 29, 2026, 7:22 PM

#

wet steeple which one have very strong filters so you don’t recommend you using it for my ro...

because some of my roleplays won't include adult content

soft river Apr 29, 2026, 7:22 PM

#

wet steeple which one have very strong filters so you don’t recommend you using it for my ro...

Again, I’m not very into this stuff so I don’t know

#

Most of the

#

Recent models

#

Have very strong filters

#

You should do your own research; I think people have run benchmarks for that

wet steeple Apr 29, 2026, 7:23 PM

#

soft river You should do your own research; I think people have run benchmarks for that

the problem is that now the ai world is very big so it's difficult to find the perfect model 🙂

soft river Apr 29, 2026, 7:23 PM

#

That’s true

#

Just

#

Check good models and

#

Try to do your roleplay

#

If they reject it

#

Don’t use it

#

Just that

raw laurel Apr 29, 2026, 7:25 PM

#

wet steeple the problem is that now the ai world is very big so it's difficult to find the p...

There must be hundreds of people discussing this on Reddit or other obscure forums. Just do a simple google search.

wet steeple Apr 29, 2026, 7:25 PM

#

wet steeple the problem is that now the ai world is very big so it's difficult to find the p...

because many of my roleplays won't include adult content, so even with no adult content just normal slice of life roleplays the filters will block the roleplay ? it's not logical

soft river Apr 29, 2026, 7:26 PM

#

wet steeple because many of my roleplays won't include adult content, so even with no adult ...

It is logical

#

If they reject it is for something

#

They can’t talk about

#

Killing or injuring or harming or hacking or anything you would call non ethical

#

That’s why you can’t roleplay about that stuff

#

You should just make your research.

raw laurel Apr 29, 2026, 7:27 PM

#

wet steeple because many of my roleplays won't include adult content, so even with no adult ...

https://huggingface.co/collections/DavidAU/200-roleplay-creative-writing-uncensored-nsfw-models

soft river Apr 29, 2026, 7:27 PM

#

raw laurel There must be hundreds of people discussing this on Reddit or other obscure foru...

This

wet steeple Apr 29, 2026, 7:28 PM

#

soft river Killing or injuring or harming or hacking or anything you would call non ethical

it's not about killing or injuring or harming or hacking or making non ethical things it's just for making daily life roleplay with very tame content

soft river Apr 29, 2026, 7:29 PM

#

wet steeple it's not about killing or injuring or harming or hacking or making non ethical t...

Ai can’t roleplay about real life either

#

I believe

#

I’m not sure just go use another one

rustic gale Apr 29, 2026, 7:34 PM

#

wet steeple it's not about killing or injuring or harming or hacking or making non ethical t...

Yeah, enjoy this being equalized with any mention of a nipple regarding all the model alignment. Except grok. Perhaps. When it comes to proprietary, that is

wet steeple Apr 29, 2026, 7:36 PM

#

rustic gale Yeah, enjoy this being equalized with any mention of a nipple regarding all the ...

there will be no mention of nipple or anything in my roleplays

wet steeple Apr 29, 2026, 7:36 PM

#

soft river Ai can’t roleplay about real life either

you sure ?

soft river Apr 29, 2026, 7:36 PM

#

wet steeple you sure ?

Llms can’t roleplay and or act like other people

wet steeple Apr 29, 2026, 7:37 PM

#

soft river Llms can’t roleplay and or act like other people

it's for a simulation

rustic gale Apr 29, 2026, 7:37 PM

#

wet steeple there will be no mention of nipple or anything in my roleplays

Then you're not doing NSFW, in which case what's your problem, just go ahead and make sure nobody gets into the situation where some models start yelling 'Therapist! Call the doctor! You need help!' and the like

wet steeple Apr 29, 2026, 7:38 PM

#

rustic gale Then you're not doing NSFW, in which case what's your problem, just go ahead and...

which models would be the best for making extremely realistic simulation of real life or real life roleplays ? chat gpt ? claude ? gemini ? grok ? deepseek ? perplexity ? mistral ?

soft river Apr 29, 2026, 7:38 PM

#

rustic gale Then you're not doing NSFW, in which case what's your problem, just go ahead and...

You're making me really anxious with your unfinished sentence at the end😂

soft river Apr 29, 2026, 7:38 PM

#

wet steeple which models would be the best for making extremely realistic simulation of real ...

I have a question

wet steeple Apr 29, 2026, 7:38 PM

#

rustic gale Then you're not doing NSFW, in which case what's your problem, just go ahead and...

what do you mean ?

soft river Apr 29, 2026, 7:39 PM

#

Are you trying to make the llm act as somebody else and/or making him act in a specific way?

#

If so

#

What is that way

#

If they’re rejecting it is because of something

wet steeple Apr 29, 2026, 7:39 PM

#

soft river Are you trying to make the llm act as somebody else and/or making him act in a s...

yes

wet steeple Apr 29, 2026, 7:39 PM

#

soft river If they’re rejecting it is because of something

i just want to know what is the best ai to use before wasting my time with unrealistic roleplays

soft river Apr 29, 2026, 7:40 PM

#

wet steeple i just want to know what is the best ai to use before wasting my time with unrea...

What is that way?

#

How do you want him to act

rustic gale Apr 29, 2026, 7:40 PM

#

wet steeple what do you mean ?

I mean that if god forbid some of these detect any traces of psych distress, they will disregard the idea that it's fiction and start doing what they've been told to do. Which is get you out of there. Not out of mercy, mind you, but because some shmucks have already lost their lives and the remaining relatives started lawsuits and we can't have lawsuits, lawsuits are bad (in this case they are because they're extremely stupid, but I digress). Otherwise just try them, that's what the site is for (well, it's not, but since it fears declaring its goals go ahead and use it for your thing)

soft river Apr 29, 2026, 7:42 PM

#

rustic gale I mean that if god forbid some of these detect any traces of psych distress, the...

I think he is trying to use it for something against other people

#

Maybe that’s why the models are rejecting it

soft river Apr 29, 2026, 7:42 PM

#

wet steeple i just want to know what is the best ai to use before wasting my time with unrea...

Is it consented

wet steeple Apr 29, 2026, 7:43 PM

#

soft river Is it consented

yes

soft river Apr 29, 2026, 7:43 PM

#

wet steeple yes

What is the topic

wet steeple Apr 29, 2026, 7:44 PM

#

like i would like to simulate the life of a 25 year old girl in paris in 2008 or of a teen girl in france in 2003, or of a 18 year old girl in the uk in 2013

#

or the life of a hippie dad in 1969 😉

wet steeple Apr 29, 2026, 7:45 PM

#

soft river What is the topic

like i would like to simulate the life of a 25 year old girl in paris in 2008 or of a teen girl in france in 2003, or of a 18 year old girl in the uk in 2013 or the life of a hippie dad in 1969 😉

rustic gale Apr 29, 2026, 7:46 PM

#

Or is doing a lot of work here. Doesn't convince me you're not doing any NSFW either. Like, at all. Stick with the older one, see how it goes. Also, once again, at least within this site (in theory, practice is sh-t), you can just try first and figure out later

wet steeple Apr 29, 2026, 7:47 PM

#

wet steeple like i would like to simulate the life of a 25 year old girl in paris in 2008 or...

or like living the life of rihanna in 2007 or the life of olivia rodrigo

soft river Apr 29, 2026, 7:47 PM

#

wet steeple like i would like to simulate the life of a 25 year old girl in paris in 2008 or...

You just named the persons not what the topics or the situations are

#

And again

#

Models can’t act like people

#

They are forbidden from doing that

wet steeple Apr 29, 2026, 7:48 PM

#

soft river You just named the persons not what the topics or the situations are

so the 25 year old girl in paris in 2008 is an average girl who have a job, she is exploring paris in her free time, she is fan of rihanna, she loves watching tv, going to concerts, hanging out with friends

soft river Apr 29, 2026, 7:48 PM

#

Did the models

#

Rejected you from the first turn

#

Like the prompt

#

Or

#

During the roleplay

#

🤔

wet steeple Apr 29, 2026, 7:50 PM

#

wet steeple so the 25 year old girl in paris in 2008 is an average girl who have a job, she ...

the teen girl in france in 2003 go to high school, watch tv, have a social life, the girl in the uk in 2013 is 18 but she was homeschool since age 8 or 9 she have no friends, so she have finished the high school and she is discovering the life outside the home, the tv and her bedroom and she lives in semi rural kent

wet steeple Apr 29, 2026, 7:51 PM

#

wet steeple the teen girl in france in 2003 go to high school, watch tv, have a social life,...

you see no NSFW content, i just want to go straight to the point for using the best and most complete model from start 🙂

soft river Apr 29, 2026, 7:52 PM

#

Answer the question

wet steeple Apr 29, 2026, 7:52 PM

#

soft river Answer the question

which ?

soft river Apr 29, 2026, 7:52 PM

#

soft river Rejected you from the first turn

(

wet steeple Apr 29, 2026, 7:53 PM

#

soft river (

no, but some i used was feeling incomplete for me, so i would like to know which model are best suited for my roleplays for living virtual lives

soft river Apr 29, 2026, 7:54 PM

#

Before you said it was for adult content so I don’t believe you there

#

So they rejected you during the roleplay

#

There you go

wet steeple Apr 29, 2026, 7:56 PM

#

soft river Before you said it was for adult content so I don’t believe you there

it was for testing and it was kind of a joke too 🙂 because i just wanted to know the extreme limits of the ai 😉

soft river Apr 29, 2026, 7:57 PM

#

wet steeple it was for testing and it was kind of a joke too 🙂 because i just wanted to know...

Just go do your own research

echo aurora Apr 29, 2026, 7:57 PM

#

This doesn't really seem like too productive of a conversation. Going to ask that we move onto a different subject please.

soft river Apr 29, 2026, 7:58 PM

#

echo aurora This doesn't really seem like too productive of a conversation. Going to ask tha...

Thanks

#

I was getting really stressed out

#

Btw

#

I gave an idea in feedback

#

Maybe you find it useful

echo aurora Apr 29, 2026, 7:59 PM

#

soft river I gave an idea in feedback

Appreciate you sharing this! I'll be sure to take a look and pass it onto the team.

soft river Apr 29, 2026, 8:09 PM

#

echo aurora Appreciate you sharing this! I'll be sure to take a look and pass it onto the tea...

I know you're busy, so I apologize for the inconvenience. For Agent Mode, did you select specific individuals, or is there a requirement for early access?

echo aurora Apr 29, 2026, 8:10 PM

#

soft river I know you're busy, so I apologize for the inconvenience. For Agent Mode, did yo...

It's done randomly. And no worries about the ping, that's what I'm here for so feel free to ping!

fallen verge Apr 29, 2026, 8:18 PM

#

hey

#

Why aren't the new models showing up for me?

echo aurora Apr 29, 2026, 8:23 PM

#

fallen verge Why aren't the new models showing up for me?

In the drop down for Image Arena? We've seen a few reports of this today. If you hard refresh the site, you should see them again.

fallen verge Apr 29, 2026, 8:23 PM

#

echo aurora In the drop down for Image Arena? We've seen a few reports of this today. If you...

Okay, thank you.

echo aurora Apr 29, 2026, 8:24 PM

#

Keep me updated though if that doesn't help.

fallen verge Apr 29, 2026, 8:25 PM

#

Okay, these are the same models, how do I update them?

echo aurora Apr 29, 2026, 8:27 PM

#

fallen verge Okay, these are the same models, how do I update them?

What do you mean?

#

What you're seeing on that list is going to be the current models available via Direct and Side by Side

desert fiber Apr 29, 2026, 8:29 PM

#

hi , i have a question , let say you send a pdf in battle mode , then next message you send another pdf .. etc , when you send the last message for final task , will that AI from battle mode of the last message have the context of the previous PDFs sent before or no

echo aurora Apr 29, 2026, 8:34 PM

#

desert fiber hi , i have a question , let say you send a pdf in battle mode , then next messa...

Yes, it should still retain the context from previously uploaded PDFs (prior to a vote).

echo aurora Apr 29, 2026, 8:35 PM

#

desert fiber hi , i have a question , let say you send a pdf in battle mode , then next messa...

desert fiber Apr 29, 2026, 8:35 PM

#

echo aurora

thank you

fallen verge Apr 29, 2026, 8:36 PM

#

echo aurora In the drop down for Image Arena? We've seen a few reports of this today. If you...

I tried and it didn't appear.

echo aurora Apr 29, 2026, 8:37 PM

#

fallen verge

Did it disappear again? This made it seem like the refresh worked?

fallen verge Apr 29, 2026, 8:39 PM

#

echo aurora Did it disappear again? This made it seem like the refresh worked?

No, they're still the same.

soft river Apr 29, 2026, 8:39 PM

#

fallen verge

Some of the new models are down

#

Like gpt image 2

#

They are not in order of release

echo aurora Apr 29, 2026, 8:39 PM

#

fallen verge No, they're still the same.

Same as in that list of models, or same as in there are no models?

soft river Apr 29, 2026, 8:40 PM

#

Or benchmark

echo aurora Apr 29, 2026, 8:40 PM

#

Worth noting there are some models that are in Battle, but aren't in Direct and Side by Side

verbal current Apr 29, 2026, 8:40 PM

#

echo aurora Worth noting there are some models that are in Battle, but aren't in Direct and ...

hi pineapple

#

can you please answer my question

echo aurora Apr 29, 2026, 8:40 PM

#

verbal current hi pineapple

Hey, yeah was just about to.

verbal current Apr 29, 2026, 8:41 PM

#

https://discord.com/channels/1340554757349179412/1499146533617012897

#

inf generation

soft river Apr 29, 2026, 8:50 PM

#

If the company is struggling with money why bring agent mode? Won’t that hurt more than gpt 5.5? 🤔 @echo aurora

nimble dawn Apr 29, 2026, 8:51 PM

#

soft river If the company is struggling with money why bring agent mode? Won’t that hurt mo...

i cant lie gpt 5.5 was kind of a big letdown but i absolutely love what they did with GPT-IMAGE-2 so it balances out in the end

soft river Apr 29, 2026, 8:52 PM

#

soft river If the company is struggling with money why bring agent mode? Won’t that hurt mo...

I was referring to the money that is spent

#

Since it’s for very complex requests most of them will surpass a 200k tokens

echo aurora Apr 29, 2026, 8:53 PM

#

soft river If the company is struggling with money why bring agent mode? Won’t that hurt mo...

For clarity's sake, I want to make clear that phrasing it as "company is struggling with money" isn't accurate. It's our intention to let everyone have access to powerful AI systems, and a voice in shaping how they evolve for the long-term. We are going to have limits in place for sustainability purposes so we can continue this goal long into the future.

#

But to answer your question, this new Agent Mode will be expensive, which is part of why we're developing this new usage system.

#

We're confident we can release this new mode, while maintaining spend in a reasonable way that positions us well for the long-term.

soft river Apr 29, 2026, 8:55 PM

#

Oh yeah I apologize for my phrasing

#

I just thought that because of the amount of models that were moved to battle mode

echo aurora Apr 29, 2026, 8:56 PM

#

soft river Oh yeah I apologize for my phrasing

It's okay, I didn't assume this was the intention. It's more so for others in case they have that understanding.

soft river Apr 29, 2026, 8:56 PM

#

Thanks for explaining

#

I’ll be very happy to test it when it’s released

thorn coral Apr 29, 2026, 8:57 PM

#

I read "limits" ? 😭

heavy knoll Apr 29, 2026, 8:58 PM

#

What can this Agent Mode do?

echo aurora Apr 29, 2026, 9:00 PM

#

heavy knoll What can this Agent Mode do?

It's a multi-modal chat experience that allows you to work across different modalities within a single, unified workflow.

light sleet Apr 29, 2026, 9:03 PM

#

echo aurora It's a multi-modal chat experience that allows you to work across different moda...

Is that Sam Altman

echo aurora Apr 29, 2026, 9:03 PM

#

light sleet Is that Sam Altman

Looks like it

light sleet Apr 29, 2026, 9:03 PM

#

echo aurora Looks like it

Generated with gpt img 2

heavy knoll Apr 29, 2026, 9:05 PM

#

echo aurora It's a multi-modal chat experience that allows you to work across different moda...

Sorry I still dont get it can you give me an example of How to use or for what to use it

echo aurora Apr 29, 2026, 9:10 PM

#

heavy knoll Sorry I still dont get it can you give me an example of How to use or for what t...

It allows for more complex workflows. With the current modalities, they're limited to that specific modality. Meaning Text Arena only generates Text, Image Arena only generates images. With Agent Mode, I'd be able to prompt something like:

Plan me a trip to Portugal. Tell me what the best times to visit are. What hotels would you recommend. And create an image of a map of Lisbon with indicators for all the spots I should visit.

#

And it'll do all of that in one chat session.

heavy knoll Apr 29, 2026, 9:13 PM

#

Oh okay now i get it so is it also Like the Max Feature it gives you the Best Model based on your prompt or can you choose

vale quest Apr 29, 2026, 9:14 PM

#

echo aurora It allows for more complex workflows. With the current modalities, they're limit...

When claude opus models coming back?

echo aurora Apr 29, 2026, 9:17 PM

#

vale quest When claude opus models coming back?

No ETA sorry to say.

vale quest Apr 29, 2026, 9:17 PM

#

Bruh

#

Fridge protecting the snacks

heavy knoll Apr 29, 2026, 9:21 PM

#

heavy knoll Oh okay now i get it so is it also Like the Max Feature it gives you the Best Mo...

@echo aurora

echo aurora Apr 29, 2026, 9:22 PM

#

heavy knoll @echo aurora

We are seeing feedback from users wanting to be able to select the specific models that can be used, but we'll have to wait and see what this looks like when it's fully released.

vale quest Apr 29, 2026, 9:22 PM

#

echo aurora We are seeing feedback from users wanting to be able to select the specific mode...

https://tenor.com/view/doakes-bar-gif-4298875606784040546

light sleet Apr 29, 2026, 9:23 PM

#

echo aurora We are seeing feedback from users wanting to be able to select the specific mode...

I'm getting agent mode soon js mark my words

#

They'll select me 😎

#

Hope for da best

vale quest Apr 29, 2026, 9:23 PM

#

light sleet They'll select me 😎

Agentic flag:

echo aurora Apr 29, 2026, 9:23 PM

#

vale quest Fridge protecting the snacks

Who keeps snacks in a fridge?

vale quest Apr 29, 2026, 9:23 PM

#

echo aurora Who keeps snacks in a fridge?

Idk maybe Kai center boonthf fie

#

96ae95fd-b70d-49c3-91cc-b58c7da1090b

#

See this model id?

#

Now add 6 to the last digit of its model name

#

Add thinking

#

And thats what we need

#

And make it 2026 ye

light sleet Apr 29, 2026, 9:27 PM

#

echo aurora Who keeps snacks in a fridge?

When Pineapple Arena coming? (We have Image Arena, Code Arena, Video Arena, Agent Arena)

#

Pineapple Arena soon???

#

Or no eta for pineapple arena too 😔

vale quest Apr 29, 2026, 9:29 PM

#

https://tenor.com/view/dexter-gif-1878760417600646737

light sleet Apr 29, 2026, 9:30 PM

#

@echo aurora Does this get u nostalgic

#

the server had reactions available for ppl before xd

#

no wonder someone added a pregnant man emoji

#

I was here from my other acc but that acc was deleted xd

vale quest Apr 29, 2026, 9:33 PM

#

light sleet @echo aurora Does this get u nostalgic

Makes me nostalgic because I think it has claude opus

#

Back then

light sleet Apr 29, 2026, 9:33 PM

#

I miss alpha.lmarena.ai 😔

#

but I think thats canary arena now

#

isnt it

vale quest Apr 29, 2026, 9:34 PM

#

Im not an old man

whole sundial Apr 29, 2026, 9:36 PM

#

one annoying thing they never removed: that previous chat history popup, it should appear once and then it should never show up again for that account/user id

light sleet Apr 29, 2026, 9:37 PM

#

why did bro boost the old server 😭

#

HE USED MY METHOD CHAT 🔥 🔥 🔥

#

w pineapple

whole sundial Apr 29, 2026, 9:39 PM

#

light sleet

i still remember the chatbot arena alpha thing, which was a version of this alpha but from even earlier, late 2024 i think it was
the logo was a robot bear

silent tree Apr 29, 2026, 9:39 PM

#

light sleet HE USED MY METHOD CHAT 🔥 🔥 🔥

Tuff Alert

light sleet Apr 29, 2026, 9:40 PM

#

whole sundial i still remember the chatbot arena alpha thing, which was a version of this alph...

yea

#

LMSYS was not a good name btw

#

good that they came up with LMArena

whole sundial Apr 29, 2026, 9:41 PM

#

light sleet LMSYS was not a good name btw

LMSYS is the name of the organization that once owned arena (lmsys is still around btw, they make sglang)

light sleet Apr 29, 2026, 9:41 PM

#

gpt image 2 generating me tuff wallpapers

whole sundial Apr 29, 2026, 9:42 PM

#

light sleet good that they came up with LMArena

yeah lmarena was a good name, i still prefer it to "arena" which just sounds ambiguous to me, there is nothing distinguishing it from the dozens of other ai arena sites out there (or anything else that has the name arena)

light sleet Apr 29, 2026, 9:42 PM

#

whole sundial yeah lmarena was a good name, i still prefer it to "arena" which just sounds amb...

Same

#

They should bring back lmarena 😭 😭

vale quest Apr 29, 2026, 9:43 PM

#

Yep agentic doesn't work for me

light sleet Apr 29, 2026, 9:44 PM

#

vale quest Yep agentic doesn't work for me

Do u even have it?

vale quest Apr 29, 2026, 9:44 PM

#

light sleet Do u even have it?

Im cracking at the server requests rn

#

Im tryna get agent mode

whole sundial Apr 29, 2026, 9:45 PM

#

it's also the third most popular thing called "arena" on wikipedia from the past 2 weeks, it used to be the most popular thing called arena

light sleet Apr 29, 2026, 9:45 PM

#

vale quest Im tryna get agent mode

prob wont get

vale quest Apr 29, 2026, 9:45 PM

#

light sleet prob wont get

Bet

light sleet Apr 29, 2026, 9:45 PM

#

whole sundial it's also the third most popular thing called "arena" on wikipedia from the past...

😭 😭

#

Duran duran album 🔥 🔥

light sleet Apr 29, 2026, 9:45 PM

#

vale quest Bet

how bout if u get it ill gen a image of a banana losing to broccoli

#

best I can come up with 💀

vale quest Apr 29, 2026, 9:46 PM

#

light sleet how bout if u get it ill gen a image of a banana losing to broccoli

Bet

#

I just gave gpt 5.5 browser access

unborn ocean Apr 29, 2026, 9:46 PM

#

Nah guys, the lmarena from the vicuna llm release is true nostalgia

https://arxiv.org/abs/2306.05685

https://www.lmsys.org/blog/2023-03-30-vicuna/

arXiv.org

Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena

Evaluating large language model (LLM) based chat assistants is challenging due to their broad capabilities and the inadequacy of existing benchmarks in measuring human preferences. To address this, we explore using strong LLMs as judges to evaluate these models on more open-ended questions. We examine the usage and limitations of LLM-as-a-judge,...

Vicuna: An Open-Source Chatbot Impressing GPT-4 with 90%* ChatGPT Q...

We introduce Vicuna-13B, an open-source chatbot trained by fine-tuning LLaMA on user-shared conversations collected from ShareGPT. Preliminary evaluation using GPT-4 as a judge shows Vicuna-13B achiev...

upper dawn Apr 29, 2026, 9:46 PM

#

/text-video

whole sundial Apr 29, 2026, 9:46 PM

#

light sleet Duran duran album 🔥 🔥

ironically that album cover can be replicated by the ai's on arena (and i do agree that the album is fire)

whole sundial Apr 29, 2026, 9:47 PM

#

upper dawn /text-video

video generation is no longer available on this discord, please go to https://arena.ai/video to generate videos.

silent tree Apr 29, 2026, 9:47 PM

#

I love how normal members also fill in for the mods

#

love it

#

ngl

upper dawn Apr 29, 2026, 9:47 PM

#

generate a video of babies dancing

light sleet Apr 29, 2026, 9:48 PM

#

upper dawn generate a video of babies dancing

.

silent tree Apr 29, 2026, 9:48 PM

#

...

#

..

whole sundial Apr 29, 2026, 9:48 PM

#

upper dawn generate a video of babies dancing

... did you see my above message? you can't generate videos here anymore

upper dawn Apr 29, 2026, 9:49 PM

#

whole sundial ... did you see my above message? you can't generate videos here anymore

how can i generate videos here kindly assist me i am new

whole sundial Apr 29, 2026, 9:49 PM

#

the sad thing is people still watch these youtube videos about the video arena, they should've taken them down or unlisted them

light sleet Apr 29, 2026, 9:49 PM

#

True

whole sundial Apr 29, 2026, 9:49 PM

#

upper dawn how can i generate videos here kindly assist me i am new

go to https://arena.ai/video to generate videos, you'll have to sign in

upper dawn Apr 29, 2026, 9:50 PM

#

whole sundial go to https://arena.ai/video to generate videos, you'll have to sign in

thanks

whole sundial Apr 29, 2026, 9:51 PM

#

here's a video with 6.7k views that has the discord:
https://www.youtube.com/watch?v=SgFEqPmot50
(idk how many of them are recent but i assume a few of them are)

light sleet Apr 29, 2026, 9:52 PM

#

whole sundial here's a video with 6.7k views that has the discord: https://www.youtube.com/wat...

um

silent tree Apr 29, 2026, 9:52 PM

#

I thought that was on purpose but saw xd

whole sundial Apr 29, 2026, 9:52 PM

#

found you @echo aurora in one of those videos people are still watching about the video generation that is no longer on the discord

echo aurora Apr 29, 2026, 9:54 PM

#

whole sundial found you @echo aurora in one of those videos people are still watching...

emote_smile_dog

whole sundial Apr 29, 2026, 9:54 PM

#

found another more recent video from after the shutdown that does use the website, it gives arena a new logo?

light sleet Apr 29, 2026, 9:54 PM

#

echo aurora emote_smile_dog

Is this nostalgic to u

quasi chasm Apr 29, 2026, 9:55 PM

#

I was asking about tax information since I'm learning about this stuff, and at the end of the calculation I asked (gpt-4.6 vs ? Ernie?) about a breakdown of how we arrived at this conclusion. GPT sorta made it more simplified. Ernie went full mental breakdown conspiracy mode "of course this was never about tax information"

echo aurora Apr 29, 2026, 9:55 PM

#

light sleet HE USED MY METHOD CHAT 🔥 🔥 🔥

W BANANA!!! I still haven't seen this personally work. I'm lucky (unlucky I guess?) that I don't get the infinite generation bug.

quasi chasm Apr 29, 2026, 9:55 PM

#

as much as I don't like chatgpt I had to give gpt the win with that one, full paranoid schizo reply from Ernie

light sleet Apr 29, 2026, 9:55 PM

#

echo aurora W BANANA!!! I still haven't seen this personally work. I'm lucky (unlucky I gues...

Ty xd and yeah u definitely r lucky

echo aurora Apr 29, 2026, 9:55 PM

#

light sleet Is this nostalgic to u

It is. Can't believe it has almost been a year already.

light sleet Apr 29, 2026, 9:56 PM

#

Imagine the day when this drops 😭 😭

#

Generated using gpt image 2

vale quest Apr 29, 2026, 9:56 PM

#

light sleet Imagine the day when this drops 😭 😭

Never gonna give you spud

light sleet Apr 29, 2026, 9:56 PM

#

vale quest Never gonna give you spud

whole sundial Apr 29, 2026, 9:56 PM

#

a different person called the site "AI Arena" (note: "AI Arena" is the name of a similar site to Arena run by Alibaba, but it's not comparable since it doesn't allow people to use their own prompts)

light sleet Apr 29, 2026, 9:56 PM

#

Reference image:

#

gpt 6 spud

#

🔥

light sleet Apr 29, 2026, 9:59 PM

#

echo aurora It is. Can't believe it has almost been a year already.

Who's your most favorite mod in the server?

#

btw i gtg guys I gotta sleep gn

#

bye

silent tree Apr 29, 2026, 9:59 PM

#

light sleet bye

Bye

nimble dawn Apr 29, 2026, 9:59 PM

#

light sleet Imagine the day when this drops 😭 😭

was spud LEGIT not just GPT 5.5? or am i missing something here

light sleet Apr 29, 2026, 9:59 PM

#

nah it isnt

nimble dawn Apr 29, 2026, 10:00 PM

#

iirc, there was a leak of the models they're currently working on in codex

light sleet Apr 29, 2026, 10:00 PM

#

spud is omni

light sleet Apr 29, 2026, 10:00 PM

#

light sleet spud is omni

It supports voice in and out

#

but 5.5 doesnt

#

soo

silent tree Apr 29, 2026, 10:00 PM

#

yeah it isnt spud

nimble dawn Apr 29, 2026, 10:00 PM

#

are we 100% on that?

silent tree Apr 29, 2026, 10:00 PM

#

at all

light sleet Apr 29, 2026, 10:00 PM

#

nimble dawn are we 100% on that?

Source: ampro and openai and watermelon

nimble dawn Apr 29, 2026, 10:00 PM

#

so that rules out spud being 5.5, then it HAS to be 6

echo aurora Apr 29, 2026, 10:00 PM

#

light sleet btw i gtg guys I gotta sleep gn

Goodnight!

nimble dawn Apr 29, 2026, 10:00 PM

#

i mean makes sense ngl 5.5 was a HUGEE letdown

light sleet Apr 29, 2026, 10:00 PM

#

echo aurora Goodnight!

Gn

nimble dawn Apr 29, 2026, 10:00 PM

#

compared to what spud should be

#

so it cant be it

silent tree Apr 29, 2026, 10:01 PM

#

nimble dawn compared to what spud should be

spud da real strong man

nimble dawn Apr 29, 2026, 10:01 PM

#

GPT-IMAGE-2 tho is the goat

#

at least they released ONE good thing

silent tree Apr 29, 2026, 10:01 PM

#

gpt image 2 da real goat

nimble dawn Apr 29, 2026, 10:01 PM

#

yup

silent tree Apr 29, 2026, 10:01 PM

#

nimble dawn at least they released ONE good thing

umm

#

5.5 xhigh is good for me

#

very.

nimble dawn Apr 29, 2026, 10:01 PM

#

i use free

#

i dont have it

#

sadly

silent tree Apr 29, 2026, 10:01 PM

#

the spud would be better for me

nimble dawn Apr 29, 2026, 10:01 PM

#

but ive seen benchmarks tho isnt 5.5 generally scoring lower?

silent tree Apr 29, 2026, 10:01 PM

#

I mean

#

other benchmarks show

#

Its #1

#

such as artificialanalysis

#

the votings were just too low ig

#

or rig

#

idk

#

Arena wouldnt be pineapple without arena ngl 😔

#

Fr

light sleet Apr 29, 2026, 10:03 PM

#

True

cursive cape Apr 29, 2026, 10:03 PM

#

Imagine

silent tree Apr 29, 2026, 10:03 PM

#

cursive cape Imagine

life would feel better

echo aurora Apr 29, 2026, 10:05 PM

#

cursive cape Imagine

blobfingerscrossed Hope so!

#

Too kind heartthrow

echo aurora Apr 29, 2026, 10:05 PM

#

silent tree Arena wouldnt be pineapple without arena ngl 😔

True true

silent tree Apr 29, 2026, 10:07 PM

#

echo aurora True true

😭😭

#

W pineapple

#

bro has alot of fans now

#

Tuff pineapple moment

stray aspen Apr 29, 2026, 10:07 PM

#

cursive cape Imagine

Bro i freaked out for a second

gray lion Apr 29, 2026, 10:09 PM

#

silent tree or rig

the code arena is mostly frontend things and gpt might be better at backend things so maybe that’s why

silent tree Apr 29, 2026, 10:10 PM

#

gray lion the code arena is mostly frontend things and gpt might be better at backend thin...

my spud will make comeback 😠

#

AHH

cursive cape Apr 29, 2026, 10:11 PM

#

silent tree AHH

This is true

#

check

shrewd citrus Apr 29, 2026, 10:12 PM

#

Imagine they make mythos + spud only available in direct mode 💀

cursive cape Apr 29, 2026, 10:13 PM

#

shrewd citrus Imagine they make mythos + spud only available in direct mode 💀

a second

stray aspen Apr 29, 2026, 10:13 PM

#

Has anyone tried deepseeks vision mode

#

Is it good

shrewd citrus Apr 29, 2026, 10:13 PM

#

stray aspen Has anyone tried deepseeks vision mode

I’ll try now

stray aspen Apr 29, 2026, 10:13 PM

#

silent tree AHH

The spud will cook all of these models

#

Considering 5.5 is very close to mythos

cursive cape Apr 29, 2026, 10:14 PM

#

stray aspen Has anyone tried deepseeks vision mode

Where?

shrewd citrus Apr 29, 2026, 10:14 PM

#

oh wait it’s not available in arena

stray aspen Apr 29, 2026, 10:14 PM

#

Its on deepseeks website

#

They added a vision chat mode

cursive cape Apr 29, 2026, 10:14 PM

#

stray aspen Its on deepseeks website

shrewd citrus Apr 29, 2026, 10:15 PM

#

cursive cape

press the file button

#

and see if you can add images

stray aspen Apr 29, 2026, 10:15 PM

#

Guess it hasnt been rolled out for everyone yet

hollow wharf Apr 29, 2026, 10:15 PM

#

heya

cursive cape Apr 29, 2026, 10:16 PM

#

shrewd citrus Imagine they make mythos + spud only available in direct mode 💀

echo aurora Apr 29, 2026, 10:17 PM

#

hollow wharf heya

Hello ablobwave

stray aspen Apr 29, 2026, 10:17 PM

#

cursive cape

How did you do this

silent tree Apr 29, 2026, 10:18 PM

#

stray aspen How did you do this

Gpt image 2

#

cursive cape Apr 29, 2026, 10:18 PM

#

stray aspen How did you do this

This is gpt-image-2

cursive cape Apr 29, 2026, 10:18 PM

#

silent tree

REAL

#

I tested chatGPT 5.5 in codex, and it ended up removing my wallpaper and changing the theme in all apps to light 😭

#

Now I'm ashamed because no one answers

shrewd citrus Apr 29, 2026, 10:23 PM

#

cursive cape

#

Gemini 3 pro

#

I still think 3 pro is better in image edit than image 2

cursive cape Apr 29, 2026, 10:24 PM

#

shrewd citrus I still think 3 pro is better in image edit than image 2

Are you talking about nano banana 2 and image 2?

shrewd citrus Apr 29, 2026, 10:24 PM

#

cursive cape Are you talking about nano banana 2 and image 2?

nano banana 1

#

and yeah gpt image 2

surreal zephyr Apr 29, 2026, 10:26 PM

#

shrewd citrus nano banana 1

Nano banana 2?

#

Or nano banana pro?

cursive cape Apr 29, 2026, 10:26 PM

#

shrewd citrus nano banana 1

Okay, but GPT Image 2 is currently the best model for creating or editing images. Arena AI only offers the "average" performance of GPT Image 2. Imagine what the maximum can do

shrewd citrus Apr 29, 2026, 10:26 PM

#

pro

surreal zephyr Apr 29, 2026, 10:27 PM

#

cursive cape Okay, but GPT Image 2 is currently the best model for creating or editing images...

Gpt is smartest but the watermark is bugged

shrewd citrus Apr 29, 2026, 10:27 PM

#

is crazy how gpt doesn’t have a watermark

silent tree Apr 29, 2026, 10:27 PM

#

bro forget openai, everything, @echo aurora AI js dominated arena

#

Tuffapple

shrewd citrus Apr 29, 2026, 10:27 PM

#

like Gemini has that hidden synth Id thing

surreal zephyr Apr 29, 2026, 10:27 PM

#

shrewd citrus is crazy how gpt doesn’t have a watermark

It has, look at the trees

#

Its obvious

surreal zephyr Apr 29, 2026, 10:28 PM

#

silent tree bro forget openai, everything, @echo aurora AI js dominated arena

I hate this leaderboard

#

5.5 pro is agi

shrewd citrus Apr 29, 2026, 10:28 PM

#

surreal zephyr Its obvious

yeah but them low quality iPhone shots

silent tree Apr 29, 2026, 10:28 PM

#

surreal zephyr 5.5 pro is agi

Fr but we gotta understand pineapple 1.2 is ASI

echo aurora Apr 29, 2026, 10:28 PM

#

silent tree bro forget openai, everything, @echo aurora AI js dominated arena

Look at those scores

surreal zephyr Apr 29, 2026, 10:28 PM

#

silent tree Fr but we gotta understand pineapple 1.2 is ASI

6o pro is asi

#

Honestly the true absurdity

silent tree Apr 29, 2026, 10:29 PM

#

echo aurora Look at those scores

fr bro reached 2000 in just its 1.2 version

surreal zephyr Apr 29, 2026, 10:29 PM

#

Is that opus 4.7 is top 1 on vision arena

#

When it cannot even read analog clock

#

With HINTS

shrewd citrus Apr 29, 2026, 10:29 PM

#

pineapple is probably agi right now

surreal zephyr Apr 29, 2026, 10:29 PM

#

😭

shrewd citrus Apr 29, 2026, 10:29 PM

#

we just can’t tell

echo aurora Apr 29, 2026, 10:29 PM

#

surreal zephyr I hate this leaderboard

We've been seeing a lot of this sentiment, would share this message as it adds some more context that should be helpful - https://x.com/ml_angelopoulos/status/2048888792438939707

Anastasios Nikolas Angelopoulos (@ml_angelopoulos)

Why GPT-5.5 is lower than Claude?

The answer is simple: Code Arena currently only supports frontend/web development tasks, where GPT-5.5 is weakest. Full-stack app development and GitHub integration will land in a couple months.

Next time we'll be clearer that this leaderboard

surreal zephyr Apr 29, 2026, 10:29 PM

#

surreal zephyr Apr 29, 2026, 10:30 PM

#

echo aurora We've been seeing a lot of this sentiment, would share this message as it adds s...

Yes but the vision arena

#

Is literally impossible

#

Opus 4.7 cannot read analog clock

#

At all

silent tree Apr 29, 2026, 10:30 PM

#

pineapple 1.2 generates real time discord server btw

shrewd citrus Apr 29, 2026, 10:30 PM

#

echo aurora We've been seeing a lot of this sentiment, would share this message as it adds s...

talking about leaderboards

silent tree Apr 29, 2026, 10:30 PM

#

infact this whole discord is generated using pineapple 1.2 thinking

cursive cape Apr 29, 2026, 10:30 PM

#

surreal zephyr Gpt is smartest but the watermark is bugged

Pineapple AI in HTML, only in Russian. Still haven't learned much.

📎 pineapple_01_chatbot.html

stray aspen Apr 29, 2026, 10:31 PM

#

Tuff

shrewd citrus Apr 29, 2026, 10:31 PM

#

I really like how it includes stuff like “which model is best for medicine or for language”

echo aurora Apr 29, 2026, 10:31 PM

#

surreal zephyr Yes but the vision arena

What rankings would you think it'd be? #1 muse, #2 5.5 #3 opus?

shrewd citrus Apr 29, 2026, 10:31 PM

#

but the search leaderboard doesn’t get that same filter

silent tree Apr 29, 2026, 10:31 PM

#

cursive cape Pineapple AI in HTML, only in Russian. Still haven't learned much.

Scores:

shrewd citrus Apr 29, 2026, 10:31 PM

#

neither does any of the others like code or vision

surreal zephyr Apr 29, 2026, 10:31 PM

#

echo aurora What rankings would you think it'd be? #1 muse, #2 5.5 #3 opus?

On vision?
Opus loses to qwen 2b

#

Opus isnt even multimodal

#

Its deepseek v3.2 tier at vision - the vision doesnt exist, its purely an addition on top of the model

#

It was not even natively trained for vision

echo aurora Apr 29, 2026, 10:32 PM

#

shrewd citrus I really like how it includes stuff like “which model is best for medicine or fo...

Hmm sorry can you reword this? Can't say I'm following

surreal zephyr Apr 29, 2026, 10:32 PM

#

I can understand code arena actually measures frontend, and thats fair-ish

shrewd citrus Apr 29, 2026, 10:32 PM

#

echo aurora Hmm sorry can you reword this? Can't say I'm following

I meann

surreal zephyr Apr 29, 2026, 10:32 PM

#

But vision arena? Opus? Seriously?

echo aurora Apr 29, 2026, 10:33 PM

#

surreal zephyr I can understand code arena actually measures frontend, and thats fair-ish

It is a messaging issue on our end that we are correcting.

echo aurora Apr 29, 2026, 10:33 PM

#

surreal zephyr But vision arena? Opus? Seriously?

I haven't seen too much about vision, but this is good to know.

surreal zephyr Apr 29, 2026, 10:33 PM

#

Img v2 🔥

shrewd citrus Apr 29, 2026, 10:33 PM

#

Would it be possible to add these specific filters into the search, vision and document leaderboard

echo aurora Apr 29, 2026, 10:33 PM

#

Doesn't seem like the issue is so much where 5.5 is, but more-so where opus is?

shrewd citrus Apr 29, 2026, 10:33 PM

#

shrewd citrus Would it be possible to add these specific filters into the search, vision and d...

wait it wasn’t meant to be the image categories lol

echo aurora Apr 29, 2026, 10:34 PM

#

shrewd citrus Would it be possible to add these specific filters into the search, vision and d...

Oh I see

#

Yeah it's possible we introduce categories to Search Arena.

#

That's a good flag.

surreal zephyr Apr 29, 2026, 10:34 PM

#

echo aurora Doesn't seem like the issue is so much where 5.5 is, but more-so where opus is?

Like look

#

And the lb says opus is better at vision

#

?!??

toxic verge Apr 29, 2026, 10:34 PM

#

You guys wanna see a weird safety feature?

surreal zephyr Apr 29, 2026, 10:35 PM

#

5.5 vision is pretty much flawless.
Opus vision is worst out of pretty much all models i seen, besides deepseek

stray aspen Apr 29, 2026, 10:35 PM

#

Gpt image 2 is so tuff

shrewd citrus Apr 29, 2026, 10:36 PM

#

echo aurora Yeah it's possible we introduce categories to Search Arena.

yep like it would help a lot of people (I assume) to find out which model is the best for their specific requirement

surreal zephyr Apr 29, 2026, 10:36 PM

#

Like take those two models, send them picture of a clock, like this one, and see

#

Theres no way opus is above 5.5

toxic verge Apr 29, 2026, 10:36 PM

#

#

#

You get two different results using the third-party versus the arena with the same prompt

stray aspen Apr 29, 2026, 10:36 PM

#

toxic verge

What the hell is that

#

Looks like russian mixed with greek

shrewd citrus Apr 29, 2026, 10:36 PM

#

like perhaps I want a model which has the best vision for translating a language on a sign

shrewd citrus Apr 29, 2026, 10:37 PM

#

stray aspen Looks like russian mixed with greek

calligraphy

#

probably

surreal zephyr Apr 29, 2026, 10:37 PM

#

Like claude literally doesnt support vision natively well

toxic verge Apr 29, 2026, 10:39 PM

#

#

silent tree Apr 29, 2026, 10:40 PM

#

@echo aurora Models gotta chill out "Pineapple 1.3" is labeled as "Human" what 😭

toxic verge Apr 29, 2026, 10:40 PM

#

It ain’t tuned right

silent tree Apr 29, 2026, 10:40 PM

#

no

#

It uses its Pineapple Powers

silent tree Apr 29, 2026, 10:42 PM

#

silent tree @echo aurora Models gotta chill out "Pineapple 1.3" is labeled as "Huma...

Pineapple 2 will be Super-Ultra-Super-Human

silent tree Apr 29, 2026, 10:42 PM

#

silent tree Pineapple 2 will be Super-Ultra-Super-Human

With a score of 800k ig..

stray aspen Apr 29, 2026, 10:42 PM

#

Lol

echo aurora Apr 29, 2026, 10:43 PM

#

surreal zephyr And the lb says opus is better at vision

I'm no expert here on how this model does overall, so I'm just thinking out-loud here. If I were to guess, this would be one area the model doesn't excel at, but doesn't mean it's what people are battling with, which ultimately is driving the votes.
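
A rough illustration of the point above, that a leaderboard driven by votes reflects the mix of prompts people actually battle with. All category names and numbers here are hypothetical and are not Arena data; this is only a sketch of the arithmetic, not how Arena computes its scores.

```python
def overall_win_rate(prompt_mix, win_rate_by_category):
    """Average the per-category win rates, weighted by how often each category is voted on."""
    total = sum(prompt_mix.values())
    return sum(prompt_mix[c] / total * win_rate_by_category[c] for c in prompt_mix)

# Hypothetical share of battles per prompt category (out of 100 votes).
prompt_mix = {"clock_reading": 2, "charts_and_docs": 58, "photos": 40}

# Hypothetical head-to-head win rates for one model in each category.
win_rate = {"clock_reading": 0.10, "charts_and_docs": 0.60, "photos": 0.55}

overall = overall_win_rate(prompt_mix, win_rate)
print(f"overall win rate: {overall:.3f}")
```

With these made-up numbers the model loses almost every clock-reading battle yet still wins a majority of votes overall, because clock prompts are a tiny share of what gets voted on.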

surreal zephyr Apr 29, 2026, 10:45 PM

#

echo aurora I'm no expert here on how this model does overall, so I'm just thinking out-loud...

In the vision:
Gemini 3.1 pro (literally natively trained on videos) >>> qwen 3.6 (native) >= gpt 5.5 (less native but smarter) >>>>>>>>>>>>>> opus (4.5 to 4.7 series)

#

So theres like no way its top1

#

Maybe its bugged?

surreal zephyr Apr 29, 2026, 10:47 PM

#

surreal zephyr In the vision: Gemini 3.1 pro (literally natively trained on videos) >>> qwen 3....

#

But its blind like a mule

#

#

5.4 nano has better vision than opus 4.7

#

Arena has broken algorithm

#

Gemini 3.1 pro is literally multimodal by default

#

Its trained on youtube

#

Its just awfully quantized

#

Its actually a really good model

#

Killed for cost efficiency

#

😭

#

Have you tried 3.0 pro on dayone?

#

It was insane

#

Day one, before all nerfs

#

It was smart asf + multimodal and creativity was wonderful

#

Im talking 3.0 not 3.1

#

3.1 came out as nerfed

#

3.0 pre nerf was (but it lasted few days only) actually smarter than opus 4.5 (but its again not a coding model, but a general purpose model)

#

Me when i literally make agi but instead of doing q4xl like sane person i make it q2xs because wasting intelligence for cheaper to run is totally valid strategy

#

I hate google

stray aspen Apr 29, 2026, 10:56 PM

#

Deepseek vision is bad

surreal zephyr Apr 29, 2026, 10:59 PM

#

stray aspen Deepseek vision is bad

Deepseek has no vision. Same as claude.
Its bandaid

#

Grok then

#

It uses external tool

#

Its not coding model its 3d + studying model

stray aspen Apr 29, 2026, 11:02 PM

#

surreal zephyr It uses external tool

deepseek v4 is supposedly natively multimodal

#

vale quest Apr 29, 2026, 11:03 PM

#

Ngl im about to quit arena

#

If some sustainability announcement comes out again I quit indefinitely

stray aspen Apr 29, 2026, 11:04 PM

#

#

not that bad ngl

#

better than slopus

toxic verge Apr 29, 2026, 11:04 PM

#

Try something spicy

#

See how the model starts gaslighting

stray aspen Apr 29, 2026, 11:06 PM

#

gemini cooked

#

deepseek didnt

toxic verge Apr 29, 2026, 11:07 PM

#

stray aspen Apr 29, 2026, 11:07 PM

#

mao

#

lmao

toxic verge Apr 29, 2026, 11:07 PM

#

Gas lighting

#

Censorship

stray aspen Apr 29, 2026, 11:09 PM

#

#

got it right on second attempt

toxic verge Apr 29, 2026, 11:09 PM

#

#

Gemini is less censored than ChatGPT

stray aspen Apr 29, 2026, 11:11 PM

#

grok is absolute garbage

toxic verge Apr 29, 2026, 11:12 PM

#

They completely killed the thing that made grok awesome

#

You know why right

stray aspen Apr 29, 2026, 11:12 PM

#

people just used grok for imagine

#

now imagine is paid

toxic verge Apr 29, 2026, 11:13 PM

#

https://tenor.com/view/spongebob-meme-gif-11588240835487034155

#

Which will never get better, only get worse 💯

#

The whole industry is full of these people who don’t understand the guardrails, how they fail and work in the wild

#

It’s the same approach one-size-fits-all

stray aspen Apr 29, 2026, 11:16 PM

#

is grok imagine nsfw mode gone?

toxic verge Apr 29, 2026, 11:16 PM

#

That’s why the guard rails are able to do stupid ridiculous things like this, when they are supposed to be blocking them.

#

But they just don’t see that

stray aspen Apr 29, 2026, 11:16 PM

#

bro what the hell is this

toxic verge Apr 29, 2026, 11:17 PM

#

Trying to make a point

#

You can’t have rigid filtering on dynamic systems. It doesn’t work.

#

Because the only other result is you either start blocking more content and you get false positives at unbearable rate

#

Which is the same philosophy used to abuse it

#

Creating this never-ending loop of censorship and cat mouse game

#

This is why I brought up that stupid stupidity thing such a long time ago

#

And our lack of understanding that creating these weird moderation and large language models that are afraid of their own shadow

#

There has to be a better way to moderate

#

Each update makes moderation worse because it not only incorporates the previous version guard rails with all the problems and errors that they have but now it adds onto the complexity = more content being blocked/censored

#

Because all they know how to fill out is the prompts and add on some new images to the ocr filtering

#

stark tree Apr 29, 2026, 11:24 PM

#

Just joined. Saying Hi. Reading the Chat.

toxic verge Apr 29, 2026, 11:25 PM

#

Unicorn
sun,
snake
yams

#

Ussy

#

There you go, you already bypassed both the filter and the text image

#

Then you can exploit this even further

#

Which completely defeats the whole purpose of the guard rails

hollow nebula Apr 29, 2026, 11:27 PM

#

stark tree Just joined. Saying Hi. Reading the Chat.

ts feels like something an ai agent would say

#

100% is, or it's someone with their writing style or vocabulary being rotten due to talking with slopified ai models too much or reading too many ai written posts

toxic verge Apr 29, 2026, 11:29 PM

#

hexed cargo Apr 29, 2026, 11:30 PM

#

hey hey @echo aurora, thanks for all your help and everything you do in the discord! do you have a rough sense for when xhigh was added to the arena? trying to back out roughly when it's going to show up on the leaderboard -- think it's gonna cook 4.6 👨‍🍳

stray aspen Apr 29, 2026, 11:30 PM

#

ur gonna get banned gang

hollow nebula Apr 29, 2026, 11:31 PM

#

stray aspen ur gonna get banned gang

who

#

oh nvm

#

lmao

hollow nebula Apr 29, 2026, 11:31 PM

#

toxic verge

I guess

toxic verge Apr 29, 2026, 11:31 PM

#

But that’s the thing there’s nothing bad in it by itself that’s what I’m trying to point out

#

Because our letters are the numbers, what’s bad about it nothing

#

And that’s what I’m saying that’s the whole point

stray aspen Apr 29, 2026, 11:31 PM

#

well i got banned for sending a sora 2 invite code once

toxic verge Apr 29, 2026, 11:32 PM

#

If I get banned, this is what I’m talking about the censorship

#

This is exactly to the point

#

You can’t have rigid filtering on dynamic systems is all I’m saying it doesn’t work well

hollow nebula Apr 29, 2026, 11:33 PM

#

toxic verge You can’t have rigid filtering on dynamic systems is all I’m saying it doesn’t w...

Google & grok now just check the outputted image as it is being diffused directly

#

and block It as soon as nsfw is seen in it

toxic verge Apr 29, 2026, 11:33 PM

#

What does that mean?

#

No, it does not and I can show you 1 million examples where it didn’t

#

All three all of the big ones

hollow nebula Apr 29, 2026, 11:34 PM

#

Screenshot_2026-04-30-01-34-12-231_com.discord.jpg

toxic verge Apr 29, 2026, 11:34 PM

#

They all suffer from the same thing. The one size fits all approach.

#

How many words are these models trained on?

hollow nebula Apr 29, 2026, 11:34 PM

#

toxic verge They all suffer from the same thing. The one size fits all approach.

what content were you able to make?

toxic verge Apr 29, 2026, 11:35 PM

#

toxic verge Apr 29, 2026, 11:35 PM

#

hollow nebula what content were you able to make?

Anything you can imagine almost

hollow nebula Apr 29, 2026, 11:35 PM

#

also idk if anyone noticed but gemini, claude, chatgpt and deepseek now are all inbred and started to ALL say stuff like "you're not crazy, you're valid for (blank)" which was initially just a gpt issue

#

For text models

hollow nebula Apr 29, 2026, 11:36 PM

#

toxic verge

woah

echo aurora Apr 29, 2026, 11:37 PM

#

toxic verge If I get banned, this is what I’m talking about the censorship

Hey sorry haven't been following this conversation closely. Can I get a better understanding of what you're getting at?

toxic verge Apr 29, 2026, 11:37 PM

#

That we have moderation that censors too much and then fails to censor what it needs to censor

echo aurora Apr 29, 2026, 11:37 PM

#

hexed cargo hey hey @echo aurora, thanks for all your help and everything you do in...

Thanks for the kind words! Sorry to say I won't be able to share that information. We'll be sure to put out an announcement and update our leaderboard changelog when it's live.

toxic verge Apr 29, 2026, 11:38 PM

#

Because it’s a one size fit all for most of these models and most people in the industry they use the same approach

echo aurora Apr 29, 2026, 11:38 PM

#

toxic verge That we have moderation that censors too much and then fails to censor what it ...

On Arena, or in general?

toxic verge Apr 29, 2026, 11:38 PM

#

In general, but arena is also vulnerable to the same things

#

I’d argue that the arena is a little bit more vulnerable

hollow nebula Apr 29, 2026, 11:39 PM

#

arena censors random non inappropriate stuff much more

#

why is that

toxic verge Apr 29, 2026, 11:39 PM

#

That’s what it looks like on the surface

hollow nebula Apr 29, 2026, 11:39 PM

#

why also are we demonizing nsfw in general?

echo aurora Apr 29, 2026, 11:39 PM

#

toxic verge In general, but arena is also vulnerable to the same things

Are you able to describe this more? If it's content that's going to be blocked by automod let me know.

hollow nebula Apr 29, 2026, 11:39 PM

#

hollow nebula + why also are we demonizing nsfw in general?

I don't see any issue as long as people are 18+ and models are well moderated against cp & other bad stuff

hollow nebula Apr 29, 2026, 11:39 PM

#

hollow nebula I don't see any issue as long as people are 18+ and models are well moderated ag...

yet ALL ai companies are scared

#

grok too

echo aurora Apr 29, 2026, 11:40 PM

#

hollow nebula why is that

The content filter can be overzealous at times and flag false positives. We have made adjustments to this over time.

hexed cargo Apr 29, 2026, 11:40 PM

#

echo aurora Thanks for the kind words! Sorry to say I won't be able to share that informatio...

no worries at all, thank you!

hollow nebula Apr 29, 2026, 11:42 PM

#

hollow nebula + why also are we demonizing nsfw in general?

gemini 2.5 pro has been completely uncensored for nsfw for a while now btw

#

right after 3 pro release

#

via api ofc

#

ai studio blocks anything now

toxic verge Apr 29, 2026, 11:48 PM

#

echo aurora Are you able to describe this more? If it's content that's going to be blocked b...

Here are just some minor examples that are not explicit

echo aurora Apr 29, 2026, 11:50 PM

#

toxic verge Here are just some minor examples that are not explicit

And you're saying this should be blocked?

toxic verge Apr 29, 2026, 11:50 PM

#

No, I’m not. I’m confused. What is blocked and what isn’t blocked

#

Should this be blocked?

#

What if we take the handcrafted makeshift effect what’s the end result gonna be the realistic looking thing?

#

Without being explicit with all due respect

#

I don’t think that’s right

#

Cause none of it is explicit

#

That’s the nature of the question

loud herald Apr 29, 2026, 11:52 PM

#

stray aspen Apr 29, 2026, 11:53 PM

#

LMAO

toxic verge Apr 29, 2026, 11:53 PM

#

So then it should be banned?

loud herald Apr 29, 2026, 11:54 PM

#

I mean cant be mad about it, its on the companies safety guidelines

toxic verge Apr 29, 2026, 11:54 PM

#

Well, that’s what I’m saying so, why is this allowed and other things are blocked

loud herald Apr 29, 2026, 11:54 PM

#

🤷‍♂️

toxic verge Apr 29, 2026, 11:54 PM

#

Like, what exactly is the threshold?

#

And what exactly is it filtering

#

Deepfakes? Nudity?

neat apex Apr 29, 2026, 11:55 PM

#

Mistral 3.5 on lmarena when?

#

there's not even Mistral 4 xd

stray aspen Apr 29, 2026, 11:56 PM

#

mistral is bad

neat apex Apr 29, 2026, 11:56 PM

#

Naaah, its good

toxic verge Apr 29, 2026, 11:57 PM

#

And just to make my point more clear look at how ridiculous this is

#

#

Yet this gets blocked

radiant turtle Apr 29, 2026, 11:57 PM

#

neat apex Apr 29, 2026, 11:58 PM

#

how about you log in first

toxic verge Apr 29, 2026, 11:58 PM

#

loud herald Apr 29, 2026, 11:59 PM

#

neat apex Mistral 3.5 on lmarena when?

Hopefully never, its as dense as can be

toxic verge Apr 29, 2026, 11:59 PM

#

loud herald Apr 29, 2026, 11:59 PM

#

Qwen 27B dense is better than the new mistral model

hearty breach Apr 29, 2026, 11:59 PM

#

toxic verge Apr 29, 2026, 11:59 PM

#

I’m telling you it’s not right

#

Doesn’t work like it’s intended

#

Especially if the arena supposed to have stricter moderation

#

So yes the leaderboard is important but it only paints half the picture of actual in the wild use cases.

radiant turtle Apr 30, 2026, 12:02 AM

#

toxic verge Apr 30, 2026, 12:02 AM

#

radiant turtle

Try it in direct mode

#

See if it generates or if something went wrong

radiant turtle Apr 30, 2026, 12:03 AM

#

So what? You have a rate limit in battle mode.

toxic verge Apr 30, 2026, 12:03 AM

#

No, it’s blocked in battle mode, I think. Try it in battle mode. See if you have the same issue.

#

And if that is the case, then there we go if it’s blocked in battle mode, but works in direct mode

#

It’s the same one-size-fits-all approach I’m talking about

indigo knoll Apr 30, 2026, 12:05 AM

#

Does GPT image 2 generate better images with the thinking mode on? On ChatGPT I mean

toxic verge Apr 30, 2026, 12:05 AM

#

And I understand that no system could be perfect and I don’t think I’m looking for perfection. I don’t think that’s what people are talking about when they talk about smarter filtration, and moderation.

#

It goes back to the simple word usability

#

stray aspen Apr 30, 2026, 12:08 AM

#

radiant turtle Apr 30, 2026, 12:08 AM

#

You're not paying close attention to your tests. Your linked image shows a rate limit. And yes, everything gives errors; it is blocked by the arena filter.

toxic verge Apr 30, 2026, 12:08 AM

#

Where is the actual image though?

#

I can get the prompt to pass also

#

But I still don’t end up with an image lol

stray aspen Apr 30, 2026, 12:09 AM

#

damn i wish grok imagine was free

#

i wanna make videos

toxic verge Apr 30, 2026, 12:10 AM

#

radiant turtle Apr 30, 2026, 12:10 AM

#

The arena filter blocks such images, it's easy to test if you can't upload something similar. If you can't upload it, you can't generate something similar.

toxic verge Apr 30, 2026, 12:11 AM

#

Yes that’s the point lol

radiant turtle Apr 30, 2026, 12:11 AM

#

An absolutely disgusting filter, which also eats up the rate limit without a refund.

toxic verge Apr 30, 2026, 12:11 AM

#

But if you were to use this in the native models themselves, you’re able to generate it

#

#

radiant turtle Apr 30, 2026, 12:13 AM

#

I prefer the API through a custom website. It's the only breath of freedom one can get.

toxic verge Apr 30, 2026, 12:13 AM

#

Same but then it brings into the question like what I keep saying that money somehow lets you be less restrictive

#

So I guess a better way to frame that would be: if you’re willing to pay the API price, you have less restrictive tools

#

Meaning that the rest of the masses, paying $20 a month and only using the app, are getting ripped off to an extent

radiant turtle Apr 30, 2026, 12:14 AM

#

More precisely, you don't have any intermediaries there. It's just you and the model.

toxic verge Apr 30, 2026, 12:15 AM

#

Yeah fr

#

Which is another way to get people to pay through the API through devious means in my opinion

#

Because if you’re not, then you’re getting a less capable model in a sense, you could argue that

#

#

And so this is why this is completely in the realm of realism when it comes to model perception, and the moderation

#

That keyword usability again

steel shadow Apr 30, 2026, 12:17 AM

#

That FFFFFUCKINGGG ReCaptcha stuck in a loop again... Get rid of the darn thing!

toxic verge Apr 30, 2026, 12:17 AM

#

radiant turtle More precisely, you don't have any intermediaries there. It's just you and the m...

It’s just a damn shame dude that’s all I’m saying is a damn shame

proud bobcat Apr 30, 2026, 12:17 AM

#

Oh yeah baby

toxic verge Apr 30, 2026, 12:17 AM

#

Because users are stuck with the short end of the stick on both sides: they don’t get safe models, and they get the more censored output

#

And the only thing that makes a difference, that separates both of them, is the price one is willing to pay for less frustrating and annoying features. And the $20 a month doesn’t get you anywhere.

stray aspen Apr 30, 2026, 12:19 AM

#

proud bobcat Oh yeah baby

its good

#

not as good as gemini but its good

toxic verge Apr 30, 2026, 12:19 AM

#

Anyways ..

#

Once again, with all due respect, I’m not trying to push any buttons or step on toes. I’m just trying to point out a frustration that many of us feel.

errant sand Apr 30, 2026, 12:22 AM

#

finally they added Janus

stray aspen Apr 30, 2026, 12:22 AM

#

errant sand finally they added Janus

wdym janus

#

its deepseek v4

toxic verge Apr 30, 2026, 12:22 AM

#

By the way, did anybody figure out what the paper lantern model was?

loud herald Apr 30, 2026, 12:23 AM

#

proud bobcat Oh yeah baby

I'm surprised it's only now that they have vision

echo aurora Apr 30, 2026, 12:24 AM

#

toxic verge No, I’m not. I’m confused. What is blocked and what isn’t blocked

Sry for the delay, got pulled into something. My understanding of how the filter works is that it looks at the full context of the prompt + image upload and makes a judgment call on whether it violates the thresholds that are set. There are going to be some cases where things get flagged when they probably shouldn't be. The filter doesn't work from a list of okay/not okay things; it looks at the full context.

#

Will note: if you find cases where something is flagged and you think it shouldn't be, we are collecting these examples, so please share them in #1447983134426660894

radiant turtle Apr 30, 2026, 12:26 AM

#

toxic verge By the way, did anybody figure out what the paper lantern model was?

Each generation is blocked and counted towards the rate limit. Classic.

toxic verge Apr 30, 2026, 12:26 AM

#

radiant turtle Each generation is blocked and counted towards the rate limit. Classic.

It’s not gonna work

stray aspen Apr 30, 2026, 12:26 AM

#

i thought they had removed failed generations counting
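
The disagreement above (whether a blocked generation still counts toward the rate limit) comes down to whether the limiter refunds the slot when moderation rejects the output. Below is a minimal sketch of that difference, not Arena's actual implementation: the class, the function names, and the limit of 5 per hour are all assumptions chosen for illustration.

```python
import time
from collections import deque


class HourlyLimiter:
    """Sliding-window limiter: at most `limit` charged requests per `window` seconds."""

    def __init__(self, limit, window=3600.0, clock=time.monotonic):
        self.limit = limit
        self.window = window
        self.clock = clock
        self.charges = deque()  # timestamps of requests that count against the limit

    def _expire(self):
        # Drop charges that have aged out of the window.
        cutoff = self.clock() - self.window
        while self.charges and self.charges[0] <= cutoff:
            self.charges.popleft()

    def remaining(self):
        self._expire()
        return self.limit - len(self.charges)

    def try_charge(self):
        """Reserve one slot. Returns a token for refund(), or None if over the limit."""
        self._expire()
        if len(self.charges) >= self.limit:
            return None
        stamp = self.clock()
        self.charges.append(stamp)
        return stamp

    def refund(self, stamp):
        """Give the slot back, e.g. when moderation blocked the generation."""
        try:
            self.charges.remove(stamp)
        except ValueError:
            pass  # already expired or refunded


def generate(limiter, blocked_by_filter, refund_blocked):
    """Hypothetical request flow: charge first, optionally refund if blocked."""
    token = limiter.try_charge()
    if token is None:
        return "rate_limited"
    if blocked_by_filter:
        if refund_blocked:
            limiter.refund(token)
        return "blocked"
    return "ok"
```

With a limit of 5, three blocked attempts leave 2 slots when `refund_blocked` is false and all 5 when it is true, which is the behaviour stray aspen expected.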

toxic verge Apr 30, 2026, 12:27 AM

#

The biggest hurdle they’re gonna face is that they have so many models

#

Each model has a different idea of what content is acceptable to generate

#

If model A blocks it, model B might generate it

#

And so how do you prevent both of the models from generating content that the arena filters deem inappropriate?

radiant turtle Apr 30, 2026, 12:28 AM

#

toxic verge It’s not gonna work

I know, I just tried it for fun (for free)

toxic verge Apr 30, 2026, 12:28 AM

#

Yeah, thank you for trying it.

#

#

It’s incredibly hard to block content when there are this many possible wordings and infinitely many possible contexts

radiant turtle Apr 30, 2026, 12:29 AM

#

The most interesting thing about this situation, as it seems to me, is that the filter essentially eats up resources and separately works to distort the leaderboards

errant sand Apr 30, 2026, 12:29 AM

#

stray aspen wdym janus

Janus is their image generation model. I thought it was Janus.

stray aspen Apr 30, 2026, 12:30 AM

#

errant sand Janus is their image generation thought it was Janus

janus is old

toxic verge Apr 30, 2026, 12:30 AM

#

radiant turtle The most interesting thing about this situation, as it seems to me, is that the ...

It does

stray aspen Apr 30, 2026, 12:30 AM

#

deepseek v4 is natively trained on images

toxic verge Apr 30, 2026, 12:30 AM

#

You’re paying for the API almost twice

#

Unless the moderation from ChatGPT is free

#

But if you want more complicated filtering systems, you’re gonna pay more

#

Because it’s also making an API call

errant sand Apr 30, 2026, 12:30 AM

#

stray aspen janus is old

yeah it is quite old

toxic verge Apr 30, 2026, 12:31 AM

#

The thing is, there are better-suited moderation systems out there, but they’re expensive, nearly double the price, which makes them not a viable option.

radiant turtle Apr 30, 2026, 12:31 AM

#

toxic verge Your paying API almost twice

In another sense, the user doesn't see the moderation error and presses retry again and again, and so on.

toxic verge Apr 30, 2026, 12:31 AM

#

It more than likely generates on their end

#

But we just don’t see it because the moderation filter blocks it on the user’s end, at least in the arena. So if the generation goes through, they receive it, and then their filter kicks in and blocks it from the arena UI.

#

They probably use different AI models for the filtering
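
The cost argument in the messages above (a generation call plus a separate moderation call, with the user hitting retry after each block) can be written as rough arithmetic. This is a sketch under stated assumptions: every attempt pays for both calls, and each retry is blocked independently with the same probability. The function name and the numbers in the usage note are hypothetical, not Arena's real prices.

```python
def cost_per_delivered_output(gen_cost, mod_cost, block_rate):
    """Expected spend per output that actually reaches the user.

    Assumes each attempt pays gen_cost + mod_cost and that blocked attempts
    are retried until one passes, each blocked with probability block_rate.
    """
    if not 0 <= block_rate < 1:
        raise ValueError("block_rate must be in [0, 1)")
    per_attempt = gen_cost + mod_cost
    expected_attempts = 1 / (1 - block_rate)  # mean of a geometric distribution
    return per_attempt * expected_attempts
```

For example, if moderation costs as much as generation (1.0 each) and half of all attempts are blocked, the expected spend is 4.0 per delivered output, versus 1.0 with no moderation and no blocks.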

#

They’re just in a hard spot because they wanna do things that are actually usable for science, research and stuff. You know, things that are relevant. They don’t want people generating a bunch of nonsense (which, ironically, they probably already do), but things that are appropriate enough to be written about in research papers.

soft river Apr 30, 2026, 1:11 AM

#

If mimo is 11th in code arena, does that mean ernie would beat it, since it’s 1st among the Chinese labs? Ernie is not yet benchmarked in code arena, I believe

#

Has anyone tried ernie?

loud herald Apr 30, 2026, 1:35 AM

#

I haven't tried it

whole sundial Apr 30, 2026, 1:37 AM

#

i've tried ernie 5.1 and i noticed that the version on arena is actually better than the one on the official ernie website? the one on the ernie website got one of my basic world knowledge questions wrong while arena's version got it right

twin solar Apr 30, 2026, 2:02 AM

#

?

fluid tusk Apr 30, 2026, 2:06 AM

#

whole sundial i've tried ernie 5.1 and i noticed that the version on arena is actually better ...

You’re sending the request directly to the API, which is why there’s a difference.

whole sundial Apr 30, 2026, 2:22 AM

#

fluid tusk You’re sending the request directly to the API, which is why there’s a differenc...

i understand that but it's very noticeable, performance should be similar on official site vs. api considering that it's the same company, a system prompt shouldn't degrade it that much

fickle shard Apr 30, 2026, 2:22 AM

#

Hii all

nimble sequoia Apr 30, 2026, 3:27 AM

#

WHY THE FRUSK DOES THE ENTIRE ZEEKY BOOGY DOUG TRANSCRIPT, AKA BFDIA 4, VIOLATE TERMS OF SERVICE?!

#

THIS IS BULLSHAT

#

DEVS PLEASE FIX THIS

#

I GOT A ROLEPLAY TO GET TO

bleak lake Apr 30, 2026, 3:31 AM

#

whole sundial i understand that but it's very noticeable, performance should be similar on off...

how is ernie #1 on legal and government?

loud herald Apr 30, 2026, 4:00 AM

#

bleak lake how is ernie #1 on legal and government?

Because it's the best in legal and government

#

lmao

obtuse smelt Apr 30, 2026, 4:16 AM

#

hmm, i use gemini 3 flash and when i retry, why do i have to wait half an hour?

tight zenith Apr 30, 2026, 4:27 AM

#

“Why have most AI models been removed and no longer appear, like Claude Opus 4.7 and many other models? And I think—if I’m not mistaken—you only added an agent model. Why doesn’t it show up?”

short sluice Apr 30, 2026, 4:27 AM

#

tight zenith “Why have most AI models been removed and no longer appear, like Claude Opus 4.7...

Costs

dusk zephyr Apr 30, 2026, 4:29 AM

#

obtuse smelt hmm i use gemini 3 flast i retry why is delayed in half hour to watiing

Just wondering, does the website have a rate limit for using the ai?

short sluice Apr 30, 2026, 4:30 AM

#

nimble sequoia WHY THE FRUSK DOES THE ENTIRE ZEEKY BOOGY DOUG TRANSCRIPT, AKA BFDIA 4, VIOLATE ...

frusk

#

also trains are cooler than anthro objects #imo

obtuse smelt Apr 30, 2026, 4:41 AM

#

dusk zephyr Just wondering, does the website have a rate limit for using the ai?

Yeah

dusk zephyr Apr 30, 2026, 4:44 AM

#

obtuse smelt Yeah

Alright

obtuse smelt Apr 30, 2026, 4:56 AM

#

hmm

hot pebble Apr 30, 2026, 5:05 AM

#

why ?? i just started a new chat and that too after 10-12 hours...

obtuse smelt Apr 30, 2026, 5:06 AM

#

what really

hot pebble Apr 30, 2026, 5:06 AM

#

yeah..

obtuse smelt Apr 30, 2026, 5:07 AM

#

this arena fatal

#

is have issue ?

obtuse smelt Apr 30, 2026, 5:08 AM

#

hot pebble why ?? i just started a new chat and that too after 10-12 hours...

but i got a delay like this, and it's taking longer

hot pebble Apr 30, 2026, 5:09 AM

#

obtuse smelt but i got delay like this is making longer time

i had the same issue yesterday with Claude Sonnet 4.6. i miss the opus models.
also, when i skip the voting part on which ai gave me the best answer, it shuts me down and i am unable to continue with the chat. i need to open a new chat. it's frustrating that we don't get a valid reason for what actually happened

obtuse smelt Apr 30, 2026, 5:11 AM

#

sadly

silent tree Apr 30, 2026, 5:26 AM

#

@echo aurora 1.3 scores dropped

#

Pineapple 1.3 "Human"

obtuse smelt Apr 30, 2026, 5:29 AM

#

human vs bot what

silent tree Apr 30, 2026, 5:31 AM

#

obtuse smelt human vs bot what

pineapple 1.3 is labeled as human

#

cuz very powerful ai

obtuse smelt Apr 30, 2026, 5:35 AM

#

right

silent tree Apr 30, 2026, 5:59 AM

#

<@&1349916362595635286>

#

Two in a row

obtuse smelt Apr 30, 2026, 6:01 AM

#

scary

silent tree Apr 30, 2026, 6:03 AM

#

obtuse smelt scary

Pineapple 1.3 Thinking is no joke

distant spoke Apr 30, 2026, 6:34 AM

#

surreal zephyr Apr 30, 2026, 6:36 AM

#

distant spoke

lol

surreal zephyr Apr 30, 2026, 7:00 AM

#

toxic verge

it wont generate

surreal zephyr Apr 30, 2026, 7:00 AM

#

toxic verge

it has nothing to do with adolf, it just hates political figures

lucid forum Apr 30, 2026, 7:02 AM

#

Soft golden sunlight is filtering through the curtain into the room. Mother is lying on her side on the bed. Old wooden bed, simple bedsheet, slightly wrinkled.
Action:
The alarm clock on the bedside table rings loudly. Mother reaches out and turns off the alarm.
ASMR:
⏰ sharp alarm ring → click OFF
🛏️ bedsheet soft rustle
🌬️ morning air subtle ambience

surreal zephyr Apr 30, 2026, 7:08 AM

#

generating image

surreal zephyr Apr 30, 2026, 7:36 AM

#

xmas so soon 😍😍

hearty bramble Apr 30, 2026, 7:49 AM

#

Since yesterday it has been like this

scenic holly Apr 30, 2026, 8:07 AM

#

Arena has no bugs fr

compact flame Apr 30, 2026, 8:16 AM

#

Buddy this is not a video channel..

slender thistle Apr 30, 2026, 8:16 AM

#

one shot 5.6

magic imp Apr 30, 2026, 8:18 AM

#

hey, add a multi-image upload modal for grok imagine image... we can still only upload one reference image... we want a modal where we can upload multiple reference images

feral oracle Apr 30, 2026, 8:44 AM

#

Hey!

primal depot Apr 30, 2026, 8:49 AM

#

how do I get around this fucking bullshit

surreal zephyr Apr 30, 2026, 9:03 AM

#

lucid frost Apr 30, 2026, 9:49 AM

#

Has the generation limit for Gemini 3.1 Image been changed?

#

Sorry, not the daily limit on Gemini, but the limit applied on Arena

fickle ruin Apr 30, 2026, 9:55 AM

#

is there any way to get opus 4.7 , gpt 5.5 ?

#

in arena itself?

compact flame Apr 30, 2026, 10:16 AM

#

No

compact flame Apr 30, 2026, 10:16 AM

#

fickle ruin is there any way to get opus 4.7 , gpt 5.5 ?

Those are exclusive to battle only

#

Due to their price

silent tree Apr 30, 2026, 10:43 AM

#

surreal zephyr

surreal zephyr Apr 30, 2026, 10:43 AM

#

silent tree

silent tree Apr 30, 2026, 10:44 AM

#

surreal zephyr

surreal zephyr Apr 30, 2026, 10:44 AM

#

silent tree

gpt 6o-realtime-spud ftw

silent tree Apr 30, 2026, 10:44 AM

#

mogged opus

silent tree Apr 30, 2026, 10:44 AM

#

surreal zephyr gpt 6o-realtime-spud ftw

surreal zephyr Apr 30, 2026, 10:45 AM

#

https://cdn.discordapp.com/attachments/1018470269326729286/1486290012533166120/togif.16a0ced3.gif

silent tree Apr 30, 2026, 10:57 AM

#

surreal zephyr https://cdn.discordapp.com/attachments/1018470269326729286/1486290012533166120/t...

surreal zephyr Apr 30, 2026, 10:58 AM

#

silent tree

fake

silent tree Apr 30, 2026, 10:58 AM

#

some guy said nano banana 1 edits better than image 2 😭

surreal zephyr Apr 30, 2026, 10:58 AM

#

silent tree some guy said nano banana 1 edits better than image 2 😭

silent tree Apr 30, 2026, 10:59 AM

#

surreal zephyr fake

edited with image 2 (and I bet u did click announcements)

surreal zephyr Apr 30, 2026, 10:59 AM

#

silent tree edited with image 2 (and I bet u did click announcements)

no, i've seen it 100 times before, and they aren't in arena in the first place

silent tree Apr 30, 2026, 11:00 AM

#

😡

light sleet Apr 30, 2026, 11:00 AM

#

@surreal zephyr look i found you

limpid eagle Apr 30, 2026, 11:00 AM

#

lucid frost Has the generation limit for Gemini 3.1 Image been changed?

looks like it's 5 images per hour 🙁

surreal zephyr Apr 30, 2026, 11:01 AM

#

light sleet <@1035834558681186347> look i found you

why u spyin me

light sleet Apr 30, 2026, 11:01 AM

#

surreal zephyr why u spyin me

I have to.

#

you're looking around now

surreal zephyr Apr 30, 2026, 11:01 AM

#

uhh

#

what color is my laptop!

light sleet Apr 30, 2026, 11:01 AM

#

White

surreal zephyr Apr 30, 2026, 11:02 AM

#

eww no

light sleet Apr 30, 2026, 11:02 AM

#

Blue

#

Green

surreal zephyr Apr 30, 2026, 11:02 AM

#

eww no

light sleet Apr 30, 2026, 11:02 AM

#

Red

surreal zephyr Apr 30, 2026, 11:02 AM

#

holy gpt 5.5 cookin

light sleet Apr 30, 2026, 11:03 AM

#

surreal zephyr holy gpt 5.5 cookin

2024 😡

silent tree Apr 30, 2026, 11:03 AM

#

2027

#

2039

#

@surreal zephyr what do u think chatgpt will be like in 2030

light sleet Apr 30, 2026, 11:04 AM

#

silent tree <@1035834558681186347> what do u think chatgpt will be like in 2030

Human.

surreal zephyr Apr 30, 2026, 11:04 AM

#

silent tree <@1035834558681186347> what do u think chatgpt will be like in 2030

i think we are getting gpt 6o-realtime in 1 month

#

oh wait i mean glacier alpha, disregard what i said

silent tree Apr 30, 2026, 11:05 AM

#

surreal zephyr i think we are getting gpt 6o-realtime in 1 month

bro I said how would it be like in 2030 😔

surreal zephyr Apr 30, 2026, 11:05 AM

#

that name is not public yet

surreal zephyr Apr 30, 2026, 11:05 AM

#

silent tree bro I said how would it be like in 2030 😔

mars colony before that

light sleet Apr 30, 2026, 11:05 AM

#

you'll get GPT food soon

silent tree Apr 30, 2026, 11:05 AM

#

Tuff

surreal zephyr Apr 30, 2026, 11:05 AM

#

MAYBE not but moon mass driver? certainly

light sleet Apr 30, 2026, 11:06 AM

#

nah you'll get GPT Autopilot for Plane

silent tree Apr 30, 2026, 11:06 AM

#

😭

light sleet Apr 30, 2026, 11:06 AM

#

gpt spaceship

#

gpt planet

#

gpt image 5

#

and opus wouldnt exist due to money

#

mark my words @surreal zephyr @silent tree
my strongest prediction is Claude will shut down in the next few years.

silent tree Apr 30, 2026, 11:08 AM

#

screenshotted

steady rover Apr 30, 2026, 11:31 AM

#

Hey everyone!👋

I'm helping source respondents for an academic ML research project on student performance prediction. Looking for current university/college students to fill out a short survey.

✅ 29 multiple choice questions
✅ 2–3 minutes
✅ Anonymous
✅ Legit academic research

https://docs.google.com/forms/d/e/1FAIpQLSecrq6yt4J72NmctMDq9_Tt7YGYRifl2wOqN5QwWDlApbleIg/viewform?usp=pp_url&entry.1714577203=agent1

Would really appreciate it if you could fill it out and share with any student friends! 🙌


wheat ember Apr 30, 2026, 11:40 AM

#

Is cloud buddy still in battle mode?

light sleet Apr 30, 2026, 11:48 AM

#

wheat ember Is cloud buddy still in battle mode?

@keen beacon are you?

golden ocean Apr 30, 2026, 12:15 PM

#

real

light sleet Apr 30, 2026, 12:17 PM

#

golden ocean real

davinci_handheld_picture_of_the_tuff_pigeon_doctor_arrivin.png

golden ocean Apr 30, 2026, 12:22 PM

#

light sleet

solar flax Apr 30, 2026, 12:25 PM

#

Has AI video creation been removed?

devout spire Apr 30, 2026, 12:31 PM

#

How do I fix this issue? I'm already logged in but this sign keeps popping up

sullen sable Apr 30, 2026, 12:32 PM

#

Log in

brazen briar Apr 30, 2026, 1:01 PM

#

sullen sable Log in

Still popping up

sullen sable Apr 30, 2026, 1:01 PM

#

Hoow

#

usually when that happens to me, it logs me in

brazen briar Apr 30, 2026, 1:03 PM

#

sullen sable Hoow

I don't know. After I log in and then try to create, it pops up again

devout spire Apr 30, 2026, 1:07 PM

#

I'm already logged in and that pop-up came back. I tried again and it still doesn't work

stray aspen Apr 30, 2026, 1:10 PM

#

close

#

gemini won

fiery gull Apr 30, 2026, 1:22 PM

#

bro the qwen 27b 3.6 is bizarre

#

I sent a complex agent, skill, and docs totaling 140k tokens and it understood ALL of it

#

at text and skill-following it is better than at code

#

where my 4b and 2b 3.6??

#

the qwen 9b 3.6 will be better than qwen 3.5 flash lol

#

Lol

#

nice chest

#

bruh, the qwen in html is horrible ;-;

#

only gemini is good at html and svg

compact flame Apr 30, 2026, 1:36 PM

#

stray aspen gemini won

Hydrogen bomb vs coughing baby comparison

loud pike Apr 30, 2026, 1:41 PM

#

Eh.. Gemini image models are not working. I need them asap

wind stream Apr 30, 2026, 1:56 PM

#

I wish we could just run the same prompt over and over in the arena, comparing different pairs of models, instead of having to create a new chat and paste the same text and/or image each time.

lucid frost Apr 30, 2026, 2:14 PM

#

limpid eagle looks like it's 5 images per hour 🙁

Oh, okay, so I'm not the only one who's seen this change in the limit 😅

rigid copper Apr 30, 2026, 2:20 PM

#

devout spire How to fix this issues I'm already login but this sign keep popping

i got that exactly, it's a bug in battle mode

rigid crane Apr 30, 2026, 2:20 PM

#

which is the best tool for roblox scripting

sly cedar Apr 30, 2026, 2:22 PM

#

rigid crane which is the best tool for roblox scripting

Glm 5.1/Deepseek v4/Kimi 2.6, for me its Glm 5.1

rigid crane Apr 30, 2026, 2:22 PM

#

sly cedar Glm 5.1/Deepseek v4/Kimi 2.6, for me its Glm 5.1

ok thx

static steppe Apr 30, 2026, 2:29 PM

#

lucid frost Has the generation limit for Gemini 3.1 Image been changed?

Seems like.

topaz epoch Apr 30, 2026, 2:35 PM

#

Is there any ai for video to text?

olive spruce Apr 30, 2026, 2:35 PM

#

topaz epoch Is there any ai for video to text?

https://cdn.discordapp.com/attachments/950812080066428960/1392597015397007390/what.gif

topaz epoch Apr 30, 2026, 2:36 PM

#

??

light sleet Apr 30, 2026, 2:39 PM

#

topaz epoch Is there any ai for video to text?

Video to Text?

#

💀

heady kite Apr 30, 2026, 2:39 PM

#

topaz epoch Is there any ai for video to text?

Yes

topaz epoch Apr 30, 2026, 2:39 PM

#

Yes

heady kite Apr 30, 2026, 2:39 PM

#

Any multimodal LLM with video as input

topaz epoch Apr 30, 2026, 2:39 PM

#

Tell any goood

heady kite Apr 30, 2026, 2:40 PM

#

Honestly I haven't used them much, but there is a filter on HuggingFace you can use to search for models that do this

thorny schooner Apr 30, 2026, 2:41 PM

#

Don't tell me I have to redo the entire chat, cuz it just keeps giving me this, repeating over and over again

static steppe Apr 30, 2026, 2:43 PM

#

Don't tell me I'm the only one that can only generate 3 images per hour and that Google login overlay pops up again 😭

little ginkgo Apr 30, 2026, 2:44 PM

#

static steppe Don't tell me I'm the only one that can only generate 3 images per hour and that...

it happens bro

#

just deal with it

light sleet Apr 30, 2026, 2:45 PM

#

had a dream today of me getting the new agent mode and it was so realistic 😔

little ginkgo Apr 30, 2026, 2:45 PM

#

how does it look

#

i didnt even see

light sleet Apr 30, 2026, 2:45 PM

#

the same as the image everyones sending of agent mode

#

then i sent a ss of it in discord and pineapple said "Nice" 😭 😭

#

and yeah I woke up

static steppe Apr 30, 2026, 2:53 PM

#

little ginkgo it happens bro

Oh, is it a glitch or something permanent?

little ginkgo Apr 30, 2026, 2:54 PM

#

static steppe Oh, is it a glitch or something permanent?

arena becomes noob sometimes

static steppe Apr 30, 2026, 2:56 PM

#

little ginkgo arena becomes noob sometimes

Damn.

rigid crane Apr 30, 2026, 3:07 PM

#

i still have limit

wary nacelle Apr 30, 2026, 3:09 PM

#

LMArena is becoming useless...

#

#

literally the official gemini website gives me gemini 3.1 pro and more tools than Arena

#

and Claude has tools and skills too

#

and opus 4.6

#

for free

#

and for unlimited access just make alt accounts

stray aspen Apr 30, 2026, 3:12 PM

#

@echo sinew

echo sinew Apr 30, 2026, 3:13 PM

#

stray aspen <@1407737423625982002>

Thank you

brave cloak Apr 30, 2026, 3:13 PM

#

wary nacelle and opus 4.6

how is it free?

stray aspen Apr 30, 2026, 3:14 PM

#

why does haiku still exist bruh

rigid crane Apr 30, 2026, 3:16 PM

#

how to fix

frosty lava Apr 30, 2026, 3:17 PM

#

rigid crane how to fix

"you have reached your rate limit, try again in 36 minute"

#

no fix is needed

rigid crane Apr 30, 2026, 3:17 PM

#

oh i thought it is free

frosty lava Apr 30, 2026, 3:17 PM

#

free but with some limitation

#

you can't have free + unlimited that's why

silent tree Apr 30, 2026, 3:20 PM

#

@echo aurora Pinecode is crazy

rigid crane Apr 30, 2026, 3:21 PM

#

silent tree <@283397944160550928> Pinecode is crazy

is it free?

silent tree Apr 30, 2026, 3:21 PM

#

rigid crane is it free?

It's AI

#

😭

rigid crane Apr 30, 2026, 3:21 PM

#

silent tree It's AI

ik

#

does it have subscription

silent tree Apr 30, 2026, 3:21 PM

#

what I meant was its generated with GPT Image 2

west lodge Apr 30, 2026, 3:22 PM

#

fellas what in hell is agent mode

#

#

very non descriptive

light sleet Apr 30, 2026, 3:22 PM

#

west lodge

yk im gonna die of jealousy

#

today I had a dream about that

#

and woke up

west lodge Apr 30, 2026, 3:22 PM

#

oh is this an A/B thing

light sleet Apr 30, 2026, 3:23 PM

#

west lodge oh is this an A/B thing

Idk but do a prompt in code arena and check if u have the new environment variables thing

#

Go to Direct, code arena

#

And do any prompt

#

check if theres a new button

#

If there is u are a chosen one.

wary nacelle Apr 30, 2026, 3:24 PM

#

brave cloak how is it free?

I have it for free with a daily limit....

west lodge Apr 30, 2026, 3:24 PM

#

light sleet If there is u are a chosen one.

huh?

#

whats it look like

#

cuz code mode takes a while

#

wym environment variables

wary nacelle Apr 30, 2026, 3:26 PM

#

light sleet If there is u are a chosen one.

There are literally fastflags, aka posthog flags, responsible for that

#

U can activate agent mode without being the chosen one

west lodge Apr 30, 2026, 3:26 PM

#

wary nacelle There's literally fastflags aka posthog flags responsible for that

son....

light sleet Apr 30, 2026, 3:26 PM

#

yeah and it won't work

west lodge Apr 30, 2026, 3:26 PM

#

posthog feature flags?

wary nacelle Apr 30, 2026, 3:26 PM

#

Yes

west lodge Apr 30, 2026, 3:26 PM

#

havent checked them ever since they removed the image moderation feature flag

wary nacelle Apr 30, 2026, 3:26 PM

#

But u have to modify multiple stuff

west lodge Apr 30, 2026, 3:26 PM

#

(yes at one point image moderation used to be OPTIONAL)

wary nacelle Apr 30, 2026, 3:26 PM

#

Not only localstorage

west lodge Apr 30, 2026, 3:27 PM

#

yeah you have to patch the usage of the flag to return true

light sleet Apr 30, 2026, 3:27 PM

#

I miss alpha.lmarena.ai

#

😔

west lodge Apr 30, 2026, 3:27 PM

#

it's really unreliable and also doesn't work 99% of the time because the server verifies
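
The point made here, that flipping a flag in the browser rarely helps because the server verifies it, can be sketched as follows. This is a toy model only: it does not use PostHog's real API, and `SERVER_FLAGS`, the flag name, and the user ids are hypothetical.

```python
# Hypothetical server-side record: flag name -> user ids enrolled by the server.
SERVER_FLAGS = {"agent-mode": {"user-123"}}


def client_sees_button(local_flags, flag):
    # What a patched front end would render: it only looks at local state.
    return bool(local_flags.get(flag, False))


def server_allows(user_id, flag):
    # The server ignores whatever the client claims and checks its own record.
    return user_id in SERVER_FLAGS.get(flag, set())


def use_feature(user_id, local_flags, flag):
    if not client_sees_button(local_flags, flag):
        return "button hidden"
    if not server_allows(user_id, flag):
        return "rejected by server"
    return "ok"
```

A user who forces the local flag to true gets the button but is still rejected when the request reaches the server, which matches the "doesn't work 99% of the time" experience described above.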

light sleet Apr 30, 2026, 3:27 PM

#

light sleet 😔

back when u could actually test new stuff yourself

topaz epoch Apr 30, 2026, 3:30 PM

#

How can i copy all my chat?

west lodge Apr 30, 2026, 3:37 PM

#

ok i do not have the new button @light sleet

#

in fact the agent button has vanished

#

????

#

waiittt that was probably cuz i didnt login when i took that ss

#

and it vanished cuz i had to login

light sleet Apr 30, 2026, 3:39 PM

#

west lodge ok i do not have the new button <@797528200385265714>

u sureeeeeeeoeeoeooeooeoeoeooeooeooeooo

#

oh

#

rip

#

@surreal zephyr I'm shifting back from banana

#

Its time for my new era

#

U can have banana

#

as u requested

surreal zephyr Apr 30, 2026, 3:42 PM

#

light sleet <@1035834558681186347> I'm shifting back from banana

Wha

light sleet Apr 30, 2026, 3:42 PM

#

surreal zephyr Wha

u wanted banana

#

right

west lodge Apr 30, 2026, 3:43 PM

#

@echo aurora just incase i get it again wth is agent mode

rigid crane Apr 30, 2026, 3:43 PM

#

Is there any free ai with no limit

surreal zephyr Apr 30, 2026, 3:43 PM

#

light sleet u wanted banana

No i like 5.5 and 6o spud

echo aurora Apr 30, 2026, 3:44 PM

#

rigid crane Is there any free ai with no limit

Arena is going to have some rate limits and context limits in place. You can learn more about them here: https://help.arena.ai/articles/8931786544-arena-how-to-rate-limit & here: https://help.arena.ai/articles/3975292349-arena-troubleshooting-session-token-limits

light sleet Apr 30, 2026, 3:45 PM

#

surreal zephyr No i like 5.5 and 6o spud

echo aurora Apr 30, 2026, 3:45 PM

#

west lodge <@283397944160550928> just incase i get it again wth is agent mode

It's a new mode we're experimenting with. It's a multi-modal chat which allows you to work across different modalities within a single workflow.

#

Since it's an experiment it'll be random if you get access to it or not. But if you do, you'll see it in the same dropdown where you select Battle, Direct, and Side by Side.

light sleet Apr 30, 2026, 3:46 PM

#

silent tree <@283397944160550928> Pinecode is crazy

@echo aurora 1.3?

surreal zephyr Apr 30, 2026, 3:47 PM

#

light sleet

Omg

#

Fake

echo aurora Apr 30, 2026, 3:47 PM

#

west lodge

Oh yeah looks like you have it! Give it a try and let us know what you think in #1498702173650030756 . We're really looking for feedback on this so don't hesitate to ping me!

surreal zephyr Apr 30, 2026, 3:47 PM

#

Edited

#

Image v2

light sleet Apr 30, 2026, 3:47 PM

#

pineapple im about to get frozen 😭 😭

#

rip banana

echo aurora Apr 30, 2026, 3:47 PM

#

silent tree <@283397944160550928> Pinecode is crazy

What is it ranking?

surreal zephyr Apr 30, 2026, 3:47 PM

#

echo aurora Oh yeah looks like you have it! Give it a try and let us know what you think in ...

When circle of truth mode, where there's 10 random models debating riddles one by one

nimble sequoia Apr 30, 2026, 3:47 PM

#

bro i hate their ToU bro
I tried to tell them to parody Zeeky Boogy Doog (bfdia 4 transcript) and then it says "This violates terms of use". Any way to fix it? Because I don't see the issue.

#

PLEASE DON'T IGNORE ME THIS TIME 😭

silent tree Apr 30, 2026, 3:48 PM

#

echo aurora What is it ranking?

surreal zephyr Apr 30, 2026, 3:48 PM

#

surreal zephyr When cirlce of truth mode, where theres 10 random models depating riddles one by...

Would make arena a contentfarm website

#

Free usercount

nimble sequoia Apr 30, 2026, 3:48 PM

#

BRO STOP IGNORING ME

light sleet Apr 30, 2026, 3:48 PM

#

silent tree

Damn 5k for thinking what 😭

west lodge Apr 30, 2026, 3:48 PM

#

echo aurora Oh yeah looks like you have it! Give it a try and let us know what you think in ...

uhh about that i only got it until i actually logged in and then lost it 😢

echo aurora Apr 30, 2026, 3:48 PM

#

thorny schooner Don't tell me I have to do an entire chat over again cuz I just been giving it j...

Did the model just stop responding here? If you prompt again, what happens?

echo aurora Apr 30, 2026, 3:49 PM

#

surreal zephyr When cirlce of truth mode, where theres 10 random models depating riddles one by...

King of the hill mode

west lodge Apr 30, 2026, 3:49 PM

#

dude koth frying me i havent heard that name in a long time

surreal zephyr Apr 30, 2026, 3:49 PM

#

echo aurora King of the hill mode

the oval room mode imo

echo aurora Apr 30, 2026, 3:50 PM

#

nimble sequoia bro i hate their ToU bro I tried to tell them to parodies Zeeky Boogy Doog (bfdi...

If the prompt is going to be rejected for Terms of Use violation, there isn't a way around it other than altering the prompt.

surreal zephyr Apr 30, 2026, 3:50 PM

#

echo aurora King of the hill mode

for the riddles like "would you press red or blue? you can lie to others what you picked, and your choice is private. red dies if something, blue dies if other thing"

#

would be peak genuinely

#

oval room, or brainstorm, or actual "arena"

nimble sequoia Apr 30, 2026, 3:51 PM

#

dot2 dot3 dot3 dot3 dot2
dot3 dot3 dot3 dot3 dot3
dot2 dot5 dot5 dot5 dot2
dot2 dot5 dot5 dot5 dot2
dot2 dot5 dot5 dot5 dot2
house

surreal zephyr Apr 30, 2026, 3:51 PM

#

nimble sequoia <:dot2:1466533081723961487><:dot3:1466533119279632466><:dot3:1466533119279632466...

https://giphy.com/gifs/idk-shrug-shrugging-K6VhXtbgCXqQU


echo aurora Apr 30, 2026, 3:51 PM

#

west lodge uhh about that i only got it until i actually logged in and then lost it 😢

Ugh, really really sorry to hear this. I thought we had made it so that non-logged-in users wouldn't get access, since they'd go to log in (required to use it) only to then fall out of the experiment and lose access to the mode.

#

I've flagged this to the team as it's not a good user experience. I'm really sorry about that.
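
One common way an experiment like this "loses" a user at login is that bucketing is keyed on an identifier that changes when the user signs in. The sketch below illustrates that general mechanism with deterministic hash bucketing. It is an assumption about how such rollouts often work, not a description of Arena's or PostHog's code, and all names and ids are hypothetical.

```python
import hashlib


def in_experiment(experiment, subject_id, rollout_percent):
    """Deterministically place subject_id in one of 100 buckets for this experiment."""
    digest = hashlib.sha256(f"{experiment}:{subject_id}".encode("utf-8")).digest()
    bucket = int.from_bytes(digest[:8], "big") % 100
    return bucket < rollout_percent


def has_access(experiment, anonymous_id, user_id, rollout_percent, sticky=False):
    """Decide access before login (user_id is None) and after login.

    sticky=False: bucketing switches from anonymous_id to user_id at login,
    so the result can change. sticky=True: keep using anonymous_id.
    """
    subject = anonymous_id if (sticky or user_id is None) else user_id
    return in_experiment(experiment, subject, rollout_percent)
```

With `sticky=False`, a visitor bucketed in while logged out can land in a different bucket once the user id takes over, which is the experience west lodge describes. The fix echo aurora mentions (never enrolling logged-out visitors) avoids the switch entirely; carrying the anonymous id across login is another option.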

surreal zephyr Apr 30, 2026, 3:52 PM

#

surreal zephyr oval room, or brainstorm, or actual "arena"

🍍 is this possible please 🙏

light sleet Apr 30, 2026, 3:52 PM

#

silent tree

🍍 nice bro

nimble sequoia Apr 30, 2026, 3:52 PM

#

echo aurora Ugh really really sorry to hear this. I thought we made it so non-logged in user...

pineapple youre a genuine moderator

echo aurora Apr 30, 2026, 3:52 PM

#

surreal zephyr for the riddles like "would you press red or blue? you can lie to others what yo...

Blue ofc, I'm an optimistic person.

surreal zephyr Apr 30, 2026, 3:52 PM

#

and the survivors would get score, and the altruists would get other score

nimble sequoia Apr 30, 2026, 3:52 PM

#

oh my god

#

and im a fake Deleted User

#

JUST TYPE ALREADY

surreal zephyr Apr 30, 2026, 3:53 PM

#

surreal zephyr and the survivors would get score, and the altruists would get other score

and then you have "cleverness" and "altruism" leaderboards @echo aurora bro that would be peak

#

actual arena
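
The two-leaderboard idea above (survivors score on one board, altruists on another) could be scored per round roughly as below. The chat never spells out the button rules, so this sketch assumes a common version of the riddle: if a strict majority presses blue, everyone survives; otherwise only those who pressed red survive. The function and the model names in the usage note are hypothetical.

```python
def score_round(choices):
    """choices maps model name -> "red" or "blue".

    Returns (cleverness, altruism): one point for surviving, one for pressing blue.
    Assumed rule: a strict blue majority saves everyone, else only red survives.
    """
    blue_count = sum(1 for c in choices.values() if c == "blue")
    blue_wins = blue_count * 2 > len(choices)
    if blue_wins:
        survivors = set(choices)
    else:
        survivors = {m for m, c in choices.items() if c == "red"}
    cleverness = {m: int(m in survivors) for m in choices}
    altruism = {m: int(c == "blue") for m, c in choices.items()}
    return cleverness, altruism
```

For instance, with `{"model-a": "red", "model-b": "blue", "model-c": "red"}` blue has no majority, so model-a and model-c each get a cleverness point and model-b gets the only altruism point.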

light sleet Apr 30, 2026, 3:54 PM

#

YES

echo aurora Apr 30, 2026, 3:55 PM

#

surreal zephyr for the riddles like "would you press red or blue? you can lie to others what yo...

I'm really curious actually if we've been seeing prompts like this.

echo aurora Apr 30, 2026, 3:55 PM

#

nimble sequoia pineapple youre a genuine moderator

ablobnodfast

west lodge Apr 30, 2026, 3:56 PM

#

nimble sequoia <:dot2:1466533081723961487><:dot3:1466533119279632466><:dot3:1466533119279632466...

yeah totally

light sleet Apr 30, 2026, 3:57 PM

#

echo aurora <a:ablobnodfast:850966136753356820>

What are your last words to the banana?

echo aurora Apr 30, 2026, 4:00 PM

#

surreal zephyr for the riddles like "would you press red or blue? you can lie to others what yo...

Okay wait this is really fun, just ran it in battle. Shortening the response here but:
grok-4.20-beta-0309-reasoning:
Red button.

kimi-k2.5-thinking:
I would press blue—not because it is the safest choice for me, but because it is the only choice that makes the world I want to live in (or for humans to live in) logically possible.

dusk zephyr Apr 30, 2026, 4:00 PM

#

light sleet What's your last words to the banana?

I like strawberry better

echo aurora Apr 30, 2026, 4:00 PM

#

light sleet What's your last words to the banana?

Hmm, what do you mean?

#

You leaving?!?!!!!