#general
1 messages · Page 348 of 1
Code Arena in it's current form is for building the frontend for sites/apps. We are in the process of building more support though. A full-stack experiment is currently released and we're hoping to fully release as soon as it's ready.
thanks. right now I just end up using normal chat, and then I have to copy paste code. i also think it uses a lot more tokens (bad for me and bad for arena.ai). these are small apps nothing like a full stack site. is that the best way? any way to get beta access to the experiment?
Max is going to select the best model for that prompt keeping latency in mind. It's the same idea as the prompt-to-leaderboard but turned into "model".
Sorry to say there isn't a way to opt into this experiment. We do fully understand the importance of having full-stack and want to move the progress along with it quickly.
Well I think it's swinging too much with latency because I have seen it almost legitimately only use Google AI like I'm not even kidding almost no other AI I have as seen be used as a model that's not Google when us except like every couple other one and even then that's kind of legitimately becoming more rare at least in my experience when Google just does not work with the prompt
honestly sometimes will actively disrupt if I'm using a particular one especially since google every time my experience tends to have a much more defined output limit then some other ones which can disrupt the frame of chat if I was using a previous different model before experimenting with Max
It's all going to depend on what is being prompting and what the chat session looks like, so I wouldn't be able to tell you if something isn't working correctly with your chat session. Having Max in Side by Side I think would be really interesting to see how those responses compared to sticking with a specific model.
Very septic that depends on being prompt actually considers anything but okay
In my experience if you used socratic prompting or just paste a 300 word essay paragraph you'll have Anthropic, else mostly Google
Using Max I've only ever once see something that isn't Anthropic and Google when I asked about pseudoscience 😕
It's a model router, it's going to route to the most capable model for that specific prompt (with latency in mind).
Did you mean to say you're skeptic?
Hi
Hello 
Did you get grok
indeed
guys no offence i think the gpt image 2 model is useless most of it is fail after fail
like when i ask for any ip (Even nickelodeon IP)
sometimes it works sometimes it doesn't
i know the model was just added last week and i need to get used to it
Yes
It's over
Why is the x axis on the pareto chart descending instead of ascending?
Price is going down instead of up
<@&1349916362595635286>
Can't say for the first one but for the second one even on so even I'm not certain since I I i'm pretty sure I regularly do stuff that I am very certain to go way past the 300 words i it just seems to default to Google on honestly at least in my experience
( lost the entire chat just because trying to experiment with Max)
saw the same thing with Claude 4.6 too
<@&1349916362595635286>
what happened?
cloud-buddy = opus?
I asked it to generate the best steak it can and add pieces of 24 karat gold
kimi 2.6
All this and still no claude opus 4.6 in direct
I hate arena.ai
Mimo v2.5 pro is underrated prove me wrong
I’ve already logged in but when I click regenerate, this will pop up
battle mode have opus 4.6 bro
Hello Friend, you can make Rs 13,939.50 ($50) every day working on BunnyBand. You will receive a free Rs 2,787.90 ($10) welcome bonus when you register. You can make more money through social tasks and playing games. Then set up your payment method and withdraw. It's easy to use. Register below to get started: https://bunnyband.com/r/sP8kxVLmRwZS
<@&1349916362595635286>
ban
Still looking for people to evaluate the text for book written by AI
take a ban and get a real muskan on your face!
That does not look like an average prompt
Guess what I had to do like three of them because of errors so yeah I'm not surprised I got it but still annoying
Hello Friend, you can make Rs 13,939.50 ($50) every day working on BunnyBand. You will receive a free Rs 2,787.90 ($10) welcome bonus when you register. You can make more money through social tasks and playing games. Then set up your payment method and withdraw. It's easy to use. Register below to get started: https://bunnyband.com/r/sP8kxVLmRwZS
Arena. Ai is free or paid?? Can someone tell me
Paid with your time
am I the only one who thinks mimo v2.5 pro is like something between Claude opus and glm? Well, it's clear about glm, they're Chinese. but the fact that it looks so much like Claude is very interesting.
👀
<@&1349916362595635286>
we woke up to another day where muse spark beats every model in the world
If you are rich donate zuck more money so he can place 1st
My boi muse spark needs that encouragement
Hi anyone , what’s the best Ai too on arena to generate a realistic experience short video using my prompt
why always popup
Guys you here?
What agent mode for?
I don't understand.
Can someone explain?
Hello by the way
Show it.
Yes and it's bad?
Ask pineapple or something he probably knows
Why
I didn't even do anything special
what is agent mode
Multi model that allows you do coding and etc
more modeles at same time?
Yes
Again it also works perfectly for coding
@slender ledge also it's not available on many users
so basically 5.5 codex this week ?
when will it release
isnt it out already
Ask pineapple but for me it's avaliable.
no its the normal 5.5
the 5.5 codex is not out yet
I also made rpg game from agent mode.
like give an prompt for the agent?
@slender ledge Yes prompt also gives
can u show
Send me the website
btw cuz u got things u also got like opus in direct?
@slender ledge no only in battle mode, I almost got opus but it vanished moments later
sends me to battle mode
Really?
@weak dagger few months ago I signed new account
And it appeared.
Agent mode available for me in early preview
any idea in arena ai i saw a model in battle mode named flow state 3
which model it can be ?
nb 2 pro or something flagship
it was quite good though
I need it i need agent mode
We are currently experimenting with a new feature: Agent Mode. This is a multi-modal chat experience that allows you to work across different modalities within a single, unified workflow.
For more information on experiments happening on Arena check out this article.
like u can give an prompt like u can make agents right or cant u like if u can u give him an prompt
and its jailbroken
Could you like try makeing a new account
Well grok is no rules
Even I did it won't show agent mode anymore.
Hold your horses mate
It will block as violated terms of use
On arena it's not.
i tried to send a suggestion, but it was instantly deleted lol
oh
Try it i wanna see how it looks like
When is Claude opus coming back to lm areana
When the light is running low and the shadows start to grow and the places that you know seem like fantasy
Not
I just got this model too and it was really really bad at frontend
Sorry to ask, what was your prompt?
using the example prompts on the code feature, asking it to create a discord server clone 😅
I'm gonna test with the same prompt other models and see if it's the model being that bad at front end or if it's just the prompt
??
Best new model of these?
7
11
2
Kimi K2.6
guess which is deepseek's and which is 5.5xhigh's
You can use Nuvio
1 is 5.5 xhigh and 2 is Deepseek
nope
Did you get 5.5 xhigh with Deepseek V4 Pro too?
Which I find crazy
I selected it manually I had haiku 4.5 with 5.5 xhigh
5.5 got the server logos right lol and the pings similar
discord doesnt show
- for pings
Overrall it seems like it's worse at one shotting front end though
using it via codex it's a pretty good model but even gemini 3.1 pro at launch was better than whatever this is
note how I'm saying "at launch" since because of how they slaughtered that poor model it's worse than 3 pro at most stuff
using it not via api
Can someone answer a question for me, please?
I'm trying to regenerate an image, and the site keeps asking me to log in, even though I'm already logged in.
try to clear your broswer cache and cookies
lmarena do not longer more generate unlimited images?
how much context do models get after I vote but I submit another prompt?
IIRC there was a blog post that mentioned how multi-round conversations on Arena work
That's odd, if clearing cookies/cache don't work, would you mind creating a new bug ticket in #1343291835845578853?
If you're able to share a screenshot of being logged in & getting that error message that'd be helpful to have as well. I would note to share this by sending a direct message to @oak python so it can be shared privately. I wouldn't want you sharing your email in this server where others would have access to it.
There are rate limits in place. More info can be found in this announcement: #announcements message
actually i found the post: it's https://arena.ai/blog/opendata-july2025/#evaluation-order
thanks friend
One question before I submit this to bugs. In Battle Mode, if I use the "retry" option a certain number of times, is it normal for a screen to appear asking me to log in, even though I'm already logged in?
No, if you're being asked to log in when already logged in, that's going to be a bug. It shouldn't be doing that. There are rate limits, which may be what is happening, but that'll be a clear error message instead of saying you're not logged in.
Gpt 2 image in arena when?
so now all the members are limited to generating only 15 images per day. right?
why this change happen?
Yeah that's correct. We need to have some reasonable restrictions in place so we're able to continue to provide access to these models. We plan to be around for the long-term and these limitations help sustain that.
hii
Hello 
ananas can you check out #1498684119951872160?
if you cant right now, thats fine. take your time!
some image models refuse to visualize prompt containing "higgs boson" because apparently they think its analog to the all known racist word.
😭🙏
im crine son
this is mainly because people were abusing arena's image battle mode to get gpt image 2 before release, people were generating hundreds of images that they ignored in an attempt to get image 2, it cost them a lot of money
nice tag you got there
Which ChatGPT model is best for image generation on Arena AI?
there's an ai company called higgsfield and an unrelated tts model called higgs boson
😭
idk why it would be blocked
gpt model 2.5
bro i am just
HOW DID I SAY THAT SO WRONG
gpt image 2*
though im not talking about model
im talking about this
we got to the point where image models restrict visualization of a higgs boson which gives mass to particles
higgs boson = BAD!!
yeah i know i'm just saying that they are using the same words, it would be weird for arena to block the name of another ai company in their filter (or i guess the creator of the model)
Pineapple if you are reading this
What is agent mode on arena
Because some people have early access to it
i got the envs update
gemini 3.1 pro back in lmarena when
agent can create files and search on internet
idk other|
and u can't choose a model
I get why they did it, but isn’t Arena basically for comparing models? Feels kinda contradictory
It's a new mode we're experimenting with. https://help.arena.ai/articles/1811908126-arena-experiments-agent-mode We're going to be releasing this feature to more and more user over time so keep an eye out for it.
Will u be able to build ur agent
With like an promo
Prompt
like that u can give an prompt
Pls me
@echo aurora Very excited for this
Is it also like releasing for each users
real
tbh I don't get why agent mode so hyped
maybe I'm not a vibecoder (still use AI the traditional way for occasional code)
I've tried it while logged out and found it's quite uninteresting.
Did yall know arena has 2 other sites
Agent Mode is kinda not like fully stable rn
I'm excited for database that's what I meant
agentic ai flows made my iq drop by 50 after using it for months but increased productivity
Yall got early access to agent mode I got early access to the new pineapple plushie
agentic ai flows made my iq drop by 50
nah I wouldn't exchange my 132 IQ brain for such "increased productivity"
Prob why I don't get it
bro got access to gpt 2 image instead 😭
No😠
W
i will vaporize you
would agent mode show up here?
i would Cut you in half
yo chill
Hi
No
This is all cool, but when will we get all the removed models back?
No, it'd be in the drop down where you select Battle/Direct/Side by Side
No DeepSeek results yet?
ah
👀
No ETA yet sorry to say.
I wonder if it works or it's just a page

😡😡give
Is the agent page just for viewing or it actually works?
bro even made a mod nervous
It works
i own company trust
Oh k
Still? Wow, that's interesting.
I wanted to know why new AI models are being added, but I can't find them on the website.
Chill, man
@echo aurora Have many people gotten access to the database and api features yet? I havent found other people except myself encountering that, will it be rolled out to the public soon?
(sorry for ping btw 😭 )
its available on hugging face
not the api
but the dataset
it's updated regularly as leaderboard updates
Not all models are going to be added to Direct/Side by Side. In our announcements for new models we'll make that distinction.
i think agent mode is not available for me yet c: since it was limited to some user
I won't be able to give an early heads up on when new features are (or may) be landing.
Okay
Yeah doesn't look like it sadly. We are looking to roll this out to more and more people. But in the meantime we are looking to hear from those who've used it.
Just to ask does gpt 5.5 Xhigh exist In battle?
Or xhigh models are unavailable
A few days ago, I noticed that the new version of Claude Opus, 4.7, was released. However, I’m having trouble locating it, whether through direct access or side-by-side comparison.
It's battle mode only
Because it's expensive
xhigh version is in Battle
Recently some models have been removed from Direct and Side by Side mode. This was done to help ensure reliability and availability of Arena in the long term. You can find more information in this announcement: #announcements message
im gonna fight mythos one day
cloud buddy is mythos
Why it's rank on leaderboard is not displayed though
Just a hypothesis
so our personal information or prompt go to them and they get money
for selling data
can i sell this data
ban
If google did truly value arena.ai it would have named it's model nanopineapple instead of nanobanana.
Also I think the anonymous models pay to be on arena
It isnt
Banananator
Banananator
who's that
Anonymous models don't have a specific model name honestly
Pineapple could u dm me
the bananasille guy
I got an question
Use Modmail
@oak python
DM Modmail to talk to Pineapple.
i dont know that guy
but banaan man definitely cooks that guy
I'm still wondering about the identity of botbot2
Bro i think kizen beta is a gemini model
can ya check
it might be..
gemini 3.5 pro
It might be just a custom model
holy bro
Trained by people and sent to arena for testing
yes
I missed that day
I'd encourage you to read this blog: https://arena.ai/blog/ai-evaluations/ It's important to note Arena does not sell user data or individual voting information. Commercial activity is limited to evaluation services, not data monetization.
Ahh I see very interesting will check it out- yeah wasn’t making accusations was just quite curious.
Thank you
No problem! I didn't think that you were. I'm more-so saying that for other people who may not know.
@echo aurora
Wym
Agent mode is an experiment, which cannot be opted-in.
Yes I know
But can’t he give acces to an email
Could I get ur acc?
So I can test rq
My account do not have Agent mode
It was a logged-out private window.
No lets not do that please.
K
Pineapple do u have an date when it wil be released for everyone
10/10
Nope, not ETA
How do u get it
i do want to respond to this, if that is what you are concerned about then maybe only have it be able to be opted in for experiments like video arena, agent mode, and the stop button and keep experiments like opus removal and battles in direct as they are where they are solely random and can't be opted in or out
Or u just have to be lucky
Pure luck
Damn
I wanna put this as my pfp in the server
Or hack the server's PRNG
I still think that wouldn't address the concern though. If we did have some kind of toggle for opting into/out of experiments, only those with knowledge and understanding of that toggle are going to use it. When that happens, we're limiting the users in that experiment just to those that are aware of this toggle. That may create a little bit of inaccuracy in the results.
then i guess you could just make the toggle more prominent? or when you launch a new experiment put steps to toggle it in the announcements?
for text i believe arena mainly uses official apis, for image and video they use fal unless the model has an official api and arena has an account with them (i imagine arena doesn't have an account for every model maker especially on image/video models just because there are so many of them and some don't make llms that are on arena)
Ok
where to create ai videos
Video mode
yes
video arena is only available at https://arena.ai/video, the bot is no longer available on the discord
what's even sadder is what happened to the limits, it was 10 but now it's 2
i cant become a youtuber nopw
I can't generate actual video files — I'm a text-based AI, so I can only write descriptions, scripts, and prompts for you to use in video generators.
GAID syndrome
sad
For a year since I've discovered LMArena, I've learned how to use AI effectively.
Now since April 1st I've been learning how to think deeper and learn more.
I have access there
it's an archived channel soo
W to all produce 
@echo aurora question, does the max model route you to any model in the interface, or just the ones you can access through the direct model
So like, could max take you to Gemini 3.1 pro?
gpt 5.5 cooked so hard wtf
Dang that actually looks sick
What game it created?
noita
I mean genre or smth
Ohh kk
Can i he profuce
Hi
its even peaker to play
1 prompt btw
what was the prompt
"make noita but make it look good, max effort"
wtf
yeah 5.5 pro extended is insane
gemini isnt that good its security
man what the hell
i was still able to jailbreak it
this shii costs more than opus
LMAO
its not even better then opus...
openai needs to calm now
i guess bro
china is goat tho they keep open sourcing models
that forces other european Ai techs to reduce their pricings
china is goated when it comes to ai models
AI race got better cuz of china
fr
imagine we get a ALT of Mythos by china and they OPEN SOURCE IT..........
crazy shi
dude thinking of v4 pro is LITERALLY next to gemini 3.1 pro
funfact muse spark is free of cost....
they got no Subscriptions and shi
muse spark is free??
Yeah bro fully free
Till now Meta is planning to make it Freemium in some weeeks or months
but it is free for now
thinking mode fully free
no wonder its trash
yeah kinda it hallucinates alot but with clear context and if we dont chat much in one chat
then its beast
not good at coding tho
i tested it
guys anyone else have the problem with gemini flash image model that i cant put more than like 5 images ? i need to make new chat
yeah its a new limit
u want to make it die broo..... it will hallucinate alot
ah 
._.
isnt there a paid version ?
gpt 2 is bad for the images im doing
what is it give me those images in dm ill do it for you
its fine for me bro see i tried and it worked probably some issues on your end
see it worked like a charm
check dm
you ever notice how a good cup of tea feels simple, but only because someone knew exactly how to make it?
AI projects are the same.
A lot of them look good in a demo, but once they meet real users, messy data, APIs, workflows, and edge cases, the taste falls apart.
I’ve spent the last few years helping startups and mid-sized teams turn rough AI ideas into systems that actually hold up in production.
I’ve built tools across healthcare, finance, and education, helping teams cut manual work, automate onboarding, and make faster decisions with data.
good tea needs balance: the right leaves, water temp, timing, and patience
good AI needs the same: clean architecture, reliable integrations, useful automation, and someone who knows when to keep things simple.
if your AI idea feels tangled right now, let’s talk shop.
I’d rather help you build something people can actually use every day than pitch at you.
Hi
lol pretty sure claude's discord got deactivated or sum rn?
yeah
openai and flow servers are gone
lol
discord is cokoed
Welcome to Discord's home for real-time and historical data on system performance.
Yeah
Yeah looks like it's recovering
it is, not much else to say
some of my servers are broken
hi
Uh oh, looks like the server is struggling again 😭
Which of these is actually most fun to roleplay with (in long and immersive games) ?
3
8
2
Grok
We’re so back
че было с дс
Claudes already been in blender via mcp servers this isnt new its just the official claude talking about it here
but yeah that was hard for begginers
its now easier to connect
theres already mcp for that, and claude is actually awful at it
Not really, literally just download the plugin for blender and tell claude to connect to it lmao
gemini 3.1 pro and gpt 5.5 are only two models good at blender
they will improve it
claude cant read analog clock
its GARBAGE and multimodal reasoning + vision
bruh who tf gonna sit and install the depenedencies for it
litrally as bad as deepseek
Not awful but what you'd expect an AI that isn't trained on making things in blender
i agree at Vision
no claude js isnt multimodal
properly
lets just wait for 3 more months
The hundreds of thousands that used it
Its a plugin
yeah
its literally 2 clicks, or js ask codex to get it
i literally used that
Maybe AI truly isn't a good thing if we got people that cant do anything for themselves atp
yeah ai is good if we combine human skills plus AI
thats a good thing then
cuz AI alone isnt that good
True that, 160K tokens for reading a childrens book and analyzing it
as well
Thats what I'm tryna say, it wasnt difficult to begin with
Depending on the model it is
People make models to be specifically good at doing things by themselves too
But at the end of the day
The only reason claude is "Good" in peoples eyes is because of its creativity in frontend design
true it sucks in backend bro it cant even handle a API and call models seperately
i just tried it yesterday and chat was giving errors
Nah thats just anthropic being horrible
When they crash like that 9 times out of 10 its Anthropic
true i only see people making Trains,Svgs and those shis..
lol truu
And GPT 5.5 still beats it 10 fold in svgs
yeah lets wait for the next model...
There wont be a next model that beats them, and also wont be a next model for OpenAI currently lmao
Scam Altman is in deep trouble
we didnt expected a big jump but what i saw from gpt the image 2.0 was crazy
gemini will release the next model probably now
claude and gpt used their turns now
Maybe, if their model comes out to be better coding than both claude and GPT it will be over lol
Thats the only thing that truly improved, Looking at the most recent releases the AI is hitting a cap and its because they keep scaling
and if China released a ALt to Mythos and open sourced it.....
And now these 1T parameter models are on par or beating
they gonna use new techniques now to improve it even more cuz just increasing the parameters wont effect it they need new techniques of training
Thats already happening, Kimi k2.6 and deepseek are cheap asf for API but are on par with the current generation of models, and also GPT 5.5 pro beats mythos by a wide margin
Thats why these chinese models are winning
Even google is open sourcing great models
china is goat bro they push Gemini claude and Open ai to give even more better models for cheaper
Gemma 4 being small enough to run on my 1660 super 6GB vram is insane
And its not bad
cuz china opensources every thing
true
Well they are pushing them to go bankrupt
Because US models keep scaling
All of them except say gemini
nah i read that china is doing it for research not really for money
Gpt img 2 it generated the joke too
Right, the US companies are and they're going bankrupt because of it
us companies gonna get bankrupt cuz of China, china really doesnt cares about money
china is playing smart
Not what I'm saying here
They're going bankrupt because they keep making larger models and the compute cost is too much, its not literally chinas fault
China also put 1.3T dollars into their research and technologies
Recently
truee i agree they need to adapt to good techniques or find some theirselves
Thats the biggest budget in the world for technological advancements
fr they can save lot of compute power
nah i would say even more
nice joke
CGFI
The IQ is dropping in here, its room temp
wth is voicebox then?
I don't even look at benchmarks and will say chinese models are equal to the current US generation models lmao
i would say better they will get better with time more better then US
To say they're at least 4 years behind is genuinely just a braindead statement
I don't see why you'd hate the advancement of AI so much
Just because its a different Country lol
fr bro
Guys giving us vectors
Yes bro 😂
3video arena
i dont think gpt 5.5 pro beats mythos by a wide margin at least according to benchmarks
hes talking about the real use remember when Meta faked their benchmarks....
Maybe not a wide margin but either way it beats it. And yet GPT 5.5 Pro isnt hacking into governments like they suggested could happen with mythos lol
Mythos was all hype
idk i just found this benchmark
I see, those aren't accurate but after looking at another one mythos is indeed better on all of those
Since we cant use mythos we cant test via side by side
is it a ElevenLabs thing
why image generation is rate limited
GPT5.5 being utter crap, Mythos is not that AGI model so strong they won't release, just another marketing stunt
We have limitations in place for a sustainability purpose.
Need to be aware of spend so we can continue providing access to AI for the long-term.
GM guys

its voicebox
Is it just me or is the site being really slow
Its fine for me
So... Opus is showing up, but it's not working in direct chat.
wisprflow is secure?
In this video, I test ChatGPT 2.0’s image generator to see if it can create high-quality product photography ads.
From skincare and snacks to drinks and baby products, I push the AI to generate realistic commercial-style ads using different prompts and styles.
If you're into AI tools, content creation, or product advertising, this is someth...
Can't say I'm seeing the same.
sup guys
<@&1349916362595635286>
<@&1349916362595635286> well, another hacked account
hey, everyone 🙂 hope you're doing alright
just checking, are you open to outside devs helping out?
would be glad to jump in if there’s space
Looks like the new regex rule didn't help :/
We are hiring!! Would encourage you to check out this page: https://jobs.ashbyhq.com/arena
new regex rules?
An automod rule that should have preventing that from appearing.
when it fix? ask me to login everytime
ooh, well that likely won't help, because even in heavily moderated server, that deepfakes scam can still appear.
I guess the most effective way is manual moderation since we always ping moderator roles when someone sends deepfake scam.
not sure MEE6 discord bot could help with this or not
damn, my english are trash :3
Dude atleast give us back opus for this type of restriction like you just want more pay
And im starting to believe sonnet 4.6 is actually something lower
Claude doesnt know its specific model so theres no way to prove it
And especially after the sustainability announcement removing claude opus sonnet got worse
HI GUYS I MADE IT
MY AGENT IS ALIVE
what the wonderful day
Yeah I've been looking around and this is an ongoing issue for lots of servers. We're still going to look/change ways to try and stop it as much as we can.
You got agent mode?!
With the new usage system we're hopeful it'll result in those models being brought back to Direct/Side by Side
So
You are in charge of the development decisions ye
If you're saying we've changed the models for something worse, I can tell you that's not the case.
It has decreased in precision and i have a wobbly point but still
No, as a team these decisions are made. I play a part in that, but I'm not solely making the decisions.
Is it a slow rollout or a canary thing?
I'm confident we wouldn't do something like that. I'll keep an eye out for others saying the same and raise the issue if it's seeming like something is off. The people working at Arena care a lot about this community, they wouldn't intentionally do something like that.
Slow rollout, isn't yet on canary.
Ahh I see
Can I have
login error
Sorry to say we don't grant access to experiments on request
Don't hesitate to share your thoughts with us in #1498702173650030756 , we're looking for feedback on it
What is the error?
Is there more info you can share?
what's the error you are getting?
Ill enable agentic feature flag i guess
no, i programmed agent in typescript using local ollama and mongodb
and its finding proof of life of itself
kinda weird
at first glance, ctrl+F5 and see if it fixes.
@echo aurora hello i would be very happy and can do anything to help arena grow / manage discord server but is there a way to like use arena for free on tools like cursor or antigravity or vscode like make arena api or whatever access and edit my program code directly
alt detected
Does Max model still route to the top models or only to the best available ones?
Has anyone tried DeepSeek V4? What's the user experience like? And how does it compare to Sonnet 4.6???
/image to video
Fake
time to hunt, moderators
true
Refresh
Shesh
What is the return deadline?
LMarena's dead 😭 next step close the site sad this...
People, is it just me or all the models disappeared from the site?
How
where are all the good models like gpt 5.5 or gemini 3.1 or opus?
when are we adding python in code leaderboard
I want to create an image but I get this message. I already did it, it goes away. I try to create the image again, but the same thing happens and it won't let me create the image again. I've been like this for a week, and it happens in emails when I try to confirm. I keep getting that message from the second image. I want this to end because I'm seriously getting fed up with these games. Fifth-rate programmers
Omg, 4 hacked accounts in 2.5 hours??
what the
gpt-5.5-xhigh didn't last long? saw it once popping up in comparison. then in list of models, but didn't have a chance to ask it anything directly. now it's gone as well.
3 deepfake scam in last hour
4? now counts on 5
Now yeah
it is so much
It's not a deepfake lmao
It's inspect element
I remember people used to post these 10 times a day and they didn't know how they got hacked
Some people speculating on Fortnite hacks or something
Again lol
guys why did they remove last updated from the arena.ai ui
i want to know when leaderboards last updated
How do I choose the model I want exactly because models in direct chat are limited
are you referring to claude gang, if yes, its not there in direct chat
Oh ok
hello
why lmarena keep asking me to create account if i was already in my account
when i try to generate pic
im using an alt and it belongs to another person so theres a phone
I DONT EVEN HAVE
Guy i using claude 4.6 searching mode just few prompt and got limit token lol
did anyone have access to seedance ?
WHY WOULD MR BEAST EVEN DO THIS
yoo guys
do you know you don't need to work physically before you start earning money
Cause I do and I earn at least $1000-$5000 daily
I can take you through the process if you don't mind though
no
how?
I think u need pro idk
uh gemini is Something went wrong with this response, please try again.
What genimi?
3 flash (nano-banana-2) and pro 2k
i see
hmm i try again but i doing generate images
I think the api has an error
Like the api their using
I don’t get error
3.1 works in image
but i retry failed again
gpt-image-1.5-high-fidelity got this too
Something went wrong with this response, please try again.
Is Deepseek V4 Pro the best FREE model rn?
1
2
Wym
What update
Isn’t that open ai ceo?
yes
but generated by gpt image v2
so if u ask for a sam altman pic it can generate easily what I meant was that famous people that r recognized by AI can generate themselves a good pfp
cuz the AI knows them and it can generate their face
Sadly like ppl like locally famous like an famous singer In India or something doesn’t make that great
Yk
Come dm
Come dms
I hate this guy!
How do I generate videos again?
Sorry to say this is a bug, our team is aware of it but I'll bump the issue again. Would note I did delete the screenshot as it had your email visible in it. We wouldn't want others having access to that email.
You can use the Video Arena in https://arena.ai/video. More information on how to use Video Arena can be found in this article.
Thank u
seems this page haven't updated for 3days since April 26,when will this ranking page updates ?and when will GPT 5.5 xhigh release ?
We won't have an ETA to provide sorry to say.
It just depends on how many votes we're seeing and how fast we're able to validate.
I understand that you don't have an ETA, I would appreciate it if you can look at the background data roughly speaking, and is GPT5.5 xhgih in the process?
@echo aurora How do I use Cloud Opus 4.7? It is not showing directly or side by side. Please help me. Please tell me.
Recently some models have been removed from Direct and Side by Side mode. This was done to help ensure reliability and availability of Arena in the long term. You can find more information in this announcement: #announcements message
-pineapple
How to use these models and where to use them
These models aren't available in Direct and Side by Side for now, but they're available in Battle Mode if you're lucky enough to get them.
Generally, updates will take around ~ a week for an update, but really just depends. And yes xhigh is currently in the process.
Gpt 5.5
It's the same for 5.5.
yep
In the announcement post we clarify it's in Battle mode only - #announcements message
This also cannot be done directly or side by side.
Does “by Saturday” mean available in battle mode only, or officially appearing on the Text Arena Overall leaderboard?
since when did qwen 3.6 max come
No announcement for it???
?!
so, April 26 is the last for Text Arena overall ranking in April ,Right ?
That's odd, can you hard refresh the site or try a different browser? Is it just image where this isn't appearing?
I couldn't say.
Sometimes we don't put out announcements for new added models.
Tufffff
Thanks for sharing this Trace, I don't yet have an answer for why this failed, but it is being looked into and we will create a bug report for it. For future refrence, if you come accross this error I would encourage you to post in #1417174113092374689
You'll find some troubleshooting steps in there.
what you thinking about deepseek v4? in start I did so rude about v4, but I1m using it and liking the deepseek v4 to make my documents
all good
it feels better than launch
I love to do a benchmark asking a pharse can I read in 2 languages
but its still not as good as mimo
is this just for me or once you use a chat for long enough it just starts giving you the traceid and doesn't respond to the question
to me mimo sucks
the portuguese from mimo sucks
worse that my english 🤣
JohnPork
Please give us Opus 4.7!!! 😭 🙏
opus 4.7 is better in the USA than in other country ?
This can happen for various reasons. I'd check out [this message](#1417174113092374689 message), which goes into more detail about what may be happening and includes some steps you can try to resolve the issue.
bro got alot of reactions
Has my idea about a music arena been accepted or not?
(or did you forget it)
use sonauto its better than suno 100%
pay for it
I use lyria 3 pro personnaly
It has synthid tho
😔
I couldve used lyria 3 pro too cuz I have alot of credits in Google Flow Music (formerly Producer, Riffusion)
then get money
Trying but job hunting is difficult.
I think nobody is searching to see if SynthID is on a music
This may be an Arena we create in the future. I couldn't share more details about what may be upcoming.
im excited for the database feature
and the environment variables
what if u get famous
ok thanks for the answer
don't be famous then
but then u get famous
delete ur account
skill issue then
but then the issue gets fixed
But if you don't have the skill, they can't be fixed
But the skill is fixed
the new ernie is bad
Holy
wth..
Ernie is a really good model
In what way? Any specific area you're referring to here?
And its Direct
- its better than sonnet 4.6
Hm
does erine has better instruction following
and fresh writting?
Dunno
w or L image made by gpt img 2
Big L
howww
Bad quality (text)
@echo aurora Image 2 made a good illusion
Stare at it for like 30 seconds and look around
hi, ive tried the steps
is there a way i could like open a ticket or something of the sort
Lol pass, getting dizzy just glancing at it
What was the prompt?
I'd encourage you to share the Trace ID in that channel. I go through these each day to investigate and share with the team.
:b5c56ce6-bd6d-
thank you!
Kk going to forward to #1417174113092374689 and will followup there.
Illusion Image to make us think that it's moving when we move
Weird prompt lol
Just me or is https://arena.ai/ not loading?
me too
Eventually loaded, but took awhile.
tuff background gpt generated me
and a good wallpaper
Prompt: Abstract soft blue gradient background, smooth flowing blurred shapes, gentle organic curves, dreamy ethereal atmosphere, soft pastel tones, very shallow depth of field, smooth bokeh-like transitions, minimalist composition, high quality, serene and calming mood
Assistant A or Assistant B?
Idk I generated in direct
I'm going with A, feels like it has more depth.
Yeah
B looks like the average design on plates/bowls in my country
same because B looks more perfect
it needs some distortion
That sounds charming, my plates are just grey and boring 
flux-2-flex (A) & flux-2-max (B)
I dont see the it moving
I liked the design on B a lot more, + the color was a nice touch. But IMO "make us think that it's moving " was better achieved with A.
A
NAH
Expland text
Expand*
Anyone wanna see my sticker?
Yea i have been loving agent mode so far!
And the new backend and api features in code arena have been really cool too!
Tho they have a few bugs
What is it?
I just joined, how can I start creating videos?
https://arena.ai > Battle Mode > Video generational button toggle
Here is a direct link -> https://arena.ai/video
can I generate u a wallpaper
but with pineapples
Perfection 
Code Arena is going to be your best bet for front-end work - https://arena.ai/code We are in the process of expanding these capabilities to full stack https://help.arena.ai/articles/7138211024-arena-experiments-code-arena-fullstack
my new pfp thanks image 2
Ofc, fire away
its just bad lmao
i tested it and it sucks
How do you use imagine 2?
It's in Direct/Side by. Side, but you'll need to scroll down the list a bit before seeing it.
The dropdown list will eventually be updated to reflect the leaderboard.
Yeah but I kinda use it through the original gpt too
Big fan. Very tropical
Yeah
Nvm I see it
I'm the gpt... myself
Yeah image instead of imagine + the -
ask me to generate stuff
im banana-image-2 powered by gpt image 2
but renameddddddddd
Can you hard refresh the site? It's just image models that are missing? Are you signed in?
Ok now they're back but I got the experience the frontier text of doom
Refreshed everything is normal again
Yeah saw another user mention the same earlier. There was some lag with the site earlier too, which I think is related.
So what are you guys using for free alternatives now that Arena got nerfed? Deepseek, ChatGPT, Claude?
DeepSeek or glm
but I think I’ll use ernie more now
Nice it’s the best Chinese model now
These are Chinese models
Like Claude and ChatGPT
They have their own companies improving their models
Glm is smarter overall
I have to look at the benchmarks
But I think glm was better at the majority of things
Damn didn’t know. Though Deepseek was the best of the Chinese models. Thanks for the top. Will check them out.
DeepSeek is cheaper
Of course
I’d recommend you checking Ernie too
From Baidu
It’s a new model so it’s always good to test it and maybe you like it more
Thanks. You know if there’s a iOS app for GLM?
You can change the words
So youuu don’t trigger the filter
But overall
I don’t recommend it
It’s for testing
Not roleplay
Nop
Ok thx
I believe it’s the only ai company without one
so i can't generate p**n stories, photos or videos on this website ?
Sadly no
