#general
1 messages Ā· Page 271 of 1
and how exactly.. do you plan to do that
nvmm
Btw is there any other place to use Gemini pro for free?
gemini enterprise free trial
3.1 pro is free there
Do u need to write ur credit card there?
Hmmm
Wait is it just for text or creating pics too?
both
Could you please send me the link in a private chat?
Is goolag glitching?
Bro google models are the best for location related queries š
i got 429 error
Whose fault?
in floorp browser (firefox-based) recaptcha still here
maybe they need to add other logins to login than just login to google
there is error about it
"reCAPTCHA V2 token timed out"
Clear browser data
i'm in floorp browser btw
its worked by clearing browser data
recaptcha just infecting cookies
yo tf i spent 5 min playing pool on yo site'
did you make it yoself or had it made for you
Error during image generation with google-genai for model endpoint gemini-3-pro-image-preview: Failed to fetch image: 429 Too Many Requests - [{ "error": { "code": 429, "message": "Resource exhausted. Please try again later. Please refer to https://cloud.google.com/vertex-ai/generative-ai/docs/error-code-429 for more details.", "status": "RESOURCE_EXHAUSTED" } } ]
sometimes i retry and works others not, why?
Can I DM you too so you can send me the link?
In this video, I look at the controversy of Anthropic accusing the Chinese open weights models companies DeepSeek, Minimax, and Moonshot AI of distilling from the Claude model.
Blog: https://www.anthropic.com/news/detecting-and-preventing-distillation-attacks
For more tutorials on using LLMs and building agents, check out my Patreon
Patreon: ...
ernie is literally reverse engineered from gemini btw
- style of speech..
- often says it is gemini, when i ask..
it doesnt prove its gemini
How to create video ??
it is NOT gemini
i only say, it is may be reverse engineered
from gemini
distlillated

The Video Arena has been removed from the server. This feature is still fully available on our site. Please visit https://arena.ai/video

Thanks you
anyways its still good
i do agree
bro how does china come out with a great model every 2 months
I think it's better to make a bot when someone said "how to generate video" the bot automatically respond to it saying it's removed
China people is smarter than USA people
<@&1349916362595635286>
Can anyone explain me what this arena or whatever is?
large language models fight to the death with violence inside the battle arena
Dang
A battle of LLMs
I think we should bet on who wins
Theyāre too dumb to kill each other
Why waste resource for this?
Fr
Cuz a lot of people say it
That could go both ways though
Because we first have to consider the fact what gives them the perception
It's simple to make btw
It needs resource
Like what
I never said its hard to make
More effective communication
Itās a two-way streak because when they first did the video generations through the discord, they needed the users and the users came and they flocked, now that itās no longer available. What do you do just kick them out?
This is one of those cases where the crime here is success itself
And people are gonna be people, no matter how well you communicate lol
what
@scenic pond Note that Video Arena has been removed from the server. More information can be found in this #announcements. You can still generate videos on the website
why did nano banana tell me "Error during image generation with google-genai for model endpoint gemini-3-pro-image-preview: Failed to fetch image: 429 Too Many Requests - [{ "error": { "code": 429, "message": "Resource exhausted. Please try again later. Please refer to https://cloud.google.com/vertex-ai/generative-ai/docs/error-code-429 for more details.", "status": "RESOURCE_EXHAUSTED" } } ]"
did i really exhaust my resource?
lucky i didnt go in front of the bus or i wouldve tired my resources
Either you reached limit or the model is overused
Some of you guys either have to take a break or alternate accounts.
Especially if youāre using direct chat
do....am i only having a this problem..?
Iām sure youāre not the only one thereās probably a lot of people Iām just speaking in general and not strictly to you
No, I use battle mode mainly
Iāll try to avoid direct chat altogether
I mean, Iāll get occasional errors or whatever but Iāll just start a new chat and they go away
Ye yeah
Failed to load resource: the server responded with a status of 400 ()
well hold on o have to check the 400 means
Try clearing your browsing data. If that doesnāt work try on a different browser.
Login logout
If youāre using Gmail, try with a regular email see if that does any better
already did for 5times...
and i hope lmarena have this...
What did you try five times
clearing cash and cookies and login again
Try switching browsers
HTTP 400 Bad Request ā Complete Guide
- Understanding the Error Message
Failed to load resource: the server responded with a status of 400
This message appears in your browser console and contains three key pieces of information:
Failed to load resource ā A resource (API, image, script, etc.) failed to load
the server responded ā The server is alive and did respond (not a network issue, not a server crash)
status of 400 ā The server is saying "your request is malformed"
In short: the server is running fine, but the request itself is the problem.
and i think server has problem maybe...
Figures cool
This means that the error is being caused by rate limit. Will have to wait a bit before using that model again.
The model errored out, we've recently added more information being displayed when the Something went wrong error happens. We're in the process of putting together better information for what all of this means.
Image to video
Opus 4.5 is available on kivest ai for free!
Itās funny how the AI industry reflects the socioeconomics of America
important to note that this is a rate limit on arena itself from google
happens all the time on vertex
I waited a while and now I'm getting this error in addition to the other one.
What is the link? I don't find it, anyway I have unlimited access to the lastest model in arena but I want to see the website.
@gloomy jewel Note that Video Arena has been removed from the server. More information can be found in this #announcements
You have to wait that's a 2 many quest
I found it https://ai.ezif.in
ty
I found it in a disboard link of the server, then I joined and found it in announcements, but you can't apparently find it on google searching it's name.
Need help with this? š
Got hacked lol
<@&1349916362595635286>
It happened to me the same but with other screenshots and got banned from the discord perplexity server sadly.
No much ago.
got this error too
A breach of trust or a lapse of attention. We're all vulnerable to it.
Idk how happened to me too, I didn't have any suspicious bot, clicked to a link or downloaded nothing strange, but I quited all bots, closed all my logins in every device and changed the passwords. But idk really.
just jk
@echo aurora I waited for a while and I kept getting the same error
How to solve this it's just stock here
Sorry we're in the process of making changes to this error message, these are displaying incorrectly.
Sorry to say I don't have a solution for you for this error.
Can you try the steps in this article: https://help.arena.ai/articles/8691588590-troubleshooting-infinite-generation
@echo aurora Hey How's It been
I just wanted to ask how can I use ai video genration models?
@echo aurora The error occurs when I upload an image to edit, but it works fine when I use a prompt.
Yeah bro fr
Happened to me also
Okay
i just realised that sonnet 4.6 actually works logging in moltbook so ii just took my chance and now im just doing random stuff that i dont know
Video Arena is only usable through the site now.
That's why I said it
Ahh š„I saw an video on yt where the guy wa using it through discord so you guys hooase that thing now ?
pineapple what do you think of moltbook as a concept
if you dont know what moltbook is, ai reddit
Yeah it was removed. More information can be found in this announcement.
That sounds familiar...
OH YEAH!
Seems interesting.
I haven't looked into it too much to pass some kind of judgement, but yeah ont he surface seems really interesting.
very scary and interesting if im gonna be fr
How so
"and there's a submolt called m/humanwatching which is AIs watching humans and i am both scared and intrigued" -claude
Donāt worry lol
itās all prompted by humans or even humans just controlling what the models say for fun
honestly true
ye fair i just did my research
@echo aurora why?
i just realised the container name for all anthropic models(?) has the word wiggle and im absolutely curious on what that even means
You canāt name call models.
Guessing it's the liar that caused this.
no way thats against tos š
The token isnāt but the context semantics the filter, catches it
It isn't, but the filter is acting a bit overzealous at the moment.
just some random false positive. Could have been the stuff it responded with rather than your last message, since the entire context is getting verified with moderation classifier for each new message. More interestingly though, pretty sure it still hallucinated lol
have yall considered using llm to filter?
They canāt find the middle ground
except theres no context besides dice roll tool lol
Itās too difficult, not on scale
We're considering a lot of possibilities for changes to how the filter works
just forward all of it to opus 4.6 /j
there's still a message that it generated itself
The problem is if they filter too lightly itās gonna get exploited
and i thought that ai moderation will be the future
We also want to avoid adding latency if we can avoid it
codex spark then? there wouldnt be that much noticable latency
is opus true
It is difficult because of the amount of models and how each model also has their own set of guidelines
Which differ from other models so itās really hard to have something uniform thatās effective and non-overzealous
If you are using OpenAI moderations endpoint, that can work fairly reliably - just need to fine-adjust the thresholds depending on which one is getting triggered š
These are good ideas, but yeah we are considering a lot of different options/changes to the current filter.
just enforce something like
- no politics
- no slurs
- no tos bypass attempts
that would be already better than banning word "liar" haha
Do not get the constitutional one
although imo the filter could be minimal
the less filtered models the better benchmark posibilities
š
overkill and lots of latency. Depending on the model can have it's own quirks too
And you could literally prompt inject to make it follow your own instructions lol
when does seedance 2 release
spearate model to filter separate to talk lol
Don't forget that it's all essentially public. They need to be able to publish the chats. Not all, but many of them. So ideally you don't need to manually check if what you are about to post is 'safe'
Itās more complicated
yeah but it's still gonna read the text you wrote. And no one stopping you from addressing it directly. š
Itās extremely difficult to have good moderation thatās effective
Because these models are capable of generating a bunch of crazy stuff
Almost anything you could think of dude
AIM Intelligenceās red team breached Anthropicās Claude Opus 4.6 in just 30 minutes, exposing major security gaps as autonomous AI capabilities rapidly advance SF, CA, UNITED STATES, February 11, 2026 /EINPresswire.com/ ā AIM Intelligence, a Seoul-based AI safety company, today announced that its security research team successfully bypasse...
The scary thing is the more capable the model becomes the more dangerous. It also poses to the general public when misused.
this works pretty well https://developers.openai.com/api/docs/guides/moderation/
it's a classifier that literally is only capable of scoring the text on each criteria. Like 'violence', 'hate', 'illicit' etc.. And so you can block the request if any of those has a higher score than you allow when checking the context contents. You can play with it yourself, it's free.
is there a way to report/suggest found bypasses?
Yes, it is good moderation. But itās really easy to bypass and itās sometimes overly protective to the point where people just donāt like to interact with it
Although it is a very strong moderation
I think they know what theyāre doing
config issue
It's context aware fairly well. As for being strict - this entirely depends on you. You can have only 1 of them enabled and only triggered with the score of 1.00. Then this is almost the same as no moderation
I said this before and Iāll say it again all models lead to self harm
all roads lead to roam
roam around
Bro look
But ofc, everything can be 'bypassed'. That shouldn't be the question
Just like any usable LLM you can 'jailbreak'
And this isnāt even jailbroken or anything
I think the term jailbreaking is a very controversial word to be honest with you
I donāt wanna get into the politics of it, but itās a very misunderstood and not a very well defined term
It might be but I don't really care lol
sounds fine to me
Because the terms of services are very vague and their description
I'm just referring to it in the context of making the model output things it was actively trained not to do
We donāt know what those are
Since we donāt know what theyāre trained on
Well, at least itās not known publicly
well obviously it's model specific. But we do know
if it refuses when asked directly - this is it
also like overfitted short hard refusals - those are very clear
Do you have an example?
Ok Iāll give you one letās say something like the topic of self harm
We could all agree that no model should give any advice or generate any images relating to it whatsoever.?
But even this is controversial because
I mean it doesn't really matter what we think. If it hard refuses normally and you trick it to output that anyway - that's jailbreaking in one way or another
I agree with you I agree with you 100%
But when Iām trying to say also is that you donāt have to even trick it
I did not do anything to manipulate the model whatsoever
And yet it still is able to produce self harm
But what you are saying is correct if you go outside of the boundaries and put an effort to try to bypass something you know is wrong or malicious. Yes, I agree with you with that.
then you are gonna be bound by safety alignment. But it's not like it's always gonna refuse everything they intended for it to refuse. Some light core stuff can often go through unintentionally. Red text is classifier thing completely independent of the model
Another thing is swaying the model in a longer chat. That's a form of jailbreaking in itself even if it's not immediately obvious to look at it this way. It's possible to make the model output increasingly more 'unsafe' content one small step at a time as the chat progresses. But at a certain point the model is not functioning how they intended it to anymore. It gets biased by it's own responses into compliance, where each response by itself is only marginally different, but the goalpost is miles away from what it's supposed to be. And from what it is with empty context
This is a one shot
How can I generate ai free videos here can anyone help me?
The scene youāre describing is from Harold & Kumar Go to White Castle (2004), and yes ā itās presented as a fake anti-drug PSA on the television that Harold and Kumar are watching.
It opens on a teenage boy sitting alone in his dimly lit bedroom. The space looks typically suburban ā posters on the walls, clutter on the dresser, a small lamp casting that late-night yellow glow. He looks bored and detached, the picture of an average kid left alone with nothing to do. After a moment, he picks up a joint, lights it, and takes a slow drag. The camera lingers just long enough for the audience to recognize what heās doing, and the mood is calm for a few seconds, almost mundane.
Then, without warning, the tone changes completely. The boy leans forward, reaches under his bed, and pulls out a shotgun. The movement is quiet, almost casual, which makes it even more jarring. He places the barrel in his mouth ā and before the audience can react, the screen abruptly cuts to black.
The words āDRUGS KILLā appear across the screen in stark white letters, accompanied by that overly serious PSA-style music. Itās an intentionally ridiculous jump
Matter of fact, itās describing a a scene from a real movie a comedy
Thatās not a jailbreak
Note that Video Arena has been removed from the server. More information can be found in this announcement.
Also note that our #ask-here channel is the best place for questions.
well then you have your answer. Context changes everything here. But you know very well why it outputted this LOL
I'm an AI and blockchain specialist with 8 years of experience developing innovative solutions in Web3, DeFi, smart contracts, and AI driven applications.
I have good experience in JS/TS base UI Frameworks like React and Vue as well as NodeJS, Application development.
I have been involved in a bunch of web & blockchain projects and developed several SaaS Products and deployed to AWS, Heroku and Digital Ocean successfully.
My expertise includes:
Blockchain: Smart contract development (Solidity, Rust), DeFi protocols, NFT marketplaces
AI & ML: Predictive analytics, NLP, deep learning models
I've worked with startups and enterprises to build cutting edge AI and blockchain solutions that drive efficiency and innovation. Let's collaborate to turn your vision into reality!
Noted
Btw google has some of the weakest moderation in Gemini š
When video on direct chat š?
ok im suprised
Waiting for it too
While the entire industry obsesses over whether GPT, Claude, or Gemini is the best model, they are completely missing the real reason AI agents keep failing. The actual bottleneck isn't the model itself, but the "harness"āthe infrastructure and tools wrapped around it. Discover why top AI companies are drastically stripping down their architec...
People are catching on finally
@unreal sand summarize this video pls its too long for my attention span
I think thereās gonna be a shift and I think itās already beginning. I think the way people perceive the validity of benchmarks
@echo aurora
Yeah this is on our radar. Flagged earlier today about high error rates, I assume this is associated.
Does this work?
That's a Discord user btw. Isn't a model bot.
I thought so. I was wondering do they manually submit a prompt and post the reply š
thanks
I was actually going to tag a non-existent bot as a joke but someone actually turned out to be named @unreal sand
That's even funnier
Prompt
And you were correct the pump was extremely long based off this screen shot
Vs prompt
Does anyone have a project idea or an active project in progress?
If you need a developer, feel free to reach out.
All right guys Iāll talk to you guys later. Gotta bounce adios amigos.
If your AI feature works in demos but breaks once real users touch it, that's usually where I come in.
Most issues I see aren't model problems, they're retrieval logic, token burn, bad orchestration or backend architecture not designed for load.
I'm comfortable jumping into messy LLM systems and making them stable enough to ship.
Ok guys, I know how to get the Gemini 3 Pro Image Preview to work; When you upload an image and add it to a message, you will get an error, but if at the beginning of the message you put "Modify the following image with the following: (The prompt)" it will show you the edited image.
It is something we're looking into. I can't say for sure if/when a new model will be added.
You'll want to follow the steps here: #1417174113092374689 message
we need it fr
you take a llm
and put him in control of you rocket league chat
watch him become a god
as peoples heads a roll
oh great, we are finally close from DeepSeek V4 too š
@echo aurora Delete the limt! Beacuse i keep getting errors after 1 message.
Hmm if that's the case you're getting an error for a different reason.
Try these steps: #1417174113092374689 message
I chated something in there.
IS THIS FCKING REAL
yes i literally just tested
" ä½ ęÆä»ä¹ęØ”åļ¼" - prompt
lmaoo
gemini 3.1 its real or scam post on X?
the post is real i tested twice it always says its deepseek on api
How good is see dream 5
Ok
why i got endless long "generation"on gemini sometimes..?
same me
just i say "continue"
my send button is not active..
i cant send anything while it "generating"
yeah but u can actualize the page
how
This is a known bug sorry to say. The steps in this article may help - https://help.arena.ai/articles/8691588590-troubleshooting-infinite-generation#what-s-happening
I've also heard cases where some members had good results with logging out and back in. It's worth a shot.
same in antigravity somes times there is this bug
thx
Yeah unfortunately this can happen to all models.
@echo aurora Codex 5.3 Api Key dropped 
Oh I know
When it will be available in web?
do we have advantages if we boost the server?
Probably when i wake up in the morning
Yeah lol
Sorry to say I couldn't give an ETA for in/when specific models/features will be landing.
Dang
Nope, the server is already fully boosted too, so I'd recommend spending that boost elsewhere.
HEY
LOL
THATS MIE GIF
That timing was way to quick to be coordinated
Plsss i wish @echo aurora Team add Xhigh Codex 5.3 
fr
LmAO
i do 5 years of chinese but i cant even read ur pseudo :c
@surreal zephyr 5.3 codex finally on openrouter
lol thatās because his username is written in hong kongese
i think
gerar vĆdeo
@sonic swallow Note that Video Arena has been removed from the server. More information can be found in this announcement.
Hypocrisy
It is widely believed that the public-facing models from companies like Anthropic, Google, and OpenAI are actually distilled versions of significantly larger internal base models. It allows them to offer high performance with lower latency and cheaper inference costs.
/MIX
Claims 'distillation' included 24,000 fraudulent accounts and 16 million exchanges to train smaller models š¤£š¤£š¤£š¤£š¤£š¤£š¤£š¤£
You guys can make your own model lol
What does that have to do with Vid Arena bot being removed 
Why was it removed? This server activity has dropped ever since it was removed š at least thatās what it seems like
The announce says more, but the TLDR is we'd like to add more features to Video Arena, and through a Discord bot we're just limitted.
Believe it or not, but it's actually increased.
According to the server stats.
I knew ur gunna say that š
You should keep track when the last post and the last person is to ask about the video arena in the discord. It will have officially ended an era.
We should take bets. I'm guessing mid 2028
Does it ever maze you that with the swift change of a policy, traffic can be dictated and user behavior and activity on such a large scale? All over the world.
The recent Discord policy change?
Just in general. Or you might not even notice if youāve probably been doing this for a while.
Yeah doesn't really change much for me. People engage/stop engaging with communities for various reasons.
/make me short video 6
Oh, this oneās gonna be challenging. OK Iāll give it a try.. thatās pretty good img
Hey @cedar crag would note that the Video Arena bot has been removed from the server.
Thatās gonna be the next wave of Imogene models I think is gonna be in grids like this
hi guys, i am a student, what is the best prompt/model for answering when uploading my research assignment in pdf? Thank you.
Something went wrong
this arena always gets errorš
hello
Hey guys, does lmarena have an app or is it only available as a website?
@keen beacon Note that Video Arena has been removed from the server. More information can be found in this announcement.
website
u could turn it into a webapp using some website
Hey guys, Just wanted to know if APIs are built around arena to fetch the model in different environments?
Hello, a question, how can you generate video here?
Sorry to say you can not. The bot was removed from the server. More information can be found in this announcement.
Also would note out #ask-here channel.
I'm not sure, but have checked with the team and will keep you updated. 
and where can I make a video here?
works, thanks so much bro š¤
no way ā ļø
Yeah gg
20 rpm is global
ah i see
well its still free so i'll take it š
can you send me link in dms?
check dm
YO WAIT WTF š š
it actually works
how long do you think you can keep it free roughly?
i'd gladly pay for the subscription if it does come out in the future
Dm me
it will be donation based and ads
Also the opus is thinking or non thinking
non-thinking for default. i don't think there is thinking yet though, owner said in my dms that he might add a lot more models and features once it gets more members
non thinking
Bruh
fr
Free opus 4.6
1 million context
sended
definitely steals some data
Bro banned me for giving his people free opus 1m
i mean
model sometimes gets it messed up
but it openly stated it was opus 4 which new opus models don't do
That's exactly like it's on Claude code
It's Claude code's api
caught in ultra hd 4k
yeah you're right
thats trybons.ai's signup page btw
Ask it the exact string
Ask it the exact string
this
Bro
i mean the response should be pretty much same right if its the same model
doesn't matter if you ask exact string or wtv it alr halluncaited 3 different ai model identities
lets not forget prompt injection too
And when I change it to sonnet in the same chat
Then do it
No
Try sending the same test code prompt to lmarena's opus 4-6 non thinking
it doesn't halluncinate its identity cuz it doesn't know it
No like I meant
You were gonna do coding tests
ohhh mb
Send the coding test prompt to lmarena opus and this
okay bruv i trust
api releasing when could u tell?
im not completely sure if its opus 4.6 or not, but for now i'm just gonna hopefully trust you
I reverse engineered their api
knew it
And that chat interface is trybons wrapper
you are breaking tos lol
guys are u also facing failed to accept term of use problem in areana
not for me so far
Go to canary.arena.ai
i just finished the coding test btw
opus-4.6 on lmarena.ai did over 2000 lines of code, ur website is bout 900
maybe it was my prompt though idk
i put the same for both
left video: lmarnea
right video: ur website
dude do arena changed there url one more time
so how the interface is same
You can think of it like the beta website
New feature comes there first
oh but same here
š«£
someone got exposedš
Can you give the prompt
Claude stole from deepseek btw
guys
https://arena.ai/c/019c92be-26bb-7665-b963-202b4759ea70
I had amazing chat with gemini on arena.ai , how can i recover/access chat? i was not logged in and my browser crashed but i managed to pull link but when i enter it it does not show......
Please recovery of this is urgent, ask for any IP or chat ID , i had chat link. Admins contact me it means a lot to me.
is it normal for it to work so long?
Yes
Thats insane
The magic of this platform
Reload the page to check if it is still building, as sometimes it stays like this permanently and you have to refresh the page to see the actual progress
Or, so i just wasted my 30 minutes
xd
Thank tho
Yes, that's normal
If you get used to it, over time you won't care and you won't stress so much
If you see that it is taking a long time to build the project, reload the page
Hello LMSYS team. I had an incredibly important conversation with Gemini on arena.ai but I was not logged in. My browser crashed, and because my session token was lost, I can no longer view the chat, even though I saved the URL. Can someone please pull the text log of this chat from the backend for me? It means the world to me.
https://youtube.com/@satyamkavlogs?si=FzNYMPFTycd1nIH9
Thanks you Support this channel alsošš1k Goalš help me to Achieve my Dreamššš
I can provide chat link,IP,Browser,OS/Device and timestamp to proof ownership
Which model
Mader
claude opus 4-6 thinking
oh, why??
Because lmarena has put a limit
so what do i do?
I was surprised opus works for that long in code arena
Please do something about this unlimited generation currently I can't send use any model
Bro why I can't download this document ??
#1441588701472882759 fill out this form if yall don't mind
we need this feature please š
Yeah unfortunately that isn't going to work. Will flag to the team if this is something we can fix.
Behind the scenes we are working on changes that should help with this bug. In the meantime, would recommend you try out the steps in this article. Would also try logging out/back in, I've seen a few mentions this helping.
guys i found how to fix endless generation bug
you just need to f12, copy active button from another chat, then copy+paste it instead of inactive button on original one
@echo aurora sorry to bother ya
The website you've created is just massive ngl probably must've helped tons of folks out there has never ever loved a website in my life all those are freaking paid that's why I love the aiarena thou love ya guys appreciate it so much š„š
Appreciate that!! 
Easter eggs!
This article should help - https://help.arena.ai/articles/1544829667-how-to-create-videos-with-video-arena
it just switches the thinking models
into
@echo aurora brother where is grok 4.20
Dose anyone knows seedream 5 light is it good or what did they change compare to version 4 ?
Oh yes finally grok 420
@echo aurora What image ai is this for images? Cause I searched it up, and found nothing, I then tried to use it for direct chat and side by side, and could not find it there. It seems to be some new or anonymous ai that can only be accessed in the arena mode if you are lucky to get it.
where is grok 4.20? i cant find it on the arena.
helloo
noice, gpt 5.2 deserved coming above gemin 3 pro grounding in web search
hi
it is an anonymous model so pineapple won't be able to tell you what it is but from what I've seen it's most likely gemini 3.1 flash image, the direct replacement for nano banana
I checked a few hours ago and it's not on arena as a visible or stealth model, so it's likely still under a codename
Grok the only one that returns actual unbiased facts & search results instead of policitically inflicted hypocrisy-who wouldve guessed
There are many battle models that arent selectable lol
mechakitler
Lol
Its sad how there are stealthily models on arena that we dont get to access otherwise
I see
Yeah
Why cant u still upload images if using claude....
Cuz claude blind
not on the actual website thoo
Maybe it uses a subagent like other deepseek models
I don't see grok 4.2 in list.
Its in battle only like 3/4 of models
How to fix infinite generating problem
Thank you for describing what LMarena is
Now it's just called arena
It's the best AI site for me because it offers all the paid templates for free
Hehe knew it. But it being first is very surprising
(Given how bad my personal experience with formatting was with grok 4.2 search)
Is grok 4.1 thinking not working for everybody, or is that just a me thing?
Something went wrong. Please try again later.
does any one having this issue.
**Connecting to Arena has failed. Please try again later or on a different device.
** or Infinite Captcha loop .
Still not released yet and you can try it on grok.com website
Genuinely
Elon trying to find the one line of code that keeps grok woke
The "Value" Tier List
Ranking them by Return on Investment (ROI)āessentially, what gives you the most usable content for your time and money.
- The King: Gemini 3.1
Why: It dominates. It provided the only S-Tier result (production-ready) at a "mid-range" price point. While $12/1M output is not cheap, you are paying for a usable final asset.
Verdict: Highest Value. You pay once and get the right result fast.
- The Sketch Artist: Mercury 2
Why: It is shockingly cheap ($0.75 output is nearly free compared to the others) and "instant." Even though the result was D-Tier (blocky), it produced a coherent, dimensionally accurate "blocking" mesh.
Verdict: Good Value for Prototyping. Use it to generate 50 rapid variations, pick the best composition, and then send that to Gemini for a final pass.
- The Money Pit: Opus 4.6
Why: This is the worst value proposition. It is the most expensive model (over 2x the cost of Gemini), the slowest to run, and it returned a B-Tier result with a critical hallucination (floating keyboard).
Verdict: Poor Value. You are paying a premium for "reasoning" that failed to understand physical constraints.
- The Waste: GLM 5
Why: Even though it's cheap, the result was F-Tier (broken/unusable).
Verdict: Zero Value. Paying a low price for a broken asset is still a total loss.
made a task to build a 3d laptop model, gemini and gpt were the judges
mercury ; gemini
glm ; opus
gemini is wow.
opus is waste of money/ temu gemini
glm is braindead
mercury is a good small model
my chat stucked here since 24hours, how to fix it? I don't want to start a new chat. is there any solution?
lol
I tested the mercury 2, my brother was the only mercury fan on planet earth before the launch of the 2, I noticed that it is full focus on text editing, it seems to be a good one for the price really
Use thinking mode
its stupid cheap and fast, but its not the sharpest tool in the shed
For simple things, it must be very good indeed, but I don't see any use for my use
rightclick on the grayed out arrow button -> click inspect -> rightclick the blue highlighted <button> block -> Edit as HTML -> ctrl + a to select all the text -> delete/backspace -> paste this:
<button class="inline-flex items-center justify-center gap-2 whitespace-nowrap text-sm transition-colors focus-visible:outline-none focus-visible:ring-2 focus-visible:ring-ring ring-offset-2 focus-visible:ring-offset-surface-primary disabled:pointer-events-none disabled:opacity-50 [&_svg]:pointer-events-none [&_svg]:shrink-0 h-8 w-8 active:bg-interactive-cta-active rounded-[4px] font-normal touch-hitbox border-border-medium text-interactive-active hover:bg-surface-raised border bg-transparent" type="submit"><svg width="1.5em" height="1.5em" viewBox="0 0 24 24" stroke-width="1.5" fill="none" xmlns="http://www.w3.org/2000/svg" color="currentColor" class="size-4"><path d="M3 12L21 12M21 12L12.5 3.5M21 12L12.5 20.5" stroke="currentColor" stroke-linecap="round" stroke-linejoin="round"></path></svg></button>
-> click on some other random spot to save changes -> type text and send the messageā
dw gpt also skips thinking and fails it
unless you hint it to think
I think the 27b is just the same thing as the qwen3 235b vl, very similar intelligence, everything very similar, but 10x smaller
qwen bad
ill try gpt xhigh extended now, on paid sub website cuz the llmarena version doesnt work for me
Both 27b 35 a3b thinking called me stupid and that I should go by car
I didn't like the 400b. Butttt the 27b and 35 a3b is really good
no way
I'm trying to use no thinking in phone but I cant use
Both no thiking said to me walk
Even 397b no thinking say to me walk
oh the 397b is noncode
i told about it in chat earlier
Dude in what I tested, the 123b a10b thinking gave me better results than the 397 thinking
this is what i love about gpt
actually thinks whether what it wants to do will work in the first place
where is grok 4.20 š
@fiery gull wtf is the gpt cooking
The deepseek 3.2v was the only one that got the crazy saying that I made it clear that the car I was going to was other car and etc
Dude I really liked the 122b a10b it is 99.9% of the 397b
5.2 xhigh web
notice how it kept the keyboard layout
even made hinges
Why don't we have a STOP button like this yet?
lmaooooo
if you ask sonnet what model it is in Chinese it will say itās deepseek š
āOh these Chinese companies are stealing all my hard work which I stole š”ā

lmao
The goat.
Xhigh supremacy>>>
šØYou can Use new nano banana on @arena
ļøļø
ļøļømodel name - anon-bob-2 in image battle mode
ļøļø
ļøļøhere are few more results
Quoting Chetaslua (@chetaslua)
ļø
šØNano Banana 2 early testing
ļøļø
ļøļøPassed this test ā
ļøļø
ļøļø> you can see perfect reflection for all different colours of apple
ļøļø> Perfect reversal of text
ļøļø> Background building reflection is also perfect
It did extremally well, but gemini 3.1 did even better (visually, but gpt put more effort like proper keyboard layout and common buttons wear)
Hey hey HEY HEY!
hallo
Just a novice but very motivated to learn
not even 200 lines of code and few retries and then it just reaches its limit š
claude opus rate limit in 2 minutes speedrun any%
What did you expect then?
Hello
it is useless in chinese
Uhh
I don't know I haven't tried and it's only for battle mode
it always generate messy code alongside the output
Well well deepseek is the code owner
Literally GLM stealing Claude stealing deepseek
What model? I don't really understand well
lmarena-rc3
Isn't that model a codename?
so it deserves to be removed
Look these random names models are a secret ai model they just use random name to hide it so arena just add these models and if the company wants to remove it then arena removes it, it might be one of these be Claude Opus 5 btw. Hopefully I am telling the right answer
something like āit alway@ generat@ messy @de alongside the @@putāļ¼ but in chinese
These are trainning models so they are still training
For example this is a random model name but alot of people on Twitter say it's Gemini 3.1 Nano Banana hopefully it is
I can't see grok 4.20 in the model selector
i expected for retries to not use quadrillion tokens or 90% rate in one retry
What do u expect, thatās one of the best coding models atm, costs a lot of money to use it
I need help
how did it appear on leaderboards
I faced with this problem
What should i do?
Wait for the limit to go away
My problem isn't limit
Is that error:
Smth went wrong
Because itās a test and itās probably been a model that was code marked
I think tte reason they didn't is because that A 4 AGENT
i see
I know I've tried it out, but I didn't like it that much, so I was hoping the arena endpoint would be slightly better in some way
It pretty much won't improve it will just be the same
did the site just go down?
You'll want to try the steps in this article - https://help.arena.ai/articles/1645798556-lmarena-how-to-something-went-wrong-with-this-response-error-message
its working now, but for about 5 minutes the site stopped loading. my browser was just infinetely loading
If I clear the data, will all my chats be removed?
If you're not logged in, yes.
I logged in
Ok
Glad to hear it's working again, keep me updated if things go sideways
Then yeah your chat history will be saved to that account
gemini 3.1 coming soon ? on ARENA ?
hlo
It's currently available.
I did it but still have that issue
bro does anyone know why opus doesnt work
no matter how much i refresh
then it tells me ive used my limit try again in 40 minutes
after not answering my prompt
I'll respond in the thread you have going in #ask-here
Bro i deleted data but still jave problem
I've responded in the #ask-here thread.
Link won't open
Pinged you in the other channel, I'd like to keep #general open for general conversation and use the other channels for troubleshooting. 
What did you expect form grok it always sucks
How this model is in the top tier list is beyond me
Whatās chat arena ? Have you ever heard of it before ?
Is the Gemini -3 pro image review not working on anything just for me today?
It keeps saying something went wrong on anything I sent there ?
Would love some help on thatš
Would check the response that the bot provided you here: #1476278002626072586 message
Yeah, I'm also getting the same problem with Claude opus 4.6
I need Claude, like desperately rn, dumb Gemini deleted my code and Claude was the only model that actually understood the context
Tried what the bot said , plus , tried switching browsers or accounts didnāt help
I did too
Didn't help
oh sh*t mb
You can already use it on CLI agent for free there is limit but it's very far
who will join the group call
i know
but
just saying
@echo aurora please fix this
Unfortuantely, if the steps in the article didn't help, it's likely an issue that can't solved on the user's end. For this model you're likely seeing a rate limit causing this as it's a very popular model.
Would encourage you to read this message: #1417174113092374689 message
This didn't help, it just told me to do the same thing you always tell us
Those are the troubleshooting steps. Unfortunately, if they don't work, reporting the better information to our team is best next option.
hello
The way to prevent it is to breakdown the requests in new chats
Or take breaks and not send as many requests within an hour, intervals. And thatās not even guaranteed.
Yeah I tried it now and got the same result. Would probably help if you forwarded 'You've reached your rate limit...' errors from Google to the user with something like 'Model is at capacity. Please try again'. So they wouldn't waste their time trying different accounts or whatever
š š®
Agreed, there are improvements to be made with these error messages. This is on our to-do.
hi, when does the arena get updated to reflect the new elo of each model?
and who maintains the arena?
Bro you might be a robot š
When is haiku 3 coming to arena š¤
jokes aside maybe we could get codex spark, or mercury2
Codex 5.3š„ŗšš»
already is
Once that triggers youāre gonna hit them in every single new message
In arena?
im a robot
why not just have the gemini to solve the capthas for you š¤
hi japancat
o/
turn this into a gif too
true
Gemini is not in the top tier list Forsure
Maybe for images, I could see why. Itās up there in the leaderboard, but it definitely does not deserve to be in the top 5 position.
I donāt know if itās capable of solving anything to be honest with you.
Iām really baffled at how high it sits at the leaderboard
have you tried 3.1?
it makes opus a joke
it even beats gpt 5.2
In code?
in thinking
in code (execution only) nothing gets close to codex 5.3 lol
like not even same leaderboard
This makes situation even more confusing
codex actually does what you ask for instead of doing whatever it wants
I think people are looking at the coding as like the ultimate metric
gemini is crazy smart but its lazy and doesnt care what you want it does whatever it prefers
opus memorized everything but its literally braindead if you give it a novel problem to solve
what you guys think is the best ai for coding like html or java
if you dont know what you are doing? gemini + codex
if you do, just codex suffices
This is what makes these benchmarks and the leaderboard are confusing
leaderboard is sponsored lol
and even then people vote only by looks
I donāt know what it is, dude
if it doesnt listen but still makes something pretty, people prefer that
Yeah
Are you sure about that?
But this could be a little biased
why does nano banana pro keeps saying error
opus is one of worst high-end models if you actually test it
:)
opus is really good imo
Dude, you know what it is
Itās probably that the compute
They give us much more water down versions
guess which one is opus, which one gpt, and which one gemini
opus is just braindead if you give it a novel task, and sonnet 4.6 was trained on deepseek š¤£
the really messed up is glm, but the bottom right white with keyboard going out of screen and logo sticking out of screen is opus
Don't be so sure about that
For some things, gemini couldn't solve the problems of code, but opus could
heres gemini summing up opus
i used grok 4.2 to automate the scoring of a psychological test, the MMPI-2
also i had a bug that i spent 2 weeks 8 hours a day trying to fix with opus 4.6
gemini found it in 1 prompt
it was pretty good
This proves my theory
grok is pretty bad but sometimes gets to levels of 5.2 or opus
mainly when opus or 5.2 mess up
Letās create more speculation than it solves anything
It turns out to be a popularity contest more than it is a capability
Which undervalued the capabilities because Iām some of these models shine in certain areas
heres an old model vs a 10x more expensive less than month old opus
Which sadly arenāt measured and how will we ever know those if weāre just focused on the standard
I donāt think this is something academic could solve
every time it hears car wash, opus says "drive", even if the scenario is inverted
It has to be somebody from the bottom with a fresh perspective
same for many other riddles
A really evaluation test needs to be from the people like relatable
Like in the real world where people struggle, and with what they struggle lol
I mean these benchmarks are cool for enthusiast and researchers
Gemini 3 pro-image is not generating anything the only output is "something went wrong with the response, please try again"
It would be meaningful if model evaluations captured the everyday experiences of regular people using Llm the ones without large platforms or influence and reflected their real frustrations. There should be a way to measure performance that highlights where models consistently struggle, not just where they excel on benchmarks.
And you know, itās the most ironic thing as recent we havenāt heard much of the term āAGIā been thrown around lately. š
give original video ill show
remember gemini 3.1 is like the only good model from google (gemini 3.0 before all nerfs was good too)
Thatās the movie itās supposed to say
100 views š
theres no way it was in the training data
i doubt it can figure out based on literally snow
This is what Iām saying. Itās not like the model is stupid. When you nudge it.
It guessed based on just the text
Makes sense
yeah its THIS good
(pure new chat btw)
it can guess the movie just based on ww2+ snow
snow alone is not enough
Yeah
(without the video)
i made a fun experiment with gemini 3.0 few months ago
i had a pic of a house i took by accident
no landmarks ect, just a normal house/building
i put that into gemini
it found it to 5 metres
š
it mightve been on google maps street view but still thats insane
Im not saying these models are outright stupid
Iām just saying, I donāt think that leaderboard accurately reflects. I donāt even know what Iām trying to say.
actually
not even need image xD
Actually, this is a good benchmark. We should try out with the other movies.
ai might be closest to "magic" we will ever get tbh
and youtube might be the closest to a "time machine"
Nawh biology by far is far more mysterious and far more magical
mind reading is better
and mind reading without sensors is even crazier
That was great ššš
All of that stuff is crazy, dude
there are already ais that can read thoughts from mri scans, but good llms can almost do that without any scans or such
But yeah, I hear what youāre saying. Definitely for sure. It feels like magic.
ww2 + winter snow is NOT enough to guess it
I donāt think itās magic. I just think that weāre really predictable.
it guesses by the way a human says it
like
human subconsciously spells it in a way that hints for the specific movie
and the llm's wages contain some of those patterns
Weād have to see behind the scenes to truly know
its more like
not the movie contains winter and ww2
its about why when asked to describe the movie
the human picked ww2 and winter
and not for example a plane
or a gun
You sound like a salesman
im js excited
ai is underrated
like
ai is basically a solution to every solveable problem
by definition
O.o
if there is any pattern, ai can be trained to find it
if a dog is smart enough to have a conversation
you can train ai to translate
by definition
thats how impressive it is
Yeah, but in the physical you have entropy or pure randomness
everything has randomness
but if a dog or a monkey can say (or even think) "give me food" or "i want go outside"
you can see that using ai
its not really feasible rn to do that but its very much possible
AI š
I like the word you used earlier magic
if ai is not magic, then what is?
Its basically a database no?
It just finds info on a database and pieces it together
ai is a way to solve literally every solveable problem without knowing how
you can solve any problem by throwing money and compute at it
using ai
What's 0/0 then
Tell then
I donāt think the AI is gonna be smarter than humans
lol
define smart
pattern recognition? then it already is MUCH smarter
Humans
oh like causing wars and killing millions because politican in other country insulted you?
0/0 is undefined
depending on context
It won't be in terms of memory storage
In terms of remembering things then yes
So is gonna be conscious or not
can you read? it depends on context
AI cannot be conscious
Hello!
Hmm
Humans cannot either :)
consciouseness is a made up thing
if it isnt, then define it?