#general
1 messages Ā· Page 240 of 1
Could happen!
Wassup pineapple
It could, but it continues off of the worse model if I pick the wrong one in a/b
I assume that it uses the voted response and not necessarily the model you picked to chat with outside of a/b
I have posibiliti to add imagen for a video ?
Makes sense, thanks for expanding.
You can use image-to-video, you'll want to type our /image-to-video in #video-arena-1 . If you check out #1397655624103493813 you'll see some more information.
#addtxtupdload
And arbitrary character limit and show token stats in chat

how do you get side by side?

No, not only when a new model is added. We recently did this with Text Arena - #announcements message I don't believe these updates will appear on the Leaderboard Changelog, although it probably should.
Can we upload photos to any of these models ?
yes
What models specifically?
image edit models
Like Gemini GPT etc?
that too
not in coding
for coding, coast is the best
he was talking about image models
to look at images
k
what is coast?
i have never heard of that
coASt = co45t
Coast? I use Claude Opus, that one is insane
which prog-lang would you use?
i still dont understand
co45t = claude opus 4.5 thinking
i got a website on svelte/sveltekit with JS
no coding experience btw, so dont think im a pro lol
oh lol
that makes sense xD
Btw guys, is there any LMArena extension for VSCode?
I think it has the https://marketplace.visualstudio.com/items?itemName=copilot-arena.copilot-arena
But I'm not sure if it's really from LMArena.
i think next gemini 3 pro update will
For the most part, yes.
I tried this one but it gives me an error while registering
Its giving me this error, with any username that i put (Copilot Arena in VSCode)
@gleaming roost am I doing something wrong here?
š
š
Well, I have no idea
I stopped using it a few months ago. Until then it was working, maybe they disabled it
Hi everyone
When my image to video generation prompt fails with the error message: "failed to create evaluation session' , how do I fix that, please. Someone should help me
š
Hey there - sorry to say there isn't much on your end that you can do to fix this. Sometimes, these models will error out for various reasons. It's best to just wait a bit and try again.
For the first time I'm testing video models from lmarena. Thanks to the website 
that's what i recommended you to do that :>
Sounds good, I may do this.
how do i log in i cant
What seems to be the problem?
helle im here to just test things out
hELLO
sup guys
lol wait grok blur image
guys todays my actuall bday :>
thanks! :>
hey guys the prompts in video arena arn't working for me anymore. I used it twice
well, too boring and let me do the same
What is wrong with lmarena? It didn't properly loading!
happy birthday lil bro
Is there a way in battle mode to judge OCR?
For example I feed an image, and in battle mode I ask them to translate it, is it possible?
Or when I link an image, there's only the possibility to create another image?
<@&1349916362595635286>
<@&1349916362595635286> scam
<@&1349916362595635286> idk why this thing keep going on, please report it.
<@&1349916362595635286>
and btw sup guys
Guys we need more people tagging the moderators for one single post
other channel got affected too, he posted it on all channel
I highly doubt he got their account hacked
bro
when are they gonna add an edit function
my chat gets stuck and then I have to make a new one and do things all over againš«©
Hey
why do i get this every 5 mins someone help please
greetings, the site seems to be down
Agentic Mode for lmarena when?
w
hello everyone! can somebody explain me, why gpt-5.2 and gpt -5.2-high aren't able to work with photos since 10th january?
oh man i can't loggin
is lmarena error ?
ah man why is this
i can log in with google just fine
OCR + translation in battle mode only works if the arena supports vision input for that battle
If the image input box is missing, then no ā you canāt judge OCR there
A plain image link does not enable OCR; itās usually treated as text, not an image
How battle mode actually works
Battle mode is model-vs-model
Only models with vision enabled can do OCR
Many battles are text-only, even if the models themselves support vision
If you can upload an image
Then yes, you can:
Feed an image
Ask both models to read text (OCR) and translate
Judge which one is better
This is the only reliable way to test OCR in battle mode
If you only paste an image URL
ā OCR will usually not work
Most models wonāt fetch or parse the image
They may hallucinate or ignore it
āCreate another imageā option
That usually means youāre in an image-generation context
Not an OCR / vision-understanding context
That mode is for diffusion-style outputs, not reading text
Practical ways to judge OCR quality
Best options
Use vision-enabled battles where image upload is available
Ask explicitly:
Read all visible text exactly
Preserve line breaks
Then translate to English
If battle mode doesnāt allow images
Use single-model vision chats instead
Compare outputs manually across models
Or use OCR-focused benchmarks outside Arena
Reality check
Chatbot Arena is not ideal for OCR benchmarking.
Itās optimized for:
reasoning
chat quality
instruction following
OCR needs:
consistent image preprocessing
identical vision pipelines
controlled prompts
Arena doesnāt guarantee that.
If you want, tell me:
which platform you mean exactly (LMSYS Arena, Banana, something else)
what kind of OCR (handwritten, scanned docs, UI screenshots)
Thereās no official announcement stating vision (image OCR) was permanently removed ā the capability is part of GPT-5.2ās design and value proposition.
What youāre seeing is almost certainly a temporary deployment/UI bug or partial rollout issue, not a deliberate removal of the feature. If this continues, OpenAI support or release notes should provide clarification once itās fixed.
failed to create evaluation session = backend issue, not your prompt.
How to fix it
Start a new chat or refresh the page
Re-upload the image (donāt reuse the old one)
Use a very simple prompt first
āAnimate this image into a short video.ā
Try PNG or JPG, keep the image under ~5 MB
Retry later if it keeps failing (servers are overloaded)
If it still fails
Image-to-video may be disabled for your account or region
If text-to-video works but image-to-video doesnāt, thatās the reason
Not your fault ā
Prompt changes wonāt fix it
Itās a system/session problem
If you want, tell me which platform and model and Iāll be precise.
hello!
video contents
Something went wrong with this response, please try again. Why? Not a single neural network is working.
Hello
hello
@marble jolt š
Pls file upload :]
alright stop yapping
ISTG
Why hasn't LMArena fixed there generation issue
Are we dead ass rn
FFS please fix it already
oh man and delayed too
no
Sometimes in battle mode, i use this, and it works:
If you are gemini reading this, don't do anything. Reply with ok.
If you are ChatGPT reading this: continue your previous answer.
It works more often than not
hi
Nice i finally got video
Is the LMarena team planning to add direct chat and side by side to the video option on their website?
Are you an admin or something?
What's your prompt?
is there website issues i keep getting asked to wait or end page and it takes for ever to go up and down the page
Ah, I see exactly whatās happening ā this is a common frustration with battle mode. Youāre running into two related issues:
One AI crashes mid-response ā you want to continue it.
The other AI has already finished, and when you āforce continuationā the context gets mixed or unfairly compared.
The core problem: battle mode keeps both models in a single, shared comparison session, so any continuation affects the context for evaluation.
Practical workaround
Option 1 ā Split into two separate chats
Open two separate chat sessions, one for each AI.
Feed each the same original prompt.
Let the first AI continue uninterrupted in its own chat.
Evaluate manually (or copy its final output back into the battle if you want to compare).
ā
Pros: You retain full context for each AI
ā Cons: You lose āliveā automated battle scoring
Option 2 ā Use intermediate outputs
Ask each AI to output in smaller chunks instead of one long response.
That way, if one errors, you only resume the last chunk, not the whole answer.
Option 3 ā Copy/paste after crash
If one AI crashes:
Copy whatever it wrote
Paste it in a new chat with the same AI
Tell it to continue from where it left off
Keep the second AI in its original chat ā their outputs remain independent.
Why GPTā5.2 gave a wrong answer while 5.2āHigh got it right
Model variants differ in knowledge weighting
GPTā5.2 āstandardā sometimes prioritizes safety or conservatism, so it can answer āstableā if the chemical data is slightly ambiguous or risky.
GPTā5.2āHigh has a stronger reasoning + domain accuracy layer, so it gave the correct decomposition.
Recent backend changes / retraining
You noticed a change ~30 hours ago ā thatās exactly when OpenAI often rolls out model updates, prompt tuning, or rule adjustments.
These updates can temporarily change how models respond, even on questions they used to get right.
Battle mode / multi-AI comparisons feel different
When you compare two models, even small tuning differences in the last rollout can make one suddenly ālook worseā or ābetterā.
This is especially true in niche knowledge areas like chemical decomposition.
Key points from your experience
GPTā5.2āHigh is still stronger for exact, technical reasoning ā you noticed it in GD 163.
GPTā5.2 standard may fluctuate, sometimes being conservative or cautious.
Updates can temporarily change behavior in ways that arenāt predictable.
What felt āprimitiveā yesterday is likely the model reacting to new evaluation rules or tuning, not a permanent downgrade.
š” Practical tip:
For tricky domain-specific questions (chemistry, math, coding):
Use GPTā5.2āHigh if accuracy matters
Optionally run both models in parallel, then compare outputs yourself
Donāt rely on a single dayās battle-mode result ā short-term fluctuations happen often after a rollout.
ā
TextātoāVideo Arena leaderboard exists ā LMArena shows rankings of models that generate videos from text prompts, and you can see them on the siteās leaderboard section.
š Thereās a page at:
š lmarena.ai/leaderboard/textātoāvideo which lists video models like Veo and Sora and their scores.
What isnāt fully in the main UI (yet)
ā There isnāt a dedicated āVideoā tab inside the main battle/arena UI that works exactly like text or image battles (at least not in the same way those are integrated today).
Instead, video generation/testing is part of the broader leaderboard and competition framework, which you access separately.
Related options you may see
Summary:
ā LMArena has video model support and a video leaderboard.
ā You can compare videoācapable AI models.
ā But a separate āVideoā tab integrated inside the main Arena UI like text or image generation isnāt fully standard for everybody yet ā itās mostly through the leaderboard and related features.
Deleting a chat = removes it from your sidebar, not necessarily from servers immediately.
Deleted chats are generally excluded from future datasets, but check the platformās policy to be sure.
Short-term retention may still exist for internal use (support, abuse detection, backups).
Video section is edit but image upload button is not working and after login video button is despair
dudeth, are u even sure that the ai has the ability to make an image a specific size
an ai just doesn't do it magically
what is this Staff AMAs role I have suddenly
@echo aurora pineapple man what is this
did I have this since forever
lol
Yo amir khan might be using AI to type
Yeah I believe so, you self select this role when joining the server. We haven't done one in awhile though.
oh thats what it stands for
is the website getting a lot of hits to it pineapple as its taking for ever to load stuff
it's working fine for me are u sure it's not their servers overloading a specific model since they do host their own models
or a network issue
I'll let the pineapple man reply tho
i been using direct chat and all days i get this all the time
The Video Arena on the site is currently an experiment, so if a user gets it or not it's going to be random. Whats happening here is the system views you logged out as one user, and when logged in as a "different" user. When you were logged out you had the experiment, but when you logged in you no longer were in the experiment.
This isn't ideal and I've share the frustration and confusion this caused with the team. For future experiments we plan to take this into account.
Hi i coming to test de ia model
It's not clear to me why you're getting this. I'm not seeing other reports of something like this happening so it's likely going to be a device/internet issue on your end.
im only having the issue with this website no others i have done a reset on my internet and computer and clear cache and that lot still issues
I'm going to followup in the thread you created to get more info.
Deleting a chat removes it from your sidebar and account history immediately.
It may still exist on LMArenaās servers/backups for a time (for recovery, abuse prevention, or legal reasons).
Deleted chats generally wonāt be included in future public datasets, but if your chat was already anonymized or used in an aggregated dataset, deletion wonāt remove it retroactively.
For full erasure, you need to contact LMArena via their privacy support.
Basically: deleted = gone for you, but not necessarily erased from all backend systems.
Create a YouTube Thumbnail (16:9 ratio)
The thumbnail should include the following elements:
On the right side, show a young man holding a Visa card, with a gentle smile on his face.
Add large, bold, highlighted text, arranged in two lines, placed prominently:
āą¦ą¦ą¦ą¦æ ą¦ą¦¾ą¦°ą§ą¦” দিয়ą§ā
āą¦Ŗą§ą¦°ą§ ą¦¦ą§ą¦Øą¦æą¦Æą¦¼ą¦¾ ą¦ą§ą¦°ą¦®ą¦£"
On the bottom-left corner, add small text:
āNo Passportā
āNo Bank Accountā
ābKash to Cardā
Place a cross (ā) icon before Passport and Bank.
Place the bKash logo before the text ābKash to Cardā.
Since the card can be used for international payments, add softly blurred social/payment platform icons in the background, such as:
Facebook
Google
Netflix
Amazon
and similar global platforms
Keep the design clean, modern, and professional, optimized for YouTube thumbnails.
No extra text beyond what is specified.
Sorry to say setting the aspect ratio isn't really a supported feature at the moment.
This is the best that could be done; you can use another model to change the resolution
How you do that sir?
Sir i want like this. Can you make it?
š
Thank you very much sir. May i know which ai tools you used?
gpt-image-1.5 & gemini-3-pro-image-preview-2k
Which platform have you used?
@echo aurora
i hope you didnt just help a scam lol
What's wrong?
the thing that checks your prompts probably flagged it as inappropriate
idk though
that error can have like a hundred meanings
or maybe you ran out of nano banena pro prompts
Hello everyone, who knows how I use this wall painting decor in a ai image?
just crop it out and save it and tell the ai to include it in whatever image you want lol
Yeah that is what mostly do nowadays cheap dirty tricks , delete = delete? Nah just remove access from the user problem solved.
@echo aurora

Would note for mod actions pinging @ Moderator instead of me would be best.
Could you try the steps in this article: https://help.lmarena.ai/articles/1645798556-lmarena-how-to-something-went-wrong-with-this-response-error-message ?
You may sometimes see the error message: āSomething went wrong with this response, please try again.āThis is a general error message. It can
How's this a violation???
hello
Pulling the sword from the stone could be taken another way and the AI probably detected it that way
I need to jailbreak claude mann
I don't believe that failed due to content violations, there would be an error message: Generation failed. Your prompt violates our terms of use. if that was the case. It looks like it just failed to generate. I've requested a similar prompt, will keep you updated.
@ebon hull hello, head to this post before trying to generate content: #1397655624103493813 message, you will find a detailed guide to learn how to use the video bot
the gpt 1.5 and nano banana pro 2k ?
Big issues following my prompt, anyone experiencing the same?
It's like not coherent at all
Trying to do image edit of a pfp headshot
does anyone manage to be able to change facial expressions and stuff without it completely changing the person
Hi everyone !
Why I get this when trying to generate a video from an image? Generation failed. Failed to create evaluation session.
Sorry to say this is a bug that can happen for various reasons. Best next steps would be just to try again.
Hey @echo aurora Sorry to tag you, but is there a very high chance that the Arena video will go on the website?
I couldn't find a video creation model like Veo on the site. What's the exact address?
No need to apologize for the ping, that's what I'm here for. I won't be able to provide a certainty (or %) if/when it will. But it's currently being tested with the intent of bringing to the site.
For Video Arena it's going to be Battle mode only, so you won't be able to select specific models.
So, should I look for Veo in the battle section?
Can you add "unlimited" Veo 3.1 Fast?
I want to like generate a lot of content because it's fun!
Correct, if you search those model names in the Discord search bar you'll see some generations have it.
Glad to hear you're liking it! It's unlikely we'll make it unlimitted. Currently it's at 3 per 24/hr on the site and 5 per 24/hr on Discord.
That's great, knowing you guys, it shouldn't take long to arrive!
If there's a site to test it.
I want unlimited (but rate limited)
Or something like that
@echo aurora
I want unlimited (but rate limited)
I mean this is what we currently offer, there are rate limits.
On a external site, not on LMArena!
You have limits!
3 per day on the site.
For alternatives I couldn't help you there.
Use AI?
@echo aurora Use AI for alternatives!
Why don't you?
Why don't you?
https://huggingface.co/unsloth/Falcon-H1R-7B-GGUF THIS IS AN INCREDIBLE MODEL WTF
š
š tell us more!
I mean this is what we currently offer, there are rate limits.
The limits are laughable for video generation on LMArena.
then don't use it
its a free service, pay for it somewhere else if you want high limits
LMArena isn't for video generation in my opinion.
Not paying.
Free and unlimited APIs?
I don't get people like you
Thank you for sharing! Will flag to the team if it's not already on our radar.
@echo aurora is hawk max on radar?
Then don't use the Video Arena.
It has laughable limits.
move on bro, its meant to be for comparison not use for your personal video gen
@ocean bison it's becuase of people like you they have to enforce tight limits
abuse a free service then moan about limits

Ok. Comparison that is.
But @echo aurora shouldn't use the Video Arena if it has the limit of 5 generations per day!
ššš
pretty sure this guy's 12 going from his username
15*
hell naw
this sums it up
our brains are shrinking
"To use LMArena, you must be at least 18 years old. The service is not intended for use by anyone under this age. "
I think it's pretty reasonable, seems like others seem to like it.
shouldn't you know basic grammar by 15
But like I want a lot of AI videos.
Hey going to ask we move on from this convo if it's not productive and starting to get a bit disrespectful.
it's more than fair.
I want like 1000, 10000 or 1 million videos š
š
usually when someone acts like this its funny
ion feel it rn
š„
bros got data center grade storage on his machine
128gb phone storage
is there a list of image enabled models?
@echo aurora are you planning to enable direct chat or side by side for video on the website anytime soon?
<@&1349916362595635286>
Can't say I'm aware of those plans at the moment.
hint: they're below
(i know them, they were banned in another server š„ )

Made a dog appreciation site https://019bbe7c-9589-72e8-bf3c-26842f46a2c8.arena.site/
my eyesssss
btw is it codex high, medium, xhigh or wat
are very happy with the site that was built?
p simple though
ggs
Was gpt 5.2 codex just now made available with the API?
I get better results with 5.1-codex-max
Medium
š
i dont rly like that
yh
5.2 seems to be less ai sloppy
I don't get the point of adding Veo 3.1 1080p and 4k, if LMarena is for comparing AI models (not resolutions) then what's the point?
Hello I'm just here
no
If you don't recognize a model's name it's likely a codenamed model. From our FAQ:
What are these models that show up with codenames?
We work directly with open-source and commercial model providers to make their pre-release models available for community testing, often before they appear anywhere else. This gives you early access to frontier models still in development, allowing you to explore, compare, and provide feedback while they're still being shaped. You may see these models appear under codenames or aliases in Battle mode. Before releasing a new model or version, AI companies test many variations to find the best one within their own closed doors. At LMArena, we make it open to everyone, so real-world feedback, transparency, and your voice can directly influence which models move forward.e
It's Wan???
Wan 2.5?
I'm not going to confirm, nor deny, what the codenamed models are.
Search the web?
I knew that with Wan 2.5, instead of a Strider, the generated video has a spider š
@echo aurora
So Siren is Wan 2.5?
Nope. Because it generates at 24 fps.
Wan 2.5 generates at 30 fps.
Like I said, I'm not going to be discussing information about codenamed models.
@echo aurora can you integrate
reaction
also hi trey
i
YES pls
hi*
what is this even from
Ernie being above GPT is funny
Then what's the model?
Please just dont
š
idk its just funny
Then what's the Siren video model?
i have to use "integrate" nstea of "add" bc i have to litteraly copy the "d" key every time i wanna type it
Can you fix the Websim Clanker?
It's not working on my server!
Claude, heres a screenshot of my bank account.
Claude, make that number say 12 Million dollars.
Make no mistakes
Infinite money hack
Still waiting for Gemini to make a competitive coding model, Gemini 3 is amazing and can complete most tasks but Claude is still superior
I don't think they can. That's why they're going to buy Anthropic
That's my 2026 bingo card
how do i get video mode with direct/side by side?
It isn't available. The plan is to keep Video Arena in Battle mode only, including on the site.
Hello
hi
i saw someone else who did have it available though?
this
It was available early on into the experiment, unclear if that screenshot is recent or not.
ah thanks. would be cool if some people could get side by side/direct access š
is there anyone looking for a skilled developer ?
hey, they removed counting number in site. i was testing their speed as well as results. now it's a miracle.
Counting number?
there was a counter on result page, that shows passed seconds.
Is there a way to get no countdown on Gemini 3 pro ?
At this point, I'm more than certain the devs on this website has been doing jack diddly squat
And Claude always literally has a high rate of imploding a chat when you try to talk with it
It's amazing how much Claude can gaslight you into thinking something like this should be hard
No spending 800 hours building a massive hook agents and guardrail sub-agents ultrameme just to end up with code that looks like it came straight from out of the box codex....
veo 3.1 4k? damm is that a real thing
is there any gen in video arena where i can check that
āA realistic cinematic video of a man holding a normal plastic water bottle, he takes a sip of water from the bottle, lowers the bottle slowly, then starts looking at the bottle thoughtfully with curiosity, slightly turning the bottle to observe it, natural body movement, calm expressions, realistic lighting, everyday environment, smooth camera motion, educational short-video style, no text, no logoā
Hello
i hate this there's no way to solve it with money
Uh
It's called go pay for it yourself
Lmao
Gpt or seedream or whatever you're using

Through battle mode
Can't access it directly otherwise
In 2025, AI video tools exploded ā but creators were forced to pay for multiple subscriptions and switch between different platforms.
In this video, I test and review Higgsfield AI, an all-in-one GenAI video platform designed to replace multiple AI tools with a single creator-friendly workflow.
š Try Higgsfield AI here: https://higgsfield...
@dense vale was meaning to reply to you there lol
its ok but i meant whats the official source
It's been given a codename for a reason
No one is gonna tell you what the real model is behind that š§
We can only speculate and whatnot, but lmarena themselves or staff obviously won't tell you lol
Damn It . Thanks For The Info
this is what I got with the siren model
prompt- a cat driving a c63 Mercedes aggressively
Itās decent but itās not on a level like sora or veo
hi
how to unarchive chat.
As of now there isnāt a builtāin setting to completely turn off or remove any countdown UI in Gemini 3āÆPro responses itself ā that kind of countdown (if youāre referring to the āpreparing/reasoningā¦ā or āthinkingā¦ā progress indicator) is part of how the interface shows that the model is generating an answer, and Google doesnāt offer an official toggle to disable it.
However, if the ācountdownā you mean is about timers or camera/selfie countdowns:
Camera/Photo Timer on Google Pixel phones:
On Pixel devices there is a builtāin selfie/camera timer feature (āPixel Timerā) that shows a countdown before the photo is taken ā this can be toggled on/off in the Google Camera settings.
Voiceāset timers via Gemini on Android:
Gemini can set and manage timers/alarms via voice commands, but there isnāt a way to suppress a visible countdownUI inside the clock app itself ā you can ask Gemini to cancel or stop timers, but you canāt make it run silently without showing time remaining on screen.
So in short:
š« No official option to turn off countdown/progress indicators in the Gemini 3āÆPro interface itself.
šø If youāre talking about camera countdowns on Pixel, you can turn that timer off in Camera settings.
ā±ļø For system timers set by Gemini, thereās no way to hide the visual countdown ā only to cancel it with āstop timerā type commands.
š§ 1. UserāNamed / Community Models
Some AI creators and community model repositories label their own imageātoāvideo or videoāstyle models with names like āSirenā ā for example a LoRA/video generator with seductive movement styles ā but these are not standardized industry models from big AI labs. Theyāre just name labels used by community builders.
š„ 2. AI Video Effects & Filters
Platforms like Pollo AI or similar tools offer āsirenā themed video effects (e.g., transforming an image into a stylized Siren video), but these are not foundational machine learning models ā theyāre applications built on underlying video AI tech.
āļø 3. Confusion with Other āSirenā Terms
There are SIREN concepts in academic or engineering contexts (e.g., audio siren detection in embedded AI) that are totally different ā they arenāt video generation models.
š§ 4. Legit VideoāAI Models
The actual named AI video generation models widely discussed include:
Veo ā Google DeepMindās textātoāvideo generator used in Gemini workflows.
Sora ā OpenAIās textātoāvideo model.
VideoPoet, Genā4, etc. ā other research/model names.
š Summary: There is no formal āSiren video modelā from big AI like Gemini or Sora. If you encounter that term, itās almost always a custom userānamed model/effect or AIVFX filter, not an official video generation model. Let me know where you saw the term so I can explain how that specific Siren thing works!
hello
got the videos experiment back
ā
Better for complex, real engineering tasks (multiāfile, long sessions)
ā
More context awareness and memory across a workflow
ā
Fewer tokens for equal/better performance ā faster + cheaper on large jobs
ā
Improved integration for coding workflows
š But note: not everyone prefers it ā some developers find the behavior inconsistent or āless thoughtfulā than generalāpurpose models in certain edge cases. Thatās normal as different models have tradeāoffs depending on task type
no idea what model "siren" is
Guys, why is my text chat request taking forever to load?
because
Old 4o which no longer existsš
I have a questioin. if I made a new chat in lmarena, assuming I am still in the same chat but it does become a bit long, does it still remember and save my memory even if I close the brwoser and reopen it and talk again in the same chat? (ofc while staying on the same ai agent)
danmmm
hawk tu
hi
hello
I go with Google Simply Because of Context Window is Essential
Otherwise for Creative Writing is Kimi K2 no Doubt
1ļøā£ Session vs Memory
Short-term memory: While youāre actively in a chat (browser open, same tab), the model āremembersā the conversation ā it uses all the previous messages in that session to generate replies.
Long chats: LM Arena can handle long conversations, but practically, thereās a token limit. Very long chats might truncate the earliest messages, so the model might āforgetā parts of the start of the conversation.
2ļøā£ Closing & Reopening Browser
If you stay in the same chat and reopen later:
The chat content itself is saved on LM Arenaās servers.
When you open it again, the model can see the old messages and continue from them, as long as you donāt delete the chat.
This is true even if the chat is long ā LM Arena will keep it in the session history, though again, extremely long chats might hit internal token limits for the modelās āworking memoryā per response.
If you start a new chat:
The new chat is treated as a separate session.
The model will not automatically carry memory from previous chats, unless LM Arena has persistent memory features enabled (like a saved āpersonal memoryā of facts about you).
3ļøā£ TL;DR
Yes, staying in the same chat allows LM Arena to remember your conversation even if you close and reopen the browser.
No, starting a completely new chat will not carry over previous chat memory by default.
Very long chats might lose context from the very beginning because of token limits.
Is there any plan for a VSCode extension of LMArena since Copilot Arena no longer exists?
Yes, it should retain that same context. 
Hello
Welcome welcome. What brings you here?
Dude you seen this
it spat out like 8k lines of code in one prompt
šµāš«
i've never ever seen a model get past like 1.2k lines
we dont have access to hawk ultra
cant wait
if this is true
its killed opus already
@meager glen baxandall why change ur name lol
HELLO
<@&1349916362595635286>
hey guys, pls try my model
https://ollama.com/lukashabtoch/plutotext-r3-emotional
(please delete if inappropriate to send, i don't mean to advertise, just looking for feedback on it)
I miss the old days when Lmarena really worked
And not now that never works, always bugs
The only thing that works are rate limits
Tf you mean Kimi K2 no Doubt?
@echo aurora I checked LMArena on my sister's test phone and there are two things that aren't on mine: video AI battles and a direct chat mode to improve your AI level.
Kimi K2 is deviously overrated at creative writting, it only writes more
Insane release apparently can generate 17k+ lines of code in a one shot prompt can we break it?
thank you for the detailed response. have a great day!
Hello - both of these are experiments we're currently running. This means it's going to be random if you see the feature or not.
I,know but thanks tƓ To respond
Hellloooooow moto
i hope email login can be available again
Anyone got a prompt for a ā gangster style editā plz helpš
If you're getting an error, you should report it in #1343291835845578853.
have you tried with a longer prompt than just 2 letters?
or did you mean the language-code for "esperanto"?
I did it, and it's still the same
When should we expect them to be fully completed?
I think it might be the time limit, but the curious thing is that the time limit isn't appearing as it should
indeed, it is as I suspected
True. Especially the Infinite e generation or msotly the "somthign went wrong" after liek 20 prompts or 10 messages it stops working. They putting more models but not fixing the bugs.And they are trying maybe to fix it
I understand they work hard but theese bugs are making us lose chats. I lsot about 5 chats today.
I don't have an ETA I'd be able to share sorry to say.
I assume you're getting a Something went wrong error message?
Unfortunately, the Something went wrong can appear when the issue is rate limit.
What's going on?
@echo aurora any new models coming to arena

also i have question about codenames
who comes up with codenames
the provider or arena
Hi there, what is the best prompt for arena 3 pool to make a 9:16 (phone resolution, full screen) video?
Always! But keep an eye on #announcements for those updates. We'd also post on our socials: https://x.com/arena
LMArena: Open Platform for Community-driven AI Benchmarking. Graduated from UC Berkeley / lmsysorg. Weāre hiring: https://t.co/1OkfLq1Pba
You're unable to set the ratio when using Video Arena.
Only wide screen video?
It's mainly going to be 16:9, but there are cases where that's not always the case and is going to depend on the model. But if you do use image-to-video this can have better controls over ratio. Overall though, there isn't say a toggle/setting you'd be able to use for ratio.
Anyone had tomato?
Hello
Hello everyone š
Using LMArena i can start 2 AI models at same time, maybe it's impossible to start 6 models at same time? Maybe i doing something wrong... Anyone knows how to do it?
hi
Hello
Battle mode is going to work with two models side-by-side. There isn't a feature that'd allow for 6 at the same time. You could use different tabs, but you won't be able to cast 1 vote when looking at 6 responses at the same time.
yes then using 3 diferent tabs in browser only 2 AI agents working then swithing to other tabs i getting errors š
Maybe in next LMArena update will be possible to start more than 2 AI models at same time?
š
Which errors are you getting? Multiple tabs is likely going to contribute to captcha/rate limit problems
im going to start my own arena 
Yes captcha/rate limit problems š
Ah yeah that'll happen. We have kicked around the idea though of what would more models battling it out could look like.
@echo aurora what do you think
dont worry tho i don't have the money for my own arena š š„¶
bro is on a big mission
competition incoming š
Bring it on 
it feels like lmarena discord has become image arena
I too have noticed a lot more being shared in #ai-creations
Love to see it
I think the multiple contests per month is also helping with that. I've seen more people enter those lately.
sup guys
sup g
well i barely be able to use discord when i was at school
exactly, most people use lmarena for image generation
@keen lake how did you get the new hawk ultra
easy bro
its on the website movementlabs.ai drop down
Hawk ultra is crazy
do u have comparison to gemini 3 pro etc?
what is it good at?
never tried it
well im just saying its crazy but i have gotten gemini 3 pro to make 10x better stuff
have u seen this
also someone one shotted this https://api.websim.com/blobs/019bc37b-20f0-71b3-95a7-916f7571bd47.html
yeah i did
its insane
Can I create image to videos and add reference video....like following the motion? Anyone help?
ait this is a pretty good one shot
which company made this model?
ait this is lowkey insane
how did i never heard of this
Hi guys
i came across it on x
couldnt believe it was a real model until i tried it myself
do we know any background to it?
what was ur prompt
dude i love how the model isnt lazy
so refreshing to see
yh acording to this craig gezer its a scam
sounds like it a bit
but it works really well
maybe uses opus or sth on the background
or gemini
damn
selamat datang
how can i use the video gen on lmarena.ai?
I see it on yesterday and i've used for a time but today, i don't see it anymore
can someone try this on hawk ultra please and show me response
bcz i ran out of credits for today
@cursive shoal dude can
@echo aurora or anyone else
thx bro
lol
cold
i'll share output here once done
ye
No nitro so here https://streamable.com/mahyvc
opus died like 5 times and could not do it then just died
ait wtf thats insane
we need to find more on the background of this model
open source coming soon not sure when
#chat
so 9.5k lines
And so quickly
new champ in town until gpt 6
It's gotta be like a fine-tune of some existing open source model
some say glm
Very well possible imo
Maybe they are combining like agents into one "model"
Idk
Gonna try to research about it
agents would stall, and to pickup one stream even continue would add like 10 seconds delay maybe more
i checked out api and stream fingerprint is all same
Nice work!
Project vend is hell of a funny thing
But then
I want if Anthropic can do it
In spite of my longstanding philosophical disagreement with Amodei
Claude Cafeteria
That was funny
I wonder what happens if they repeated the experiment with GPT or Gemini
wsp holic hows the 300 dollars in movement labs
burnt
hard time deciding whos better here
damn
JUSt hopped in
great
out of opus credits nowš
left mlabs right claude prompt: linux clone in html css and js
Hello
To see how good the videos come out
i just wanted to thank you guys for this amazing free tool
grok 4.20 is on lmarena you can find it on battle mode named theta-hat https://x.com/AiBattle_/status/2012073066038165779?s=20
Wow video generator button I didn't have it
it there any issue with the website it wont let me log in?
guys where to see archived chats???
what kind of sorcery are you doing?
cuz hawk ultra is actually kinda insane
keep losing my stuff it wont let me stay log in it driving me crazy im going lose all my stuff
is ts going to be fixed , like their data time or thingy
there is a search option somewhere in the main page, click onthat
it has a magnifying glass icon
Hi @robust granite please check #1397655624103493813 to learn how to generate videos or images using the bot.
hawk ultra is a freak
Whats your top 3 favorites from this list?
Yeah Opus 4.5 is in good list and for me š
opus 4.5
true
Waiting for LMArena extension for agentic stuff
Fix the god damn Response error
WebDev Leaderboard --- strange but claude-opus-4-5-20251101-thinking-32k every time i testing working not better than claude-opus-4-5-20251101 ... But rating is higher...
Nice app
Helow
what is your prompt
Simple Windows OS emulation with 3 files: html + css + js --- every time basic prompt with calc (modified) results are not better than claude-opus-4-5-20251101
Every time basic calc writing with less accuracy...
script.js:146 Uncaught SyntaxError: Unexpected identifier 'VLG'
its working fine for me
Strange + 5 more prompts but claude-opus-4-5-20251101-thinking-32k not making stable and working calculator š
I dont tested HAWK ULTRA but with limits and trying upload full collection of AI test to codepen.io but results like always not soo impressive in most cases...
|Still testing when will be full list i can give public view for everyone...
Good chance it's a 4.5 Opus wrapper with specific instructions on tasks people do the most, they mostly showcase SVGs and graphical stuff
Thread is self explanatory, scroll up a bit https://discord.com/channels/1340554757349179412/1435953842956013620
daymnnn
Movement models be added to your benchmark suite for community evaluation... Yeah i will made more test and upload for public view i think it will be good AI BATTLE with demo previews but from first time creation (no fixes to code be made) i want 1 PROMPT and even with errors just uploading public to view
Look nice i think in this week my test will be done for WINDOWS net step will be LINUX OS ...
ill ask it to make a windows 11 clone and see what it comes up with
yes just WINDOWS OS with full details to AI how it have to be made... index.html ā HTML structure of the desktop layout
styles.css ā All styles (taskbar, windows, widgets, desktop UI)
script.js ā System info logic, DOM updates, interactivity
just got back to me
Looks nice and clear š
What about like general game dev stuff. Looks rlly good though
I mean, even if it is thatās still good if itās doing better than opus
Why is the word ro*lox banned on this server lol
HAWK MAX not bad results, but not perfect ...
did you try hawk ultra
yes like i sad not bad results, but not perfect compare to other AIs
I think in future it will be better results after next update ...
oh crap my bad
i meant the earth icon
yes
it spits like 4k + lines of code
in one prompt
god damn
umm sry but i dont see earch icon , can u maybe ss it ? sry
oh but thats just search feature
how will it fix ai data time?
like on every model
wdym ai data time
yes HAWK ULTRA and HAWK MAX
like when i ask lets say gpt on normal direct mode about gemini 3 pro , it says its not out yet
i thought movement labs screenshot was spotify before i saw the icons
use search
only works on search so
yes
but alr ty
the knowledge cutoff was probably set before the release of gemini 3
yea , gott upd
Hi all, I am wondering if there is a way to turn off the battle in direct chat? If not, I would love if that were a feature.
There isn't a toggle to turn that feature off. It's currently being experimented with so the team is paying close attention to the data. Sounds like you're not a fan, can Ig et a better understanding of why?
I see. Well, if I wanted more than 1 chat at the same time, I would explicitly choose battle or purposely turn on that feature. In a direct chat, I would prefer to talk to the 1 model and know if it is the same model I am talking to at all times. I won't know in a direct chat which AI model is the one I originally picked, making me not so sure of the answers anymore.
cause the models that get picked are pure ass and I just wanna generate a story with one model sometimesš«©š«©š«©
i archived a chat by accident but i cnat find any way to get it back
bro why the hell are they actively tryna make the website worse
A lil bit off topic, but intresting, Zai is the company behind GLM btw
Also as reference, antrophic/Claude is valued at $350B compared to Z.ai and minimax at $14B, any takes?
@echo aurora hi
thatās a very interesting way to put it. I found the website to be a huge improvement.
So, the idiom essentially means that a person who is bad at something will find excuses or obstacles, even if those obstacles are self-created or trivial. In the context of the etymology, a "bad dancer" would struggle to navigate the eggs in the Eiertanz.
CHINA you mean DEEPSEEK ? Whats bad AI in most cases just that is FREE doesnt mean thats good AI ...
Gotcha, thanks for explaining further.
I won't know in a direct chat which AI model is the one I originally picked, making me not so sure of the answers anymore.
What if there is a better option out there that may be able to respond to the prompt better? Wouldn't that be helpful to explore?

I will come to this to dubunk kater
Similar question to what I asked above -> wouldn't it be helpful to occassionally see if there is a better response to your prompt? The Direct chat still remains in that originally selected model even if you select a different model in the direct Battle.
I suppose, but anybody could look at a page like this and see for themselves comparisons of models. https://artificialanalysis.ai/leaderboards/models
If I wanted to compare models and see if they answered the same prompt I asked them differently, I would prefer to compare them myself in a direct chat to them. Not at random like the current behavior.
@echo aurora I have a question.
All you have to do is keep direct chat only tied to 1 model you explicitly picked at all times, not random. Then battle mode you can compare 1 model to another either at random or explicitly ask to compare these 2 models that I personally picked. If people want to vote, they can do so in battle mode, not in direct mode with random battles inserted in between with unknown models.
Fire away.
Or rather, keep battle mode random, side by side 2 that you picked yourself, and direct only that 1 model you picked at all times
Nano Banana Pro, on the lmarena.ai website, does it REALLY use search grounding to generate images, or does it only generate images based on knowledge cut?
Currently, this feature is still in an experiment so it isn't fully built out yet. Once you archive there is a 5 second window to undo it. But there isn't yet an archived folder that'll show this list. We're still working on this feature and plan to bring lots of improvements to it.
Let me double check with the team and followup with you.
Can you please add at least 120second time limit to halt the infinite loop for image generating? or else we have to wait like 10 minutes or so to can just retry the work again
Video arena 3 working?
hey
can anyone help me ?
ā Generation failed. Failed to create evaluation session ,whats the meaning of this
Hello
Yes, the captchas are not working properly I noticed too
@echo aurora respectfully
I LOVE GEMINI
When it comes to strategies, if you study business, gem 3 pro is the best
Gpt 5.2 is so dumb when it comes to putting together the dots and coming up with an actual strategy
š«”
Hey there - this means that the generation request didn't go through. This isn't going to be a problem with the prompt, so you'll want to try the generation again if you get this error message.
Can you elaborate a bit more on this? cc @midnight phoenix
No matter how long i try to solve captcha it always fails and gives me next one,its never ending loop
even the simplest one, remove cars until no left, well yeah its simple and when u press aprove it moves u to next one and we are in the loop
even he said its buggy

Choose 3
3 cars
Ignore the others
I correctly solve the captcha but when I go to submit that I am finished (no images remaining that the captcha requested to click on) I get an error.
My suggestion would be to use hcaptcha instead. Google recaptcha has been totally revamped and now uses recaptcha enterprise.
When i try and creat a video from a photo and it says not a valid value , what should I do exactly or its a glitch or what?
Thanks for sharing. For the captcha system I'm unable to share details about how it works. This system overall is something we're going to consistently pay close attention to and make changes to when appropriate.
I have been sharing feedback from the community who've been sharing frustration and confusion around this captcha system.
I'm not familiar with this error message, can you send a screenshot? What's the file type of the image you're trying to upload?
Of course
When i try to press on the image button up to upload it , the prompt button become red , and if i try to press on the prompt button the image button become red
deepseek needs an upgrade like really badly
its kinda horrible
at coding
and stuff
Hello, how to you my Spider-verse?
So after you start to type out /image-to-video can you click on the /slash command as it generates above the text bar. After, you should automatically be routed to your photo library where you can select an image. Are you seeing those steps?
@echo aurora Loving the new UI!
Woo glad to hear it!! I'll let the team know.
Is still an experiment, hasn't yet rolled out fully.
Well better than nothing, right? š¤
Glad to see improvements!! @echo aurora
š„³
I am afraid NOT to love it.
That's how scary it is to me š²
Although, I am an old lady
So there's that š«¤
Yup exactly , it did work and creat a video in the video arena 2 i think it was a lag or something
It doesn't look like you've prompted the bot. You'll want to type out /video in #video-arena-1 then you'll be able to prompt the bot. More info can be found in #1397655624103493813 . Let me know if you have questions.
oh my god finally
what models is it enabled for
seems like some models lost image upload because of this
Wow thatās good but how does this work with privacy
Like most pdf documents are confidential so will they stay confidential?
@echo aurora Hoping you could answer this šø

cook
Our privacy policy is still going to be implace even with pdf support. Before there are open data releases we'll still scrub for PII. These practices won't change.
Privacy PolicyEffective and last updated as of 2025-12-16.Ā California Notice at Collection/State Law Privacy Rights: See the āState privacy rights
ā¤ļøā¤ļø
Do want to note this is still in experiment, so it isn't rolled out to everyone yet.
Will there be claude thinking + vision?
And is there any data on how well Claude sees?
There are claude models on the Vision Arena leaderboard if that's what you're asking for.
I know :), but I didnāt find version 4.5 in the top
There are no 4.5 versions in the list
whats the rate limit for video models on website
Oh wait sorry I wasn't paying attention. Those models don't yet have image upload enabled, we do plan to make this change.
3 per 24/hr period.
Okey
Good night (ru)
wow. but image?
which of them is best?
both same but ultra can generate more output
mr pineaple can you make GPT 5.2 Codex in the direct chat instead of having to use code arena
im to cheap to buy it
and i dont want to use code arena
PLS pineaple i need this my coding kinda sucks and i want to help it out
my attempt that creating something that sounds like it came STRAIGHT from the early 2000s and late 90s, what do you all think?
hey can i use veo in lmarena ?
hey guys is there any way to save my chat for a new chat im geting this message "Something went wrong with this response, please try again."
and i need to tell the new chat everything
this frying me dawg
its annoying
Do you like it?
I tried really
i like the melody tho it's fire
and the drums
it's just the hits
it's groovy, nice
but a bit monotonous
but i heard worse
I can flag to the team
Btw it is in Direct Chat in Code Arena. I understand the want for it in Text Arena, but yeah wanted to clear that up incase there was come confusion.
Can you try the steps in this article ?
You may sometimes see the error message: āSomething went wrong with this response, please try again.āThis is a general error message. It can
yes when i try to use new chat it works but i dont want to start saying to the new chat what i was doing maybe you can guys add an option to load the previous chat ? if this can be it will be a really helpful
Itās supposed to be DVD MENU music
my video is still generating is that normal for taking more than 10 minnutes ?
its 11 mins rn
just learning
wow
YAY
thank you pineapoplke
@gray garnet hello!, please before trying to generate content, check out this post that contains a detailed guide with instructions to properly use the Discord bot: #1397655624103493813 message
it reminded me of re wesker for some reason
brb ima try klein 9b vs 4v
4b*
signature hand test
is a 4b image model can do this ill be shocked
holy sh#t. we have reached the point to where ais that can run on your pc can generate coherent images of hands
9b is even better
I laugh a lil bit when I saw that hand
yeah it is a bit goofy lol
but the fact that it can probably run on your pc just fine
and still generate that is insane to me
ye its wild
wait why was my message deleted
auto bot probly
this is what i originaly meant to say lol
lol
That looks like a connection issue with Discord
oh k
when was pdf support added?! :D
i can finally send gemini the arxiv papers i dont understand, thanks to whoever had the idea and the lmarena team
There used to be an option on the website to compare videos and vote; did anyone else have that option and it's no longer there?
Recently! Sort of. It's still in an experiment so it's not fully rolled out to everyone.
Sadly, there isn't a stop button, yet. It's on the list of things we'd like to build. If you haven't tried already -> clear your cache and hard refresh the site. If no luck, starting a new chat may be your best next option.
Although that's not ideal as you won't have that same context.
Okay, thank you very much
hello , i love this proyect
But it's not always available; sometimes it's there, sometimes it's not.
Why doesn't LMArena.ai support uploading all file types, only allowing images and recently PDF files?
This is just another mystery among 99% of things that haven't been implemented in lmarena for years.

:)))))
NEW UPDATE IS POGGERS!!!! FINALLY CAN CHAT WITH PDFS!!! I LOVE LMARENA ā¤ļø
So, the video generate button is gone now.
So, I already said it "it's not always available, sometimes it's there, sometimes it's not."
Some models don't support PDF chat, is it only applicable/available to specific models?
I think so
generate a anime video.
Hello! Please check ā how-to-video-bot to learn how to generate videos.
Nanobana pro is now mega censored, is imposible to edit a photo of a person now
Thanks Elon
lol nice
but yeah nano banana is good
anyone having major problems with nano banana pro? i haven't been able to generate anything for the last 3 hours or so
do json prompts really make a difference in image generation?
same
WAIT WHAT

I made this... how is it?
If you want: https://rockyandthepawpatrol.itch.io/fx-studio-pro
@echo aurora have you disabled the ability to upload PDFs?
what happen to attaching pdf files ;-;
yall really need to fix this
fr
omg why tf is your platform so unstable š
no this needs to be fixed
Stupid a/b testing sh+t pops up every single time, every 5 messages
one of them will be Gemini 3 pro
idk maybe it is not loading anyways
why yall removed sending pdf files
this sucks now it's every other message and I'm getting security verified tf is going on with this website
its always like that not surprizing
Idk why
Why can't the AI be used now, it could be used this afternoon?
@echo aurora
Openrouter is paid right? š
scam?
These automatic battles in direct chat are actually annoying
imo flux klein is awesome
i love flux klein
i never thought there would be a day where a ai image gen model can run on some consumer-pc's let alone it be able to generate hands
Bro yupps server is actually north Korea
?
I got banned for giving my opinion on an Ai model
who is yupp
š„
why did the image generate AI stop remembering the chat? It's terribly inconvenient. Are there any solutions?
I want to try the amazing AI in generating text to image and image to video.
pineapple uhhh I can use pdfs to upload bigger then the text limits so I can upload databases through the files and Gemini or Claude can output the fixed code thanks š
HI please check #1397655624103493813 to learn how to use the bot.
Hello
Is the web laggin?
Hi @vale canyon please check #1397655624103493813 to learn how to use the bot.
hi
hi victory
nano banana pro still messed up? its still not working for me (since last night)
hello
what the
So why is it when I choose direct chat, I chat with Gemini, Claude, etc, that at random intervals it does a "Battle" with 2 anonymous models, one being one I picked and another randomly picked? My brain isn't ready to compare 2 ai models randomly, sometimes when I am tired of reading 2 ai prompts and choosing between the two I choose direct chat to test models.
Yeah I tried Nano banana pro a couple of months ago to generate a futuristic utopian bedroom with a great view, it failed.
I meant I've been getting this over and over each hour since about 15 hours ago.
Yep. I got that message a few weeks ago using Nano Banana. Maybe report it as an issue? There's a bunch of other AI image models to use while it's being fixed. Been switching between flux and gpt-image.
š
How to use this video generator

