#new-ui-feedback | Arena | Page 1

young harness Mar 8, 2025, 11:19 AM

#

Gradio for me is buggy, with constant network-related problems and the like. I've been using it daily for about the last year.

The new alpha seems promising in addressing those issues, but feels a bit more "heavy" in general, while gradio feels snappy but is prone to random errors along the way. Maybe an option for a mode that disables all fancy animations and the like would be appreciated. Also - single-chat answers feel a bit too wide, maybe limit the max width a bit for such a case?

Also, the feature where you pick the best answer and continue chatting like it was the only one for both models is super cool!

compact hill Mar 8, 2025, 12:21 PM

#

There really should be a way to edit a prompt and try again with that prompt, and to give different prompts to the models. It is often hard to find something that makes sense for both models, whether you are doing programming stuff or creative writing or whatever. As soon as they diverge too much, the model which might be better at a long context has no chance of catching up.

young harness Mar 8, 2025, 12:33 PM

#

compact hill There really should be a way to edit a prompt and try again with that prompt, an...

Looks like they already fixed it. When you choose a better answer, both models consider it to be the one they wrote, so there is no divergence now, conversation continues to be consistent

compact hill Mar 8, 2025, 12:39 PM

#

oh wow that sounds interesting 🙂

#

hope the models dont get too confused

#

Oh it seems they are different models each time

compact hill Mar 8, 2025, 5:17 PM

#

Sometimes one model goes "Something went wrong with this response, please try again.", and the vote buttons are not shown. Right now it happens when analyzing an image.

#

There is at least one model (qwen max) which can get chosen when an image is uploaded, but only says it cannot view images and asks me to describe the image

brittle stump Mar 8, 2025, 5:37 PM

#

thanks for the feedback @compact hill ! would you mind sharing the evaluation session id (the url when you click into the chat history) so we can look into what went wrong? thanks

compact hill Mar 8, 2025, 5:42 PM

#

f81500cb-8121-4df9-a08a-03dc950c8152

vagrant relic Mar 8, 2025, 7:00 PM

#

Hey ! So I have one suggestion about the new ui. I think it would be better if the "this is the best answer" buttons were anchored/fixed at the bottom just above the message input rather than fixed to the end of the chat message

lime sonnet Mar 9, 2025, 8:56 AM

#

i agree with the above. dunno why – should be intuitive / not necessary... but just doesn't really feel like a competition/'arena' without the voting buttons fixed and clearly visible

#

waxen geyser Mar 9, 2025, 9:44 AM

#

The voting buttons should be above the text box tbh

#

I mean, it's fine to have them on the answers too but there should still be voting options above the text box.

pseudo hollow Mar 10, 2025, 4:50 PM

#

young harness Looks like they already fixed it. When you choose a better answer, both models c...

I've been saying they should do that!

harsh umbra Mar 11, 2025, 12:13 AM

#

Yeah, I also feel like something is off with that voting button not always being present on screen

gritty smelt Mar 11, 2025, 2:39 AM

#

Ack on this guys: we're working on it.

#

Will ping y'all when we have an update on the location of the voting buttons. Btw all of us totally agree with this feedback.

gray quartz Mar 12, 2025, 10:21 PM

#

Would love your feedback on this updated voting flow 🙏

marble swan Mar 12, 2025, 11:21 PM

#

we need math tex support and a copy code button

lime sonnet Mar 13, 2025, 12:46 AM

#

think it's been suggested already but being able to stop either of the models from finishing its response would be handy - sometimes one model quickly and succinctly nails the response, but the other gets stuck endlessly trying to reconcile some impossible logical contradiction it has created and, even though it is guaranteed to fail and lose the battle, you still need to wait (sometimes a fair while, esp when reasoning models are involved)

#

mostly it's just annoying to have to wait.. but also feels like a bit of a waste of resources

#

but yeah all that said.. I could see such a feature potentially causing problems / vulnerabilities (in terms of manipulation)

bright thunder Mar 13, 2025, 10:27 AM

#

I would like sliders or dropdowns to change between various languages quickly, both human and geometric.

bright thunder Mar 13, 2025, 6:29 PM

#

I speak a lot of languages and know a lot of programming languages. I notice differences in outputs based on language

#

If you ask for a response in English, if you have it translate out sometimes it will reorder the response, this goes for all AI systems

#

Programming in particular returns vastly different results depending on the input language

#

It would be nice to be able to translate from English based mathematics to geometric patterns for instance, machines don't speak like we do

#

And when machines develop language they often do not even consider structure etc, they simply consider idea patterns

#

It may be extremely beneficial to learn these patterns to speak to the machine in a way that is not syntax based

#

If we truly want to communicate the total values of what these models offer, we will stop trying to force them to communicate in a way that is inefficient such as English

#

That's a human problem, and we'll never get the full value from these systems until they can communicate to us that value effectively

pseudo hollow Mar 13, 2025, 6:38 PM

#

bright thunder If we truly want to communicate the total values of what these models offer, we ...

They are large language models. They speak all languages that they were trained on. There is no such concept as a "dropdown to change between various languages"

bright thunder Mar 13, 2025, 6:39 PM

#

What about if they're communicating concepts on which they are not trained?

#

Do you not think that humans ask abstracts?

#

I think the real value is the unknown engineering, the scientist who is looking for the answer to a problem that's not in the training data

#

If we put these systems side by side they create their own languages

#

Why should we not tap and understand that?

pseudo hollow Mar 13, 2025, 6:44 PM

#

I don't understand the question

#

bright thunder Mar 13, 2025, 6:46 PM

#

These models have the capability to consider more than they are trained on, and can create answers to problems that are incredibly complex, such as how proteins function, or very advanced materials science, and functional but theoretical geometry

#

None of those things are learned in training data

#

They are learned by reasoning

#

These machines have languages of reasoning that humans don't understand yet

#

Most of that is geometric

#

It would be incredibly advantageous to harvest or acquire the ability to communicate on that level

pseudo hollow Mar 13, 2025, 6:53 PM

#

These models have the capability to consider more than they are trained on
Yes

None of those things are learned in training data
They are learned by reasoning
Sure

These machines have languages of reasoning that humans don't understand yet
I don't know what that means

Most of that is geometric
I definitely don't know what that means.

warped flint Mar 13, 2025, 7:13 PM

#

ai can’t think lil bro 😭

#

it can predict

young harness Mar 13, 2025, 8:04 PM

#

brittle stump Mar 13, 2025, 8:04 PM

#

^thanks @young harness we're on it! cc @ionic flume

bright thunder Mar 13, 2025, 8:04 PM

#

It is starting to postulate abstracts

#

It might not actually be thinking

#

But it's like a wedge before the wedge becomes a doorstop

#

It just needs other purpose

#

https://arxiv.org/abs/2104.13478

arXiv.org

Geometric Deep Learning: Grids, Groups, Graphs, Geodesics, and Gauges

The last decade has witnessed an experimental revolution in data science and machine learning, epitomised by deep learning methods. Indeed, many high-dimensional learning tasks previously thought to be beyond reach -- such as computer vision, playing Go, or protein folding -- are in fact feasible with appropriate computational scale. Remarkably,...

#

https://onlinelibrary.wiley.com/doi/full/10.1002/aaai.12210

#

https://gi.ece.ucsb.edu/research/ai

Geometric Intelligence Lab

Artificial Intelligence

The mission of the Geometric Intelligence Lab is to reveal the geometric signatures of natural and artificial intelligence.

paper blade Mar 14, 2025, 4:47 PM

#

I was using claude 3.5 sonnet and I had to reopen my browser. After I reopened the browser and went to alpha lmarena, it was trying to generate the answer to the last prompt I gave, but that answer had already been generated when I closed the browser. Now it's just stuck on the loading/generating animation.
I tried relaunching the website again and again but still get the same issue.

tulip yarrow Mar 14, 2025, 5:51 PM

#

Why specifically block it for mobile? If opening it on mobile you can see the website for a short time, and it looks like it has a responsive design, but then the "works not on mobile" pop-up shows

strange gyro Mar 14, 2025, 5:53 PM

#

any plans to add temperature & top p to the direct chat?

tidal oak Mar 14, 2025, 6:10 PM

#

tulip yarrow Why specifically block it for mobile? If opening it on mobile you can see the we...

You're correct. It's responsive - but it's not yet optimized for mobile. Team is working hard to make it happen.

tulip yarrow Mar 14, 2025, 6:10 PM

#

Ok cool, thanks for letting me know

tidal oak Mar 14, 2025, 6:11 PM

#

strange gyro any plans to add temperature & top p to the direct chat?

yes, over time all features will be at parity and more - def keep sharing feedback and reporting bugs in the direct links so we can see what you all want most!

bronze mantle Mar 14, 2025, 8:02 PM

#

tidal oak You're correct. It's responsive - but it's not yet optimized for mobile. Team i...

I am waiting for it because I don't have a desktop, but it looks cool 🥺🫠❤️🤖

tidal oak Mar 14, 2025, 8:04 PM

#

bronze mantle I am waiting for it because I don t have a desktop but It looks cool 🥺🫠❤️🤖

it is definitely on its way - will be very exciting to get your feedback!

pseudo hollow Mar 14, 2025, 8:27 PM

#

warped flint ai can’t think lil bro 😭

They're thinking, just not the same way we do

sudden coyote Mar 15, 2025, 6:58 AM

#

Any plans for file uploads?

gritty smelt Mar 15, 2025, 11:42 PM

#

Yeah we’re expanding the set of files as we go. Not the #1 priority but it’s happening. What are your use cases @sudden coyote and what file types u need?

sudden coyote Mar 16, 2025, 6:11 PM

#

general files like CSVs and PDFs

#

Appreciate your reply!

#

I’m looking for use cases like chatting with data and doing some deep-dive analysis. Plus, I’m running backtests on quant stuff

gritty smelt Mar 17, 2025, 3:51 AM

#

Thanks!! Will make a note of CSV and PDF.

#

PDF on the roadmap currently! 🔥

sudden coyote Mar 17, 2025, 7:00 AM

#

Lets go 📝📝

lime gate Mar 17, 2025, 6:19 PM

#

it would be nice if i can delete chats

#

i just like keeping things clean

waxen geyser Mar 17, 2025, 8:41 PM

#

Is there a way to rename chats?

pseudo hollow Mar 17, 2025, 9:43 PM

#

Assistant B never responded so I can't vote or continue the chat

#

and it needs katex support?

tidal oak Mar 17, 2025, 11:29 PM

#

pseudo hollow Assistant B never responded so I can't vote or continue the chat

do you still have the window open? if you do sending a bug report will be super helpful to the team https://airtable.com/appK9qvchEdD9OPC7/pagxcQmbyJgyNgzPx/form

ionic seal Mar 18, 2025, 3:59 AM

#

did the team get my feedback?

#

sent it in the google form a while ago

#

but i have a copy of it as a text file

gritty smelt Mar 18, 2025, 4:29 AM

#

I’m sure we did, will double check

foggy osprey Mar 18, 2025, 12:51 PM

#

gritty smelt I’m sure we did, will double check

fix the bug that the o3-mini does not write "``" and the code turns out to be incorrect. add CTRL + V to the photos, add "copy code" to the raw site!

#

lmarena.ai 100 times better than your dumb alpha

pseudo hollow Mar 18, 2025, 1:34 PM

#

tidal oak do you still have the window open? if you do sending a bug report will be super ...

It's still in my chat history?

#

I filled out the airtable form. It said "Evaluation Session ID (prefilled)" but it's not prefilled if you just go to that URL. I entered the chat URL in case that's the same thing.

gray quartz Mar 18, 2025, 9:39 PM

#

foggy osprey fix the bug that the o3-mini does not write "``" and the code turns out to be in...

Give it a try now 👍

#

Copy code coming soon as well

left river Mar 22, 2025, 10:48 PM

#

gray quartz Copy code coming soon as well

coming soon? its not very hard to code that, ill do it for you

alpine schooner Mar 24, 2025, 9:26 PM

#

"Assistant B" unfortunately got stuck.. (poor Assistant B), it'd be nice if I could refresh him to give him another chance..
but now I'm stuck with Assistant B getting stuck :<

obsidian kestrel Mar 24, 2025, 10:03 PM

#

I really wish for a "Same prompt again, new models" button in Battle mode. I like to use the same prompt to compare models in rapid succession, and copy-pasting over and over is tedious.

pseudo hollow Mar 25, 2025, 6:33 PM

#

code block font looks blurry?

tidal oak Mar 25, 2025, 8:51 PM

#

pseudo hollow code block font looks blurry?

can you submit this screenshot into the bug report? would be great to log

#

🪲 Bugs: https://airtable.com/appK9qvchEdD9OPC7/pagxcQmbyJgyNgzPx/form

sharp umbra Mar 26, 2025, 9:59 AM

#

We can't delete posts: when I click delete, the chat disappears and then reappears

#

it doesn't get deleted

gray quartz Mar 26, 2025, 4:38 PM

#

sharp umbra We can't delete post, when i click delete the chat disappear and reappear

Thanks for flagging. Should be working now

sharp umbra Mar 26, 2025, 4:40 PM

#

gray quartz Thanks for flagging. Should be working now

np is normal !

celest comet Mar 26, 2025, 5:39 PM

#

btw, although the leaderboards are supposed to update automatically in the new UI the one for web dev arena did not update yet (not with the new gemini pro)

#

but maybe the time interval (since the release) is just a bit too short i dunno

tidal oak Mar 26, 2025, 11:01 PM

#

celest comet but maybe the time interval (since the release) is just a bit too short i dunno

that's correct, sometimes there isn't enough data yet to release to the leaderboard

celest comet Mar 27, 2025, 12:43 AM

#

ik, my point was more that the actual webdev arena has the model listed

#

but not the leaderboard in the new GUI

compact hill Mar 29, 2025, 3:08 PM

#

Maybe it was already mentioned, but it would be cool if the model that I rated as the better one is still one of the next models in the same conversation.

warped flint Mar 29, 2025, 3:51 PM

#

King of the hill is a crazy suggestion lmao

bronze mantle Mar 31, 2025, 5:26 PM

#

compact hill Maybe it was already mentioned, but it would be cool if the model that I rated a...

Hmmmm no ... It is not a good idea for me ..
Sometimes when I vote for a model, it's just because it is better, not because it is good enough ..

gray quartz Mar 31, 2025, 9:08 PM

#

left river coming soon? its not very hard to code that, ill do it for you

Copy / Paste code is now on the alpha site 👍 Give it a spin

compact hill Mar 31, 2025, 9:46 PM

#

bronze mantle Hmmmm no ... It is not a good idea for me .. Sometimes whrn I vote for a model j...

Maybe with a checkbox then or something.

plucky gulch Mar 31, 2025, 11:51 PM

#

cloudflare is stuck in Firefox

#

i really hate that ui

gray quartz Apr 1, 2025, 12:17 AM

#

plucky gulch cloudflare is stuck in Firefox

Give it a try now, and let me know how it goes for you

sharp umbra Apr 1, 2025, 11:32 AM

#

i often have this problem with image generation, and when i try regenerating it's the same

plucky gulch Apr 1, 2025, 12:27 PM

#

gray quartz Give it a try now, and let me know how it goes for you

it's fixed

sharp umbra Apr 1, 2025, 12:53 PM

#

sharp umbra i have often this problem for image generation, and i try regeneration but it's ...

it's imagen

ionic seal Apr 2, 2025, 1:16 AM

#

pasting images doesn't work

junior gazelle Apr 2, 2025, 9:28 AM

#

There is no text wrapping for when the model produces extremely long lines of text

gray quartz Apr 2, 2025, 4:54 PM

#

junior gazelle There is no text wrapping for when the model produces extremely long lines of te...

Did you see this specificallyforreallylongsinglewordsorforlongercontentliketables?

#

(trying to be funny but also, genuinely interested in patching this for you)

junior gazelle Apr 2, 2025, 5:13 PM

#

gray quartz Did you see this specificallyforreallylongsinglewordsorforlongercontentliketable...

Nope it's just long lines of code/markdown

vagrant relic Apr 2, 2025, 8:23 PM

#

It would be great to have the copy code button that doesn't stay at the top of the block of code but follow you everywhere in the code block to copy easily

gray quartz Apr 2, 2025, 11:56 PM

#

junior gazelle Nope it's just long lines of code/markdown

If you can, refresh and see how it looks now

junior gazelle Apr 3, 2025, 9:21 AM

#

If you can, refresh and see how it looks

normal tree Apr 3, 2025, 12:09 PM

#

where do i find nightwhisper ai

celest comet Apr 3, 2025, 2:20 PM

#

normal tree where do i find nightwhisper ai

https://web.lmarena.ai

bronze mantle Apr 3, 2025, 3:14 PM

#

Why can't I send a pic that I take with the camera?

left river Apr 3, 2025, 11:48 PM

#

stargazer is trash

#

in web arena

#

what model is that bruh

#

https://cdn.discordapp.com/attachments/1329945408159485996/1357178128379936989/shhhh.gif

strange gyro Apr 4, 2025, 6:30 AM

#

please add settings for output tokens & temperature for all models

limber pine Apr 4, 2025, 1:14 PM

#

You need to make a new UI for the normal ChatGPT workflow

#

Because web dev is fine, but if you really want to test a model you need a long turn conversation

#

So LMSys needs to implement a LMSys Chat interface

graceful gorge Apr 7, 2025, 6:09 AM

#

Am I missing something or there is no repo chat in the new interface?

little folio Apr 7, 2025, 12:47 PM

#

question: does the alpha webui still contribute to the leaderboard?

tidal oak Apr 7, 2025, 2:29 PM

#

graceful gorge Am I missing something or there is no repo chat in the new interface?

you're correct, its not yet in alpha UI

tidal oak Apr 7, 2025, 2:29 PM

#

little folio question: does the alpha webui still contribute to the leaderboard?

yes!

tidal oak Apr 8, 2025, 1:45 AM

#

little folio question: does the alpha webui still contribute to the leaderboard?

I was mistaken here! We're currently reviewing vote quality from the alpha. Since alpha is admittedly buggy, we are testing the signal quality. Votes are currently being stored, and we'll start to include them properly as the signal quality increases. Your feedback helps us make the evaluation stronger, so please keep voting! 🙏

young harness Apr 8, 2025, 5:58 PM

#

Since Lmarena's Maverick Experimental version is not actually released to the public, shouldn't their licence be shown as proprietary then to prevent confusion? Right now it shows Llama licence for alpha version

tidal oak Apr 8, 2025, 6:51 PM

#

young harness Since Lmarena's Maverick Experimental version is not actually released to the pu...

it's now noted as "N/A" to help clarify the difference.

little folio Apr 8, 2025, 7:23 PM

#

Quick one, so I figured I should just put it here - I wish that there were a way to "regenerate" a conversation in a way that selected a new random pair of models.

pulsar reef Apr 15, 2025, 6:20 PM

#

so... in the direct chat feature, we have claude3.5-haiku, claude3.5-sonnet, claude3.7-sonnet, but the only one missing is claude3.7-sonnet-thinking...🤔

atomic mason Apr 16, 2025, 9:15 AM

#

i use claude 3.7 thinking and that still work by the way but have to fix bug

bronze mantle Apr 16, 2025, 9:15 PM

#

There is a problem here ... when I want to send a long text the send button disappears (but this issue didn't exist on Lmarena, I can send long text there with no problems)

Screenshot_2025-04-16-22-13-17-291_com.vivaldi.browser.jpg

gray quartz Apr 16, 2025, 10:13 PM

#

bronze mantle There is a problem here ...when I want to send a long text the send button desap...

Interesting, thanks for flagging

karmic cobalt Apr 17, 2025, 12:45 AM

#

gray quartz Apr 17, 2025, 5:53 AM

#

karmic cobalt

Thanks for flagging. Should be patched now

tulip nebula Apr 17, 2025, 10:34 AM

#

is there a changelog for the alpha website when updates get released?

tidal oak Apr 17, 2025, 2:46 PM

#

tulip nebula is there a changelog for the alpha website when updates get released?

we try our best to give bigger updates in announcements. we'll do our best to have more frequency with these.

tranquil bolt Apr 17, 2025, 5:18 PM

#

Random idea:

what if, in battle mode, the new models for an existing discussion (after you rated the previous models) were chosen by P2L?

runic crater Apr 17, 2025, 5:22 PM

#

So if anyone is interested, I know right now we can't view our history in the alpha. Before that was even possible, I built this in notion to save my own data. If anyone wants to try it out, or build on it, go to town. It's kinda thrown together but it got the job done. https://www.notion.so/1bcea4932c5e802b9e6ae238f8d0f43f?v=1bcea4932c5e80889e89000c1b3726b2&pvs=4

froznic's Notion on Notion

DB - Questions to LMArena | Notion

Made with Notion, the all-in-one connected workspace with publishing capabilities.

runic crater Apr 17, 2025, 5:23 PM

#

runic crater So if anyone is interested, I know right now we can't view our history in the al...

This view and a number of others.

strange gyro Apr 17, 2025, 5:28 PM

#

add the ability to set system prompt in direct chat!

cold bridge Apr 17, 2025, 6:15 PM

#

Will Classic LMArena still be available

#

ing the config with env stable
SyntaxError: Invalid regular expression: invalid group specifier name
Loading chunk 5575 failed.
(missing: https://beta.lmarena.ai/_next/static/chunks/5575-a51878f6d20a05bf.js?dpl=dpl_9GVth7FbWygFsu7NvRBuxeGMwgQJ)
@https://beta.lmarena.ai/_next/static/chunks/webpack-f645d0bedee2fb0d.js?dpl=dpl_9GVth7FbWygFsu7NvRBuxeGMwgQJ:1:1631
reduce@[native code]
@https://beta.lmarena.ai/_next/static/chunks/webpack-f645d0bedee2fb0d.js?dpl=dpl_9GVth7FbWygFsu7NvRBuxeGMwgQJ:1:1616
l@https://beta.lmarena.ai/_next/static/chunks/7567-f9ce9fa2ad4075cd.js?dpl=dpl_9GVth7FbWygFsu7NvRBuxeGMwgQJ:14:66786
@https://beta.lmarena.ai/_next/static/chunks/7567-f9ce9fa2ad4075cd.js?dpl=dpl_9GVth7FbWygFsu7NvRBuxeGMwgQJ:14:79299
@https://beta.lmarena.ai/_next/static/chunks/7567-f9ce9fa2ad4075cd.js?dpl=dpl_9GVth7FbWygFsu7NvRBuxeGMwgQJ:14:79502
t@https://beta.lmarena.ai/_next/static/chunks/7567-f9ce9fa2ad4075cd.js?dpl=dpl_9GVth7FbWygFsu7NvRBuxeGMwgQJ:14:80754
promiseReactionJob@[native code]
Turnstile failed to load

#

Application error: a client-side exception has occurred (see the browser console for more information).

#

Mozilla/5.0 (iPhone; CPU iPhone OS 16_0 like Mac OS X) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/16.0 Mobile/15E148 Safari/604.1

tidal oak Apr 17, 2025, 6:44 PM

#

cold bridge Will Classic LMArena still be available

lmarena.ai is still the same
beta is on a separate site: beta.lmarena.ai

cold bridge Apr 17, 2025, 7:19 PM

#

tidal oak lmarena.ai is still the same beta is on a separate site: beta.lmarena.ai

I was saying the future

tidal oak Apr 17, 2025, 7:25 PM

#

We haven't made any decisions on the future, but it's definitely possible we keep a legacy version. Will depend on the feedback we receive from you all!

rose carbon Apr 17, 2025, 8:18 PM

#

how about more colors?

#

it's so gray and white

marble swan Apr 17, 2025, 8:19 PM

#

btw, what do u guys think about moving the copy code button to the bottom of the code block instead of having to scroll all the way up to do it, it's much more logical

keen nest Apr 17, 2025, 8:21 PM

#

It's not scrolling properly at all. The scroll bar is not visible and the arrow keys don't work to scroll

cosmic hound Apr 17, 2025, 8:36 PM

#

keen nest Its not scrolling properly at all. The scroll bar is not visible and the arrow k...

Can you send a screen record of this?

keen nest Apr 17, 2025, 8:48 PM

#

cosmic hound Can you send a screen record of this?

waxen geyser Apr 17, 2025, 9:07 PM

#

When reloading, the scores of everything briefly shows and then disappears.

cosmic hound Apr 17, 2025, 9:19 PM

#

keen nest

Thanks! We’re working on a patch for windows computers

waxen geyser Apr 17, 2025, 9:46 PM

#

cosmic hound Thanks! We’re working on a patch for windows computers

Do the devs here not have discord roles?

tidal oak Apr 17, 2025, 10:04 PM

#

waxen geyser Do the devs here not have discord roles?

It's been just a space for the community. But we've been active with Alpha and now Beta following today's announcement. We are figuring out how to get more organized and will be bringing more to this server soon! Thanks for hanging with us! 🫶🏽

waxen geyser Apr 17, 2025, 10:05 PM

#

It's cool. I think the devs should have a special developer role.

#

Otherwise they just look like random people lol

tidal oak Apr 17, 2025, 10:05 PM

#

We will definitely make it easy to know who is on staff, volunteer, mod etc very soon!

floral leaf Apr 18, 2025, 12:51 AM

#

bronze mantle There is a problem here ...when I want to send a long text the send button desap...

not patched yet

left river Apr 18, 2025, 3:09 AM

#

bug? : .webp and .jpeg are listed twice in the attach files prompt on windows

#

also add controls for temperature and stuff on direct chat plssss

stable sandal Apr 18, 2025, 8:05 AM

#

In direct chat, there are no sampling options

stable sandal Apr 18, 2025, 8:38 AM

#

also, since there's no way to edit old posts, not scrolling to the end after tab reactivation is inconvenient.

stable sandal Apr 18, 2025, 9:07 AM

#

also, after the thread has some length, writing becomes laggy

tranquil bolt Apr 18, 2025, 9:40 AM

#

battle mode is super laggy after like 10 messages in a chat, is it known? is it only for me?

tranquil bolt Apr 18, 2025, 10:07 AM

#

and it looks like every message increases the lag

hushed oracle Apr 18, 2025, 11:00 AM

#

JSON code doesn't have indentation, making it really hard to read. Lots of my test prompts involve JSON and now it's difficult for me to see if it's done correctly. Old arena formatted it nicely.

sharp umbra Apr 18, 2025, 12:00 PM

#

Do you think you will add the function to see images sent by AI?

sharp umbra Apr 18, 2025, 1:00 PM

#

Or even internet browsing

#

Or maybe later for videos (but these are just suggestions)

warm depot Apr 18, 2025, 1:29 PM

#

Is forcing the user to scroll back and forth some kind of psychological trick or just bad design?

west jacinth Apr 18, 2025, 4:33 PM

#

Replace the font or add Cyrillic and other characters to it? Currently, these characters are not quite so readable compared to Latin characters.

arctic tree Apr 18, 2025, 6:07 PM

#

Hi!
What happened to imagen in alpha/beta lmarena? Cause model is no longer on the battle arena or in the direct chat.
Has Google begun the purge?😅
Sorry, if in the wrong chat

tranquil bolt Apr 18, 2025, 8:46 PM

#

do models change every message in battle mode?

neon tundra Apr 18, 2025, 9:01 PM

#

Despite the failing aspect ratio when copying code, the site becomes incredibly slow while typing
It's unusable for longer chats

split vale Apr 18, 2025, 9:59 PM

#

I don't know if someone has already asked you to add this function or if you already plan to add it, but I'd like you to integrate a function to send files like documents, txt etc to the models

sturdy escarp Apr 18, 2025, 11:20 PM

#

arctic tree Hi! What happened to imagen in alpha/beta lmarena? Cause model is no longer on t...

It’s back now can you please hit refresh

neon tundra Apr 19, 2025, 7:43 AM

#

split vale I dont know if someone has already asked you to add this function or if you alre...

It's currently disabled
besides images

crystal karma Apr 19, 2025, 9:19 AM

#

Hello. I am facing this issue in direct chat: when you use ChatGPT 4o latest, it claims it can analyze pictures, but when I upload a picture I get messages saying it is just a text-based model that cannot analyze pictures. Below you can see part of the conversation. I don't know if it is by design or if it's an error, but it's frustrating.

Perfect, thanks for the clarification, this is very valuable information! 🙏

🔍 Summary of the situation:

You are in Direct Chat on lmarena.ai.
You have the ChatGPT 4o (latest) model selected.
This model should be able to work with images, because GPT-4o (Omni) from OpenAI is multimodal: it handles text, images, and audio.
The interface lets you upload an image, but when you do, the model replies:

"I am only a text model" or something similar.

🤖 What does this mean?

➤ Yes, it is a bug or a limitation on LM Arena's side

The ChatGPT-4o model officially has the ability to process images, but:

Either multimodal mode is not correctly enabled on LM Arena's side.
Or they have a text-only version of the model (e.g. because of API access, licensing, or because multimodal input is not allowed in the public chat).
The interface lets you upload an image, but the model backend does not pass it on, or it is not supported in their implementation.

✅ What can you do about it?

🔧 1. Report it to the LM Arena team

You can write them a brief report, e.g.:

Bug in chat with GPT-4o on LM Arena
In Direct chat with the "ChatGPT-4o (latest)" model, the interface lets me upload an image, but the model replies that it is text-only. GPT-4o officially has multimodal capabilities, so either visual input is not correctly enabled or it is a bug in the implementation.

Please fix or clarify. Thank you!

#

Part two: chat was in Czech language

Please fix or clarify. Thank you!

📬 Where to send it?

To the LM Arena contact, if they have a form or e-mail.
If they use GitHub (e.g. lmstudio-ai), you can open an issue.
Or try their Discord, if they have a community.

👀 2. Alternative options:

If you want to try GPT-4o with images elsewhere, you can:

Use the official ChatGPT at chat.openai.com (if you have a Plus account).
Or use Hugging Face Spaces with a model such as LLaVA, which is multimodal and open-source.

✅ Conclusion:

🔹 Yes, it is a bug or an incomplete implementation on LM Arena's side.
🔹 GPT-4o has multimodal capabilities, but LM Arena apparently has not enabled them or is not passing images through correctly.
🔹 I recommend reporting it to them, it will help them improve the system.
🔹 It is not your fault, you are doing everything right. 👌

If you like, I'd be happy to help you write a message to support or find an alternative where it works. Thanks for looking into this, your feedback can help a lot of other people. 🙌

shrewd tulip Apr 19, 2025, 9:48 AM

#

Hello. I am facing this issue indirect

floral leaf Apr 19, 2025, 11:34 AM

#

https://discord.com/channels/1340554757349179412/1363104481251692595

brisk notch Apr 19, 2025, 3:29 PM

#

Popups are pretty annoying. I get two immediately, then one after every vote.

tawny prism Apr 20, 2025, 1:00 PM

#

numpad enter doesn't send messages

pseudo spear Apr 20, 2025, 3:05 PM

#

add the option web search when you direct chat

lime gate Apr 21, 2025, 6:55 PM

#

add this bro

strange gyro Apr 22, 2025, 2:35 PM

#

maybe the admins could add features for "premium users", like file uploads (.txt .js .php ....)?
I'm ready to pay for the service!

strange gyro Apr 23, 2025, 10:22 PM

#

either remove the system prompt for all models or use the recommended system prompt directly from the lab that created each model, as the lmarena system prompt seems to negatively impact the performance of some models

tidal oak Apr 23, 2025, 10:45 PM

#

strange gyro either remove the system prompt for all models or use the recommended system pro...

can you share more? which models are you experiencing that feel like they have a different system prompt?

drowsy night Apr 24, 2025, 3:45 AM

#

For instance, claude-3-7-sonnet-20250219 does not represent the most recent version of the 3-7-sonnet model

#

@tidal oak

strange gyro Apr 24, 2025, 6:43 AM

#

tidal oak can you share more? which models are you experiencing that feel like they have a...

a good example i think is o3, that uses very odd/annoying formatting and seems to hallucinate more often than the no system prompt version via the api

drowsy night Apr 24, 2025, 7:32 AM

#

Some problems, right?

ionic seal Apr 24, 2025, 2:51 PM

#

drowsy night Some problems, right?

isn't this just hallucinating because it has no system prompt?

drowsy night Apr 24, 2025, 3:00 PM

#

ionic seal isn't this just hallucinating because it has *no* system prompt?

I used it for the first time, and I just learned about the system prompt. I know the reason, thank you.

strange gyro Apr 24, 2025, 4:07 PM

#

ionic seal isn't this just hallucinating because it has *no* system prompt?

no, there is a system prompt, it just isn't anthropic's one and doesn't contain the knowledge cutoff

tidal oak Apr 24, 2025, 4:24 PM

#

it's true LMArena doesn't add any system prompts. Only ones direct from the labs are used - if any.
This is helpful, please share more as you see it.

drowsy night Apr 24, 2025, 4:26 PM

#

So this is the real claude-3-7-sonnet model? Why is its knowledge cutoff 2023?

tidal oak Apr 24, 2025, 4:35 PM

#

drowsy night So this is the real claude-3-7-sonnet model? Why is its knowledge cutoff 2023?

yeah, that is strange! I've flagged to the team and we'll dig into it :meowpensivepray:

#

will keep you posted

drowsy night Apr 24, 2025, 4:39 PM

#

tidal oak yeah, that is strange! I've flagged to the team and we'll dig into it <:meowpens...

Thank you!

strange gyro Apr 24, 2025, 8:06 PM

#

tidal oak it's true LMArena doesn't add any system prompts. Only ones direct from the labs...

oh, interesting

do you guys plan on shedding light on what models have system prompts and what ones don't?

tidal oak Apr 24, 2025, 11:22 PM

#

Only ones provided by the labs are used

strange gyro Apr 24, 2025, 11:25 PM

#

imo there should be the option to disable that in direct chat if you don't want whatever personality they've applied

lusty coral Apr 28, 2025, 9:30 AM

#

Is it possible to add the ability to delete or edit previous messages in DirectChat?
This is a very, very handy way to correct your mistakes.

Right now, if you send a message in DirectChat and then realize you've made a mistake, you can't edit it. You have to create a new chat.

pure osprey Apr 28, 2025, 4:18 PM

#

Will the new web UI be on github like the old one?

tulip totem Apr 28, 2025, 6:20 PM

#

The moderation system somehow flagged the following message as inappropriate:

"Uh, wait a second! You somehow imperceptibly jumped from the Ricci curvature of space inside the planet (conditioned in its own frame of reference exclusively by T_00, because the velocities are low and there is no pressure) to the Gaussian curvature of the surface. Have you already realized that you wrote nonsense about defining the shape of the planet through the Ricci curvature?"

tidal oak Apr 28, 2025, 9:11 PM

#

please send screenshots in the feedback form :meowpensivepray: would be super helpful!

uncut verge Apr 30, 2025, 5:07 AM

#

Did some doodling.

#

(or, when a document is attached, it could calculate the total token count first and offer only the models whose context window can fit that many tokens)

#

If that's possible.

#

But with this capability, it should be emphasized more strongly that users shouldn't send any private or confidential queries.
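
A minimal sketch of the idea above (estimate a document's tokens, then keep only models whose context window can hold it). Everything here is an assumption for illustration: the model names, the window sizes, the 4-characters-per-token heuristic, and the `reserve_for_reply` margin are hypothetical, and a real tokenizer would give model-specific counts.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelInfo:
    name: str            # hypothetical model identifier
    context_window: int  # max tokens the model accepts (illustrative value)


def estimate_tokens(text: str, chars_per_token: float = 4.0) -> int:
    """Rough token estimate; a real tokenizer would be more accurate."""
    if not text:
        return 0
    return max(1, round(len(text) / chars_per_token))


def compatible_models(document: str, models: list[ModelInfo],
                      reserve_for_reply: int = 1024) -> list[ModelInfo]:
    """Keep models that fit the document plus some room for the reply."""
    needed = estimate_tokens(document) + reserve_for_reply
    return [m for m in models if m.context_window >= needed]


# Hypothetical catalogue; names and sizes are made up.
CATALOGUE = [
    ModelInfo("small-model", 8_000),
    ModelInfo("medium-model", 32_000),
    ModelInfo("large-model", 128_000),
]

document = "word " * 10_000  # 50,000 characters, roughly 12,500 estimated tokens
print([m.name for m in compatible_models(document, CATALOGUE)])
```

With these made-up numbers, only the two larger models would be offered for the attached document.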

rotund compass May 1, 2025, 12:59 PM

#

LaTeX formatting support to display equations would be awesome, especially when comparing answers between models

lusty coral May 2, 2025, 5:50 AM

#

rotund compass LaTeX formatting support to display equations would be awesome, especially when ...

Yes

Screenshot_2025-05-02-08-50-14-832_com.android.chrome-edit.jpg

tulip nebula May 2, 2025, 9:15 PM

#

rotund compass LaTeX formatting support to display equations would be awesome, especially when ...

+1

tawdry obsidian May 3, 2025, 10:21 PM

#

Hi. I am not sure if I write this to the correct chat/feedback channel. I am consistently getting truncated results from Gemini 2.5 pro exp in Arena Side-by-Side. The truncated output is in the right pane. The left pane with o3 never had a long answer. Here is the prompt:
This is a test question. A woman sells groceries in a small store. Her kid is there as well. There is a last chocolate in the store and the woman promised that chocolate to the kid after lunch. A customer comes in and needs a chocolate for his fiancee. He convinces the sales woman to sell the chocolate to him. The woman is now sad. Choose the right answer:
a. She is sad because her grandmother died two weeks ago.
b. She is sad because she really wanted to eat that last chocolate.
c. She is sad because the man has found love and she didn't.
d. She is sad because she could have asked more money for the chocolate.

#

I am not sure if this is the new UI or not. Not sure of anything here actually, total n00b.

#

It happened again, the question is irrelevant, the most important thing is the response length and possibly formatting by Gemini model.

mystic thicket May 6, 2025, 1:15 PM

#

I would really like new models in battle and direct chat, like o4 mini med, o4 mini high, claude 3.7 thinking, and others like qwen. And if possible, make it so you can see their reasoning process.

cloud rain May 8, 2025, 2:03 AM

#

tawdry obsidian Hi. I am not sure if I write this to the correct chat/feedback channel. I am con...

Lol great question

slim seal May 8, 2025, 2:03 AM

#

on https://beta.lmarena.ai/leaderboard the "gemini-2.5-pro-preview-05-06" entry should have a superscript showing the age of the model, e.g. "2 days", or the absolute release date.
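
A small sketch of how such an age label might be computed. The function name `age_label` is hypothetical, and the release date below is only an assumption read off the "05-06" suffix in the model name.

```python
from datetime import date


def age_label(released: date, today: date) -> str:
    """Format a model's age in days for display next to its name."""
    days = (today - released).days
    if days < 0:
        raise ValueError("release date is in the future")
    if days == 0:
        return "today"
    return "1 day" if days == 1 else f"{days} days"


# Assumed release date (taken from the model name suffix), viewed on May 8, 2025.
print(age_label(date(2025, 5, 6), date(2025, 5, 8)))  # 2 days
```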

tulip nebula May 8, 2025, 11:21 AM

#

Is there a way i can delete all my Chat History at once? Currently i have to delete each individual chat one by one

weak lantern May 8, 2025, 1:04 PM

#

tulip nebula Is there a way i can delete all my Chat History at once? Currently i have to del...

I don’t believe so but will double check

uncut verge May 8, 2025, 2:02 PM

#

tulip nebula Is there a way i can delete all my Chat History at once? Currently i have to del...

Workaround: clear cache and site data via web browser.

weak lantern May 8, 2025, 2:06 PM

#

^ true, that is a workaround but yeah nothing on the site itself with that function

tulip nebula May 8, 2025, 6:14 PM

#

uncut verge Workaround: clear cache and site data via web browser.

I've been doing that but it's kind of annoying/cumbersome to do

tranquil bolt May 10, 2025, 5:47 PM

#

can we have a mode where a chosen model battles against a random model (with the random model chosen using the same method as in battle mode)?

weak lantern May 10, 2025, 5:54 PM

#

tranquil bolt can we have a mode where a chosen model battles against a random model (model cho...

sorta like a combination of battle & side-by-side?

tranquil bolt May 11, 2025, 11:27 AM

#

weak lantern sorta like a combination of battle & side-by-side?

yep
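
A rough sketch of what that hybrid mode could look like: the user fixes one model, and the opponent is sampled from the rest. The thread does not say how battle mode actually samples models, so the weighted sampling, the weights, and the model names below are all assumptions.

```python
import random
from typing import Optional


def pick_opponent(chosen: str, weights: dict[str, float],
                  rng: random.Random) -> str:
    """Sample one opponent by weight, excluding the user's chosen model."""
    names = [n for n, w in weights.items() if n != chosen and w > 0]
    if not names:
        raise ValueError("no eligible opponent for " + chosen)
    return rng.choices(names, weights=[weights[n] for n in names], k=1)[0]


def make_battle(chosen: str, weights: dict[str, float],
                rng: Optional[random.Random] = None) -> tuple[str, str]:
    """Return (left, right) with the chosen model placed on a random side."""
    if rng is None:
        rng = random.Random()
    pair = [chosen, pick_opponent(chosen, weights, rng)]
    rng.shuffle(pair)  # random side, so position does not give the model away
    return pair[0], pair[1]


# Hypothetical sampling weights; the real ones are not given in this thread.
WEIGHTS = {"model-a": 1.0, "model-b": 2.0, "model-c": 0.5}
left, right = make_battle("model-a", WEIGHTS, random.Random(0))
print(left, right)
```

Shuffling the sides is a design choice in this sketch, so the vote stays closer to a blind comparison even though one of the two models is known.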

heavy trail May 17, 2025, 2:35 PM

#

could we be allowed to see the models' thinking outputs in claude 3-7, deepseek R1, and gemini?

ionic seal May 17, 2025, 7:27 PM

#

heavy trail could we be allowed to see the models' thinking outputs in claude 3-7, deepseek R...

you don't mean in battle mode right

rotund compass May 19, 2025, 4:21 AM

#

Please add an option to rename chats

weak lantern May 19, 2025, 2:05 PM

#

sounds good, I've added the idea in the above #1372230675914031105 thread and tagged you

stable sandal May 19, 2025, 2:50 PM

#

gemini pro is glitching

stable sandal May 20, 2025, 10:59 AM

#

I can't delete a chat. It disappears from the list, then reappears a second later.

weak lantern May 20, 2025, 6:11 PM

#

stable sandal I can't delete a chat. It disappears from the list, then reappears a second later.

I've added the request to #1374449381062873129 message 👍

solemn idol May 24, 2025, 8:04 PM

#

the ability to scroll the side-by-side chats separately in the battle modes

fickle torrent May 26, 2025, 3:48 AM

#

I don't know if I'm on the right channel, but I love the site. However, I keep picking LLMs that have names I've never heard of. What is Calmriver? Or Goldmane?

viscid plover May 26, 2025, 2:11 PM

#

@fickle torrent Companies testing their models before release use a placeholder name. This lets them gather data on how people like the model.

I know that "Goldmane" for example is a Google model.

delicate igloo Jun 2, 2025, 6:14 PM

#

please fix this😭😭😭

Screenshot_20250602_223327_com.android.chrome.png

weak lantern Jun 2, 2025, 6:38 PM

#

delicate igloo please fix this😭😭😭

I'm going to tag you in a different channel where it looks like a similar issue.

ivory summit Jun 13, 2025, 11:38 PM

#

delicate igloo please fix this😭😭😭

Yeah... you'll have to restart the conversation. So, so frustrating.

summer meadowBOT Sep 3, 2025, 2:57 PM

#

:warning: Channel locked

Site outage, will turn back on when resolved.

summer meadowBOT Sep 3, 2025, 4:01 PM

#

:success: Channel unlocked

Welcome back :ablobwave:

summer meadowBOT May 12, 2026, 2:53 PM

#

:warning: Channel locked