#general
1 messages · Page 244 of 1
Grok is so bad, it focuses WAY to much on personality
Grok is only for chatting, i dont like its personality and behavior much
Average grok 4 experience.
Its likely trained from X engagment data
The grok that you use in free account is different than the grok they give to benchmarks. And thats true for almost all models.
Thats why models dont reflect benchmarks sometimes
They give you a dumb & cheap version to just try
No way this model has lower hallucination rate.
Which is the best model for analysing a youtube video?
Hm
Gemini is convenient.
But you can extract subtitles and insert into any Ai you like.
Gemini obviously
The gap is narrowing with Open models
Natively though.
Not on third parties.
Even grok supports video uploads now.
They're making progress.
what's wrong with you generators gemini 3 and it's 2k version are not loading image or takes very slow on them.................
hi
We thought Llama was going to be better than Gemini 2.5 btw
With benchmarks
what happened to meta tho
Probably went into hibernation.
But qwen and kimi been proving themselves from a while they have built their base now they are growing. There are more like deepseek, glm, minimax.
Thats on you
You have something which you consider accurate & precise?
Their own benchmarks are always weird. Always see multiple leaderboard providers, trustworthy reviewers and trying yourself.
nb pro is back to giving the error sign :/
Try sending requests when a new minute starts , you will have the highest probability to get through
wdym? should i track seconds before i send ?
No i mean the time.
On the clock
Now its on the app and web
Thats because only a limited generation are allowed in a single minute
For the whole platform.
Rate limit
damn that's intelligent
:]
I will be trying qwen 3 and kimi 2.5 on their platform to actual see how much good they are now
Qwen models for me they suck they don't do thinks correct even it beat Claude in benchmarks. But kimi k2 is the only model I enjoyed it fixes things great and search's for solutions it's just the best in my opinion. Now that kimi k2.5 is out it will be even more enjoyable
a model that refuses to answer ANYTHING would get a perfect score 0% on this chart
Ai is stupid right now its getting improved every months
Give it a year and you will see 0%
thats not what we want
0% with low accuracy is so bad
thats why we need hallucinate rate + accuracy
this bench will give you +points if you just replied with 'i dont know the answer'
Yes, so what? If a model is capable of identifying what it know and what it does and being front about it is nice.
i wont say its flawed but we need to add another parameter with it
not if its low on accuracy
If you visit thesitee this is a sub parameter of the leaderboard omniscience
The other one is knowledge
And with both of them we get omniscience
I dont like the merger because its not implemented correctly i think because a model with high knowledge but also high hallucination with get high rank then a model with less knowledge in comparison with less hallucination.
The weightage is not correct for them
@left lodge it kinda helped but i still think there's something wrong with the platform, like sometimes i cant even regenerate
ok so there is a refusal + hallucination point
refusal : 0 pts
hallucination : -1
Yeah thats their server issue i think i cant overcome that :p
i thought it only measures refusal
No
what do u think cause that ? loads ?
but they are giving 50% weight to non-hallucination
"AA-Omniscience score is calculated as the equally-weighted average of two components: (1) Accuracy... and (2) Non-Hallucination Rate"
Dont know, maybe connection errors or something else
Gemini 2.5 Flash (09-2025) (Reasoning) is used as the grading model"
???????????????????????????????
wait what
got you, thanks for the help buddy
*Thinking
The cost of using sota will be much high because there 6k questions and so many models
i guess they have a pre-defined answers already
i see
no wonder gemini 3 pro is higher on the list
Let’s look at two hypothetical models:
The "Coward" Model: Refuses every single hard question.
Accuracy: Low (let's say 20% on easy stuff).
Non-Hallucination Rate: Perfect (100%).
Score: (20 + 100) / 2 = 60.
The "Genius Gambler" Model: Knows almost everything but sometimes guesses wrong.
Accuracy: Very High (80%).
Non-Hallucination Rate: Bad (it guesses wrong often, say 40% score).
Score: (80 + 40) / 2 = 60.
Yeah
it penalizes guessing too heavily
wouldnt be better if they added like a confidence score
It should be searching or , asking for details & context than guessing.
like for example -> Im 85% sure this is correct
They are bad at that
Almost all
High Confidence (90%) + Correct Answer: Massive Points. (True Expert).
Low Confidence (20%) + Correct Answer: Small Points. (Lucky Guess).
Low Confidence (20%) + Wrong Answer: Small Penalty. (Honest Mistake).
High Confidence (90%) + Wrong Answer: CATASTROPHIC PENALTY. (Delusional Hallucination).
yea ik
but still this bench doesnt do gemini justice tbh
well we can use something like uncertainty quantification
kimi k2.5 thinking is so good google really sucks look its trying to git clone github but kimi 2.5 thinking didnt
Well I mean kimi k.2 is mostly known for being agentic
its really really best ai model i have seen and i dont call everything ai slop i just use ai as a helper when i install arch or need to get a guide on something and but its a good use case to use it as a helper but ai video sucks and image
does it ever get revealed which model was hidden behind a particular code name?
i'm really liking this "hermes" video model, the sound it generates is perfect and it's very good at anime, much better than veo 3.1, despite having a lower resolution. and it feels pretty fast, it could be some Chinese open-weight model
Hello. When I click on 'Verify email', this window appears and nothing happens. What could be the problem?
How do you access it?
kimi.com thats my website where i use it
Thx 🤝
its also the offical website
Is there anyone from Spain🇪🇸?
you are using google ai overview vs kimi chat
when do we expect gpt 5.3 to be released?
didnt lmarena have an option that choose the best ai to answer ur questions?
lmaooo everyone hated it so they removed it
poor lmarena devs
(idk what im talking about)
how do i delete a chat in lmarena? there is only "archive" option
We really saw how chatgpt gpt 5 handled it
fr
bro i loved it
how does kimi feel compare to opus 4,5?
@echo aurora did u ping me and delete my messages
Even glm 4.7 can really beat sonnet 4.5, but is good see the china running, the models really is improving at time
opus or sonnet?
Sonnet
i dont use sonnet, only opus
Too
I use on anti grativy with gemini pro
200 a year right
no a month
Cc?
@fiery gull you need to test opus bro
It's best rn
Even better than gpt 5.2 pro garbage
but i am always open to try new thigns
it really is, i havent been on discord in months bc of CC and using opus
i just been glued to building lol
im only hear cause i heard sonnet was dropping this week lol
yeah opus imo is much better
What? I use opus 4.5 24/7
but these chinese models are good, like very impressive
When will qwen 3 max thinking be added?
I thought this aswer is about my before affirmation
@fiery gull @shrewd citrus? @balmy mist @fiery gull
Bruh
@echo aurora
@hollow imp why bro, I have gift coupons
go ask telegram
Gift coupon apne paas rakh
@hollow imp ok dude
Delete the message
@hollow imp done ✅
Okay...
@fiery gull
Do you know the api Claude opus model is much much better than the ui one because of some beta headers and secret features?
all api models are better
You said u use Claude opus all day
there’s like less safeguards restrictions
And more $$$
Yep, on anti grativy
Bruh
In anti grativy has the opus 4.5
is it better than gemini 3?
Even benchmaxx from kimi say no
its below on everything
Every thing is better than gemini.google.com gemini 3
its not that good at coding
it’s better what you on
Anything other than vertex ai gemini 3 is not gemini 3
interesting, so a nothign burger, ill stick to my opus 4.5 on CC and wait for sonnet, see you guys tmw lmaoo
its not worth the change yea
Why do you want sonnet if you have opus
on most benchmarks for vision it beats all the other models
for coding and other things it’s ass
yeah going back into my cave, lmaoo @hollow imp i heard a new sonnet is dropping this week
The opus use less tokens to make the work, don't is soo more expensive that sonnet
V4 or sum
Opus 4/4.1 vs sonnet 4.5
What do you say
a new sonnet would beat opus 4.5
I hope 🙃
But a new gpt pro hasn't been able to beat Claude 😭
but realistically would Claude let sonnet be a better model than opus
5.1 5.2 now 5.3 soon
makes no sense as a business plan lets be real
No
Only if they have the next opus ready as well
Dont now, but gemini 3.5 is coming 👀
i think every team/company has mastered their models and releases, so every model they release will be their best model, look at claude history, also look at google and gemini and how flash was better than pro because it came out more recent, thats the trend, its not about size anymore
What about grok
there is still a grok?
Grok heavy >>
Agentic glm 4.7 >>>
idk why people would pay for grok
For its twitter expertise
all new chinese models are trained heavily on gemini 3 flash/pro
and the best one that really cloned gemini outputs and kinda removed laziness ( at coding ) is glm 4.7
anyone been using clawdbot?
I think this is the order in which I like to use the models (purely usability/usefulness):
Kimi 2.5 >> GLM 4.7 > MiniMax M2.1 > DeepSeek V3.2 > Qwen3 235B
Qwen just feels very slop and last gen by now. Both GLM and MiniMax absolutely destroy it. DeepSeek V3.2 is a strong model
these are the type of people that tries a model for 1 min
if you are comparing them purely on text and not multimodality
then clearly glm 4.7 is better
minimax m2.1 is only good at coding and its also a smaller model
@echo aurora
Not that I’m aware of 
Sorry what is pointing out?
Weird question which ai is best for roleplay
Grok press F
I know what kind of man you are…
sydney bing chat
When's Kimi k2.5 going to be added to lmarena
already added
when am i going to be added to the arena
@echo aurora
K2.5 in text and code arena 🥺
Thank u~~~
Add k2.5 thinking, Agent and agent swarm pls 😔
just look at the benchmarks lol
on huggingface
stats don’t lie
Hello My name is Clinton , I'm here to learn
HI! Please check #1397655624103493813 to learn how to properly prompt the bot.
Add z-image turbo
it is
gemini 3.5 when
Hi! Sometimes our friend Pineapple is busy with other stuff. You can also use the Moderator tag @ Moderator so we can come and help.
That will also ping Pineapple
Because I like the pineapple lol
We all like him, lol
wth
Debatable
ill just stick to gemini if audio processing is gonna be like this
fr
that's good that error has changed the limit error it says wait min NICE LMARENA MODs
New Mode Update 
<@&1372208635530448926> - A new model has been added to Text Arena.
- Claude opus 4.5 thinking Max
We've had this message in place for awhile. Would note that the Something went wrong could also appear when it's rate limit.
yup that's better and i hope another error code has appear too
like 404
F..
what?
rate limit for gemini-3..
another model has rate limit too....
😭
just give me gpt 5.3 already
I don’t know how many requests it takes to hit the rate limit.
wait released?
no
if it was, I wouldn't say give it to me
Tell that in the openai server
not bad

but not better than gemini 3
lol
im in so many servers
Pls .txt file 🙁
Could happen! Is on our radar for sure.
Is .txt support worth millions of dollars in development?
I'm not able to share details about what is/isn't upcoming until we're ready to share those updates.
🙁
Wait wow kimi 2.5 already 😸
I thought it would take a couple of days to implement 💀
wonder how coding performance really is
Nothing special about its coding tbh
i almost beliebed it lol
how do i fix Something went wrong with this response, please try again.
is that on my end issue or website?
Grok is down.
For other models, use the retry button.
Would rather have 5.2 xhigh.
Yep.
How many messages did it take to hit the limit?
And does it apply to only Gemini or all models?
im using gpt 5.2
Retry button didn't work?
nope its not even showing it anywhere so help 😂
You don't have that button?
ye i see it but it just shows the same "error" again
If that and refreshing didn't help then you'll probably have to start a new thread or switch the model I guess.
ah that sucks
What were you doing?
missions for a game
I'm assuming you won't notice much difference between 5.2 and 5.1
And you should be able to switch in same thread.
am try gpt 5.2-high
Doesn't break the context flow if you are worried about it.
nope still not working
Working fine here.
none of the gpt's are working
Probably some issue on your end.
ye idk what to do then
Try different models to see which one works I guess.
none for some reason
Not even in a new thread?
i didnt try that but will it lose all data from the chat or
like will it be dumb again xd
Did you make an account?
ye i have it
You shouldn't lose chat history unless you delete the chat, right?
You can always continue the thread later on.
yeah but im talking about when i make new chat will i have to send him all the code again
Obviously.
damn i have so many files xd
Can you try the steps in this article: https://help.lmarena.ai/articles/1645798556-lmarena-how-to-something-went-wrong-with-this-response-error-message ?
You may sometimes see the error message: “Something went wrong with this response, please try again.”
This is a general error message. It can
yeah i did try to hard refresh didnt work almost lost account
trying now its stuck on Generating nvm it worked
Openai just posted and deleted this Introducing Prism, a free workspace for scientists to write and collaborate on research, powered by GPT-5.2.
Available today to anyone with a ChatGPT personal account: prism.openai.com x.com/OpenAl/status
almost lost account
Hmm can you elaborate more on this?
too bad, that would be pretty cool (not that I code)
when i did clear cookies and data i had to log in to everything thats why
AI influencers are taking over social media — and people are already making money from them.
In this video, I show you how AI influencers work, how pages grow to millions of followers, and how you can create your own AI influencer and monetize it on Instagram in 2026.
You’ll learn:
• What an AI influencer is
• How AI influencer pages g...
I didn’t get my video
Hmm I'm seeing those requests. Do you know if you're getting the same issue when using the site?
I already reached today limit on site
🫠
i am the first person to react to the thing pineapple announced
Okay I'll keep an eye on others having the same problem, and will report to the team. 
It looks like the ones that were pending have now failed.
This is pure mastery of AI blended with cinematic storytelling 🔥
What if Hulk became Spider-Man?
You need to see this.
What if Hulk was bitten by the same spider that created Spider-Man?
This is SPIDER-HULK: THE AWAKENING, a cinematic AI short imagining the birth of a new force.
An ancient city. A silent giant. One bite that changes everything.
Strength learns to climb. Rage evolves.
This is Part 1.
The threat has revealed itself.
The awakening has begun.
...
#ai-creations good place to share
Wondering if there is a way to verify this
Thanks for the info! Let me know you guys feedback in the comments. I tried my cinematography skills blended with AI and the results are surreal.
check the battle reaction
Glad to hear it! Thanks for sharing!
I don't think it appears in order of who did the most recent
If it does, you weren't first sorry to say
hello ! is there a prblem with the website? I can't access it.
What seems to be the problem? Are you getting some kind of error?
@echo aurora
Trying to prove my friend wrong about how AI cannot do first person shooter concepts. Do these two look good?
guys anyone knows whats the limit of gemini 3 pro
its pretty decent but yeah not better
it might be more efficient though. Also it's different and that's nice because you use gemini enough and its speech patterns start to feel a little too familiar.
ofc its not better
its literally trained on claude and gemini outputs
i would be happy if they fixed the known issues that LLMs are facing the past few years... but no.. we can still see performance downgrade on writing giving the coding-writing tradeoff
What should I do? I've been generating for 40 minutes now, and nothing's happening.
Not bad, not bad
Can you try the steps in this article: https://help.lmarena.ai/articles/1645798556-lmarena-how-to-something-went-wrong-with-this-response-error-message
You may sometimes see the error message: “Something went wrong with this response, please try again.”
This is a general error message. It can
it only works on other models
this model is not
Were you able to submit a jamdev by chance?
I was able to repro an error with a jamdev. No need to actually. Thanks for the flag.
Why is the UI so accurate?
Not really but it follows OG MW2's UI really well
LMArena feeding us good
Probably because training data
Definitely not better but it’s a big improvement
Visual SOTA
which one that you made (model name)
I’m really excited to see Kimi ranked
@boreal lagoon Whats up?
i really liked the function abt lmarena that choosed the best ai for you based on the question u gave it
any way i can get it?
its really comfy to use yk
This is the experiment you're looking for - https://help.lmarena.ai/articles/3050693259-lmarena-experiments-smart-router Note that we don't grant access to an experiment because it's asked for. It's something that's going to be random if you get it or not.
We are currently experimenting with a new feature: Smart Router. In Direct Chat you'll be able to select the Smart Router model option. With this
i had it on my mac at my dads house it was really comfy so i tought i could get it on my main pc at my mommas aswell lwk
Also going to ask to avoid pinging our Mods unless it's related to breaks in our server rules - #announcements message
thanks for the info tho
aight my bad thanks
No problem. Glad to hear you like it.
any idea when its going to roll out?
I won't be able to share details about when/if features will be landing.
We'll be sure to share more info when we're ready.
i get it

aighty thanks
No problem. Don't hesitate to reach out if you have any other questions.
yah
Nano Banana Pro 2k
test it guys
what does ir do
wym
is snowflake gemini 3.5
like is it a image generator or vid generator
must've been text, they said the naming is like the gemini snowbunny which is so gooood
When will Claude-Opus-4.5-thinking have been surpassed by a better coding model, and from which lab?
5
10
6
Google Deepmind
I couldn't access it.
which server ks ghat
anyone else getting this problem with gpt image 1.5
You like brawl stars?
why is 3 flash so bad at this bench
even better question is
why is glm 4.6 so much better than 4.6 reasoning/glm 4.7
Kimi 2.5 fixed something I had dealt with for 1 month with Claude opus 4.5
how to get snowflake model?
The heck is that
.
this
is it for battle only?
Ahh it's only on battle mode
yeah
All models using a codename are going to be Battle only
is it text only?
checks out with my experience on flash 3, answers very quick tho.
It's a unknown model
Nope, codenamed models can be in all of the modalities.
oh
well.... it is text only..
so i can enable the auto feature and it will be there?
if the router chooses text for you, yes
Oh that specific codenamed model? I won't be dicussing details about specific models with codenames. I thought the question was are there codenamed models in other modalities.
secret secret models
or just wink
anyway
is it kimi 2.5 now better when roleplaying?
huh
what is fire-bird
i just got codename fire-bird
Meta
suspected to be Llama 5
doesn’t seem too strong though, might be why we haven’t seen a release
are they gonna stop making llama?
probably not? Meta just deprioritized their AI development last year when they lost a ton of researchers in the summer
Still working on it, just with lower interest
so they are working on a project just for fun?
no competition like kimi?
no, probably for money 😂
AI is just a lower emphasis in Meta’s business model now is my point
theres no way they will be competitive for once against other companys, like gemini or claude or even kimi
Anyway Z IMAGE BASE RELEASED
i did saw the anti-ai message bro
well grok is bad
@karmic bough Please check on #1397655624103493813 to learn how to use the bot.
If you go to Search, you can then toggle on Include archived chats
Our Help Center is currently down, but will send a Help Center article when it's back up.
Why isn't GLM 4.7 Flash on the leaderboard
And 4.6v Flash, and Qwen3 VL 8B
Are small models automatically excluded before every update
Especially Qwen3 VL 8B, I've waited it desperately for three months
might not have been testing for long enough?
It's been three months 🙁
How to make product videos ugc type using ai
Hello! Please check how-to-video-bot to learn how to generate videos.
Why is there a perplexity logo on the main page? They don't have any model [on lmarena], right?
Now that Kimi is getting on top, maybe you could replace that logo with theirs
@echo aurora will kimi k2.5 available in search and vision arena?
its search is supposedly top-tier, and a lot of folks are wondering if its vision capabilities actually live up to those benchmark scores.
I noticed.
Hello, im new here just wanted to say hello to everyone.
hi
\
GPT 5.3 Pro IQ
1
2
Hey I've got access to juicy asf GPUs like 4090 h100 a6000 for free and wanna rent them at cheaper than market prices, anyone here have contacts or where i should advertise?
There is a single search model
ppl-sonar-reasoning-pro-high
how many videos we can gen on the site ?
This bench tests and checks how much a model Hallucination and gives wrong answer instead of giving accurate answers from its knowledge. Rejection to answer because of lack of knowledge is also accepted as good behaviour.
Reasoning RL and 4.7 post train prob makes it attempt questions its not confident in more aggressively. Good for benchmaxxing but this score exposes it somewhat.
4.6 had that problem too
Hello
If you have a working payment method you can get it through a openai free trial
I have to pay for the web search tool though.
No?
How?
We are talking about playground trial, right?
It charges for web search tool calls afaik.
No we are talking about chatgpt
@echo aurora
day 1000 of asking LM to return the direct use of video arena 💔
Hello
alright now its fixed
@gilded wyvern pakyu ka HAHAHA SCAMMER
A half of the video is not a video itself but sign of MLArena.What for?I need 12 sec of video to use not 5 sec.
is it really?
actually i prefer the previous kimi versions than this one
at least k1.5 had its own identity and uniqueness
we are still not being able to fix the writing / coding tradeoff
we saw that with gemini and gpt 5.1 & 5.2
anthropic had that in control with opus
I have a question, is therea file to upload files?? I can't upload my pdf file
i found another delulu
Output quality: Kimi 2.5 > Opus 4.5 (Claude Code) > GPT 5.2 > Gemini 3 Pro
CC's multiagent/subagent system is damn good. Format was great too, but it ranks below Kimi because it included irrelevant stuff in the output
lmao
we are giving a platform to such people
Relative to their opinion but there’s no way in hell GPT 5.2 is better than 3 pro
To make a video
What is best chargpt/gpt ai rn?
Benchmark wise Claude Opus 4.5 and Gemini 3 Pro
If you want my personal take I really like DeepSeek, Kimi, and Gemini
i mean chatgpt gpt or openai
Oh
GPT 5.1
They fixed all the issues with 5 in 5.1 and then they screwed it up again in 5.2
OpenAI try to make a good model challenge impossible
because leaderboards don't really show anything
maybe votes, but i don't know how sort it with fixed time period
also not sure all votes are human
hes still in the honeymoon phase with kimi 2.5
like this version is just so bad
what is gpt-5.1-search-sp
Opus 4.5.
Opus 4.5 is a complete AI and without serious defects, like Gemini laziness, or exaggerated ChatGPT protocols, the price is a bit expensive but as it uses few tokens it pays off, Opus 4.5 is definitely the best AI currently
if there real Opus 4.5 on lmarena to chat or there something no?
I do anything, code, documents, conversation personas, corporate use, assistant, agent use, image prompt, laws, even have fun and etc. For me there is not one that comes close to Opus 4.5
i just want you to know that k2.5 is bad
Is real
thinking or just opus 4.5 one?
Thinking
My second farvorite is glm 4.7, it has a very good overall, equivalent to sonnet 4.5, but with the code plan it is almost free, The only thing that annoys me is the thinking that I think is exaggerated, but that's okay it's just a 355b model
is it? it's right after gemini 3 pro on other leaderboard (AA)
I fell in real use the glm 4.7 still better that kimi 2.5k, this is just benchmaxx
glm better
glm 4.7 is okay if you ignore thinking time
but deepseem thinked long too
But the glm server allways crashes lol
their site is free
I hope the glm 5 or 4.8 will don't have a bigger thinking
What tasks are you testing it on
general
health / science / daily tasks
coding
i tried it on everything
Do you have a prompt that makes the difference obvious
wym?
im just asking it the same questions im asking other models
Like a prompt where K2.5 is clearly worse than GLM 4.7
i asked it to give me best ways to fix fps in games and it gave me outdated info
Ah, a specific game, or in general?
yea in general
i gave it my nvidia series card
and it gave me some old recommendations
I wonder how they compare when search is enabled
i guess it would be better
but i dont want that tbh
i was testing its raw capabilities
any model with search ON will provide better results
@wind ember for experiment
is info updated if you ask https://www.google.com/search?udm=50
(for same question as for K2.5)
yea because it uses search
i dont want to use search
i want to test its own knowledge
and info retrieval
No it’s not
It’s great
Efficient as hell and way better output than GPT
hi
its so bad
its not great
maybe its not really that bad
but its not great either
Are You sure i have many better ai then this and currently free
Ragebait
Hey at least agree with me
Better than GPT
It’s FAR better than GPT
is gpt 5.2 for coding better than gpt 5.1??
I’d stay away from it personally
But it’s supposed to be better
k
5.2 is still extremely schizophrenic though so watch out
https://x.com/chetaslua/status/2016538256708645330 anthropic will cook again
hey guys is that error
i give lm arena to ptompt to create a specific video but some thing trouble Video is started genrating and can't genrate in a while hour is that error
Hey I'm seeing the same, I'll be reporting this
Sorry for this! The team is aware.
Same here.
Think the website just crashed
Things are happening 😄
Oh good so it's not just me
Check out this Help Center article - https://help.lmarena.ai/articles/2669202654-lmarena-how-to#will-my-chat-history-be-on-the-new-site
The site moving from https://lmarena.ai/ to http://arena.ai/ may result in some problems accessing your chat history. This Help Center will cover
wow
huh
Yo where tf is all the chats????
its just my test on % droprate its fine
Site is going to have some struggles at the moment, I'll put out an announcement in a bit.
I was wondering this as well hehe
If I start a new chat right now will my old chats fail to show up?
I'm concerned about that
I scrape all my chats for that reason
Same my old chats
I have thr same problem i cant login with My Google account
Old chats had very important things...please can lmarena migrate the old chats to arena.ai
please 🙏🏻
Yes but i would prefer a working site
fr
The site moving from https://lmarena.ai/ to http://arena.ai/ may result in some problems accessing your chat history. This Help Center will cover
i think the old chats are gone
:(((
i lost my chats
the new era.
my chat history is not showing
i can open old chats with links I have saved
This is the beginning of the end...
I cant accept the terms of use
can't login some pink error message comes
wait themes is on lmarena or I'm just tripping?
CANNOT LOGIN
refresh and changed
Does anyone have the arena working on PC? It works on phone, but it doesn't start on PC
I can't see my history
BRUH I LOST ALL MY GODDAMN PEAK STORIES
in mobile
same
is lmarena down?
atleast the server didn't get rebranded yet
i cant accept
can i ping pineapple?
Connecting to Arena has failed. Please try again later or on a different device.
can't login using Google account
Me either
Cant login
why my chats are deleted
for me i have account and my history is gone
what the barnacles
i used the link for restoring the chars but it's giving me an error 🫠
my too
i get an error when I tried to relog in
please engine server
site is cooked for now we'll just have to wait
@echo aurora talk for this
same
@echo aurora bro, turn on the server, please)
um you got a problem
history is now backed up
idk who caused this
I ALSO CANT ACCEP TOS
in ours it;s working
yo it's working
nevermind, chat history is back
And your chats?
i got logged back in and have my chats back
THEYRE BACK
ayo all chat is back
at least chats are back
phew
"Sorry, you have been blocked": I cannot access the website. Is this normal?
IT's BACK thanks @echo aurora
All good...worked on Windows 11 browser, I think only in mobile browser didn't work of migrating old chats
guys
should we riot if they rebrand the server too
resolved
yea ty pineapple
Migration works via Pc browser
btw /j
To get back My chats back i have to sign out and log in again?
maybe try
For a sec, i thought i would lose all my chats...
ill suck your toes pineapple thank you
hah
Interesting
hahahahah
I cant login thats the problem
what
first to say hi to me gets literally nothing
Hey everyone - we're working on a lot of things at the moment. Apologies for the delay in getting a response.
If you're having problems accessing chat, please read this article - https://help.lmarena.ai/articles/2669202654-lmarena-how-to
The site moving from https://lmarena.ai/ to http://arena.ai/ may result in some problems accessing your chat history. This Help Center will cover
what do yall use lmarena for tho?? just curious
Yeah I recovered my chats now, all good fam
i am still waiting for someone to say hi to me so they can get literally nothing
for making some craziest
@echo aurora Was the error fixed last time?
Has the battle in direct mode experiment ended in the new update???
gemini 3 pro with very high limits
examples? 😂
it's work
no like what specifically do you use it for
for making a anime movie or smth xd
My research. So I can bounce between models
asking questions about tech, windows, games etc.
Damn i was not expecting this change
didnt the video gen just come in recently?
and also sometimes some curiosity/knowledge questions
chatting
researching and coding
@echo aurora by the way, I suggest adding a file upload button so that the AI programmer can easily read the chat without cluttering it. Additionally, you can upload coding photos to showcase your desired style
i thought it could only generate 8 sec clips
well i have sora 2 subscription it's make eaasier and i have a full studio xd
It started working without registration.
damnnnn
ikr this UI change is actually rly good, the light mode looks way better now also.
My history stayed. Interesting. Let me make sure my scraper still works with the update and if it does I'll post the github link
imma need your help with something 😔
lm arena is good for generation but in image to video 👎
what do you code ? have you published any
#addstopbuttonnotnewmodels
Use in PC browser
it's baldi's basics mod and it's not done yet
it will work
no messages for 5 seconds? dead chat
see this
I mean migration of old chats work in PC browser
direct chat in lmarena?
xd
i need Direct chat em arena video arena
fr?
I tried login via mobile browser migration didn't work but in PC it worked
Announcement!
https://help.arena.ai/articles/2669202654-lmarena-how-to
The site moving from https://lmarena.ai/ to http://arena.ai/ may result in some problems accessing your chat history. This Help Center will cover
and I didn't made any in public
woahhhh
@echo aurora
<@&1349916362595635286>
dm
can i see more of your stuff
Thank god
sure buddy
They are a little late on announcing
Am I the only one having trouble logging in?
i make a fully trailer using lm arena
Now working in mobile browser also... migration of old chats just at new login
xd
ohhhh icon updated?
I don't think so, have fun!
Url also changed!
Lets not ping mods pls
pls arena,i need direct chat in video arena,my dad is kinda homeless
Announcement would be good.
lets add stop button, we need this, my sis is kinda homeless
Ok
i live with my brother
One has to admit, serif fonts are indeed quite elegant and do match the icons well.
Wtf LMArena is now Claude?
see this
no, Im talking about a model error, like when they stop working
please speed add stop button and edit button
my mom is kinda homeless
i live with my brother
Its a little visual glow up
Nice , but i think now its timeto fix technical stuff. :p
If those are still same here
Wait why
yeah wth it literally looks like Claude website now
we got new ui but why we can't get stop button
god I haaaaate the captcha
We all do hate the Google captcha
vote
what's annoying is the captcha doesn't work, you click on all the things and its like try again
well its kinda hard to miss
Its just a little visual glow up
it was a bit predictable because of canary.lmarena domen was changed to canaryarena
Thw heading needs a new font forsure , no readability lmao
WTF is this????
lol for a minute I thought lmarena was maybe bought out by Claude
You all having a good time? lol
i having bad time tho
Yeah
nope i cant send any messages in the chats
I'm just... Really suprised
And now idk what the icons mean 😭
Is it just UI update + rename? Nothing else changed?
I'll be putting out an announcement soon, expect some oddness with the site right now. Team is all hands on deck. Lots of moving parts.
fix the rate limits its annoying asf
please stop button 😭
Why do y'all decided to do this rebrand?
Probably because they got valued at like 1B$ so they could afford a new name
oh dear god this looks awful
lmao
claude fr
A billion????
what the hell is this
OMG lmao
Arena?
I kinda like this classic style vibe
guys someone editor here
Yeah, at first sight I thought I went into the wrong website
old ui wayy better
Claude Arena, finally
GUYS DON'T PANIC, IT'S JUST UI CHANGE
EVERYTHING STAYS THE SAME
arenaai????
nobody panics
This is a little confusing but lets get this clear guys,
You just need to click this recover button to get yout account logged back in !
No need to type and loggin again from lmarena.ai to arena.ai
this is so weird
Nope, my chat is all gone
keep the old name please
COOOLLLLL
i beg you
mine too
make the charging handle longer
I kinda like this new UI better
@echo aurora I actually kinda like the new UI
new model selection is not good 🥲
its just claudes
yeah, that's why I like it
no I keep failing captchas… am I a robot!?
real, Arena is something out of direct meaning
while LMArena is directly Arena of LM's
Bro the popup
Has the profile picture changed?
I hope you guys dont get sued 😭
lmarena is now arena?
better domain ngl
xiaomi phone lol
GUYS I WILL REPEAT!!!!
NOTHING MAJOR HAS CHANGED. JUST THE UI AND NAME
I want a ASUS ROG for replacement 😭😭😭
ok i saw my chats came back
they've got a better domain
😢
you will be missed
🗣️🗣️🗣️
I WANT THE NAME LMARENA BACK. 😢 😭
I liked that name more
Maybe call it the Pantheon or Colosseum
I mean as long it's usable I don't mind the change...
I would go for the Nubia gaming phone since it has a dedicated fan for cooling haha
Today i was literally using claude and thinking this is such a nice ui and here we go , same exact ui on lmarena, my wish got true.
Nubia???
movement labs previous ui was also claude styled
lmaooo
claude style is just superior
I dunno
I dont dislike it or anything. The font and differences are fine
But i kinda liked the other darker coloring more
Nubia Redmagic
can you upload more than images now
2 times the charm, 2nd rename now.
what was it first called
At first it was just a part of LMSys.
i acutally didn't know that
Bruh its just me or the UI look like Claude?
Actually Announcement blog!!
https://arena.ai/blog/lmarena-is-now-arena/
last chat with lmarena
please bring back smart router ts was so good
it used to be on this site https://lmsys.org/
LMSYS Org, Large Model Systems Organization, is an organization missioned to democratize the technologies underlying large models and their system infrastructures.
LMArena RIP 🕊️
lmsys still a thign
the video bot is still named LMArena
well, google still thinks Arena is LMArena
its still here
There also used to be a different Discord server. Back then the Arena talking-ground was only one channel in the LMSys Discord, if I remember correctly.
maybe A/B testing cuz its not available on my end
There's announcement yt video too lmao
https://youtu.be/TNoAlMv4Eg8?si=d86SArLb6yQ8sdLE
Try Arena: https://arena.ai
LMArena has evolved into Arena—a name that reflects our origins and our mission to measure and advance the frontier of AI for real-world use.
What started as a small PhD research project has grown into a platform powered by millions of users worldwide. This rebrand was shaped by you—our community.
Learn more ab...
fr
i liked the old logo more
how to fix this? when i click cloudfare captcha it just doesnt work and says this
and then i get this
relogin to yur account
I do have to say, the quality of conversations here has deteriorated in comparison to prior times, during the times of GPT-4 and o1-preview.
New design is good but old one was pretty much fine and icons much more unsimilar
Hmm
its annoying
@echo aurora no busses left, right? SO WHY. WHYYYYY?!?!?!
its just captcha being annoying
u cant solve those
youre a bot
and then this happens eventually
the bus
Now its arena blogs.
amazing news!!!
Claude 4.7 when
arena
