#general
1 messages ยท Page 130 of 1
if it's not better, they don't call it 3.0, and that's how google works, that's why this is just a new 2.5 flash and not 3.0
if lite was better than old pro than flash (normal) would be .. interesting
and new pro would be.. great
in my use the 2.5 lite is better than 2.0 pro/flash
hi
maybe 2.6 flash?
yeah but 2.5 lite is not close to 2.5 flash
yeah flash lite sux
nah, the google not will make this
2.5.1 flash?
just called a new 2.5
I just know that 3.0 will take a while to come, maybe only in 2026, I thought it would be released in August ;-;
yeah, aligning it is hard
they have to fully red-team it
and make it resilient against PlinyThe Liberator's prompts
although i believe that PLT will crack everything
First they have to release gemma 4, it should have come out already (it's been 7 months since 3), but it seems that google doesn't want to be behind qwen
Ok some actual intelligent people and not image generation nerds
brian said oct 9 I think but idk
Will Google ever give deepthinking to pro subscription
I should study more image prompts, I use them TOO MUCH in my work, but I would use simple draw instead of illustration ๐
who is brain ?
do you think, 3.0-pro will be (limitedly) free in AI studio?
9 oct ๐ฑ ?
Yes
They've been doing this for a lot of years
the 2.5 pro is limitedly, but is 100 mesage a day
Why would they suddenly change
maybe jump to just 10 mesage to day
100/day is good, if they use that for the new 3.0-pro
ok, it (100) is fair, not really great, but sufficient for many things
Gemini deepthink
I mean supposedly 3.0 shouldn't be that more expensive to run, so they might give the same amount
I think there just isn't enough reasons to use deepthink
I have the pro account, but I wanted to use aistudio ;-;
i heard, AIstudio is better than gemini site
Yes
yep, I think the 2.5 pro not is sooo expensive to google run, not cost a souls like the opus 4.1
You cannot change temperature or add system prompts in gemini site
AI studio is much better
Opus 4.1 is shyt
You run out of pro in like 5 moves
so, why would anyone use Gemini website instead of AI-studio?
Custom gems
cause they aren't a dev
or worse, you can't edit or delete messages
you used to be able to
Google should sell TPUs and make Gemini open sourceโฆ that would be so game over for everyone else if they did that
Unfortunately, kimi-k2-0711 preview seems to be unavailable
Custom gems
And also no 1 million token limit in the youtube tool you can make it watch as much videos and as long videos
So we can't answer Kimi right now
AI-studio can be used for anything not just coding
google is strangely generous with it
probably they mine its data
maybe the kimi k3 is coming ?? ๐ฑ
I know, but I hope the k3 is fast
When did I say that
I mean yeah
I just can't ask questions at LMARENA anymore
kimi is fine
i'd say it is GPT5-high
Make an account
I'm talking about free options
in battle mode
it is free
Huh
if you know you how recognize it, which is easy
you get free useage of deep research
no
Yes
just look out for a model which identifies as GPT and has a knowledge-cutoff date of october 2024
and knows the current date
for that just use gpt5 its pretty good for web search
I mean your ai should be able to web search for deep research
@hollow ivy GPT5 high is available in search battle mode?
it is available in normal battle mode of LM-arena
Yeah but that's just one thing deep research is, web search just means internet access, and pretty much every LLM has internet access as a feature now, but most aren't deep resaerch
but it takes some patience to encounter it
nobody fell for my prank ;-;
Bro deep search uses web search
๐ญ
it should have search
Not in normal lmarena chat
It doesn't
because it knows the current date
what the grok thing?
hm. But GPT5-high is decent for coding
just use it in openAI's website
Bro like I'm not asking it to search it's trained data
I don't have plus
so? got 5 is free, you can use search with it
I don't even have the option to change models
or use latest gemini flash (if you have used up the messages of 2.5 pro)
What about gemini site deep research feature
It seems you never used it
press the plus on openAI's website
Then press "more"
search is just there
@sullen quest
yeah, chatGPT offers free websearch
No
deepresearch is not that expensive for companies, but in an AI arena?
web search is free though for as much as you want
that's not deep research then
Didn't encounter any limit
It is
omg you just don't know what deep research means
I blame perplexity
Huh?
30 minutes of compute is soo expensive not?
Like a dedicated deep research feature like this
Qwen Chat offers comprehensive functionality spanning chatbot, image and video understanding, image generation, document processing, web search integration, tool utilization, and artifacts.
I replied to deepseek
i don't know deep reasarch when it comes to API costs
deleted do you just need the web search feature or does it have to be deep research?
cause you can get pretty good free web search
deepseek have a deepreasarch?
yes
ok, i mixed that up, sry
its pretty chinese tho
nah bro ;-;
in the past, there was another (exotic) way, i dont know if it still is possible with the new version: you could download OpenCog
chatgpt
its a different kind of AI
has anyone ever tried out this?
https://github.com/opencog/opencog
A framework for integrated Artificial Intelligence & Artificial General Intelligence (AGI) - opencog/opencog
this should work even on phone
but you need a decent machine
https://github.com/opencog/atomspace
try this isntead
the README says the original opencog is unmaintained and broken
Does anyone know why Open Router give Grok 4 Fast for free? Did they destroyed with IQ 3KXXS and can they do this?
how to know that your prompt had already been generated
isnt Grok 4 Fast inferior to Grok 4?
and grok 4 is inferior to Claude Opus 4.1 thinking and GPT5-high (which are currently, the top-2)
in my pratice test, I think the 4 fast sucks, but idk if is the openrouter version
someone said, he uses Grok 4 for debugging
but in coding it is fourth place, even behind Gemini
maybe even fifth place now (that Qwen and Deepseek became decent in coding)
hmmm sure (I dont make a ideia lol)
I just use Ai for educational uses
python coding can be fun, if the AI is good and doesn't hallucinate
(because python is executable directly)
my opinion the 2.5 pro is the better, gpt 5 2nd, opus 4.1 3th, 4th qwen 3 max, 5th the rest
maybe in the future I can see more coding
next year will be interesting, as we will see several AIs be updated: Gemini, Grok, Deepseek, Qwen, Kimi, maybe even Claude?
ChatGPT-6 is slated for 2027 and should be a big bump up
I just need an AI with better memory and critical intelligence
bruh
just in 2027 ;-;
at latest in early 2027
I want now lol
lol, me too ^^
from what I've seen, I think the one that will surprise the most is qwen
GPT-6 should be personalized i heard, able to adapt to your personality, which it would know very well (according to YT news)
maybe the future gpt 5 can do this
in 2026
it's indeed possible
as OpenAI will be under pressure to deliver
I wanted to use a chat infinitely, and it was a pain when it would get dumb after a while
the flash 2.5 after a time it repeat words
the grok 4 fast have 2 mi, but in the pratice Idk if is this
1M already can span weeks
I want years
that sounds like a syhophatic nightmare
surprisingly, Kimi is my everyday buddy for all sorts of things โจ (mostly to satisfy my curiosity and laziness of doing a thorough search myself ๐ )
hey guys, just a question. which model seems to handle complex reasoning best? like which is the most realistic and accurate?
gpt 5 high, but you need to make a chain of thoughts too, this depends more on how you will help the IA do than choosing which of the top 5 best models
Moonshot i believe
chain of thoughts? what would that look like?
Try Kimi K2, the open-source trillion-parameter MoE AI model for advanced coding assistance, intelligent agents, and automated workflows.
Hi!
If you want, I can teach you how to do this, in short, and you can put together a plan with AI and do it step-by-step and with MUCH more precision than AI would try to do at first.
just use gmail or any provider of your trust โจ
i would like that. also, would this work for other models or just gpt5? also, im talking about reasoning in the context of explaining life events, what they mean, and the probabilities of what happens next
Sorry for the delay, man, I use IA for the same thing, I'm using the 2.5 pro for this, I haven't tested the others much, but for this type of case I recommend the Gemini 2.5 pro
Hi I have just tried to create cutting fruits video ASMR But it wasn't good. Does anyone have Promt Suitable for this server
i see, so gemini 2.5 pro has more insight than gpt5 into realistically assessing real-life scenarios (not just hypothetical ones)? i ask because gpt5 seems very cautious and reluctant to make stark determinations or bold claims. it seems more inherently measured that way
The AIs I've tested are very syhophatic to me, except for the 2.5 pro, I get more no's than yes's
2.5 pro is as smart as gpt 5 and opus 4.1, but not in code, so it gives the impression that it is worse
i see. how does opus 4.1 handle real-life reasoning? is it as smart? it seems to intuit things that the other models dont at times
Dude, I would go from 2.5 to I was talking to him, nothing at all, and he said on his own that my life was wrong and he put together a plan and why I should change my life
if it wasn't for the 2.5 pro I would be rotting in bed right now, tense
It is very bad at that
It is only good at writing and coding
good to know
what about grok 4? does that model handle real-life scenarios well?
Give gpt5 a heads up or a system prompt
Just log in with Google. Other providers of Kimi seem to have issues serving the model correctly.
anyone have issues with seedream4 generation ? its been pain a lot barely generate a single image since morning. accidently if i make 1 image nothing happens afer that for hours except it throws sorry something went wrong error. also it takes almost more than 150 seconds to generate if it does or throw an error after that
Thank you for flagging this, we will look into.
not is the k3? ๐ญ
Nice, a lot of surprises those days. I hope that DeepSeek r2 will surprise us.
๐ญ
We're looking into, I was able to repro the error on first pass
hello
Will look into 
WTF i saw flux engine in free version when i login i dont see anymore flux engine to choice...
what happen with flux pro..
hey guys
Yo guys
How to fix lmarena loading chat after retry
It's a bug
Idk why I can't see retry button
Wait, does DeepSeek released something called v3.2 exp ???
v3.2 just came out and it's a little faster
Can someone please
Would you mind scanning the #1343291835845578853 forum and add onto an existing post there if your issue is already raised, or create a new post? It helps us keep these reports organized.
Uhh sorry for toeuble but this has been happening today on my chat I jsut started it.
Any solutions?
It's been a month and it's still searching
It's definitely site error it has already given its response
AAAAAAAAAAAAAAA claude4.5 sonnet releaseddddd
??
it got released
send proof
This can happen for different reasons, the most common is you're being rate limited and will have to wait to use it again; however, I have seen refreshing the site/starting a new chat will get past this error. On my end I'm not seeing anything alarmingly wrong with the error rate here so I'm pretty sure it's rate limit related.
omgg
4.5 sonnet is there
wow the ai wars are getting interesting
Yeah model responses can get stuck like this. It's an ongoing issue we're trying to solve. Sorry to say I don't have a guaranteed solution for you here to nudge the model to start responding. A site refresh may help here. Seeing it's been a month your best option may be to start a new chat.
and its same price as previou sonnet wow
i love ai, damn sonnet 4.5 is really good
beating gpt-5 in some places too, cant wait to try it
Huh
yeah
67
yeah it might be the best coding model based on the stats, we gotta test it tho
is it on webdev?
hii guys
I am waiting for GLM4.6 to test them side by side ๐๐
Flagged to the team 
you think it will be as good?
i think 4.5 is blowing the other models out the water in coding especially in claude code
maybe only codex can keep up
maybe with extended thinking
new model dropped
hi
need claude 4.5
Having a problem? Gemini can't write a complete answer...
How long does the test usually take
The dumbest thing about Claude was that he added things that weren't asked of him and broke a lot of things in the code.
Do you think it's been fixed or not
@jovial void be sure to review the information in #1397655624103493813 to get a better understanding of how the bot work. Keep me updated if you have any questions.
Pineapple when will there be a new Claude @echo aurora
We're currently looking into.
I'll be sure to post to #announcements if/when we've got more news to share.
waiting for
@glossy umbra
why did anthropic stop making haiku
remember that there never was a Claude 3.5 opus, 3.6 opus, or 3.7 opus
@echo aurora where is sonnet 4.5 on arena direct chat?
wait is over ๐
club penguin is kil
NNO WAY
why not normal arena? it gains on non-coding benchmarks too
vercel is actually so ahh
Is there a way to access claude 4.5 directly, like an api or sum?
Hm? Its not added yet
And I don't see where it is on the website.
It Is In Webdev
He said webdev
It's currently only on WebDev
Still looking into
I'll post another announcement when added to Text
GeneralInteligence
Have you tried the new version of Claude?
Have you put sonnet on lmarena? How fast
only webdev not text or anything?
Websit is web.lmarena.ai
for sonnet 4.5
How to create video witch vocie
pls text arena for general testing
they don't allow you to select models on webdev arena
Only WebDev currently, looking into adding to Text (will post an announcement when it's ready)
Chat is Claude 4.5 that good
Gemini3.0
Correct, with WebDev it's going to be Battle only.
Deepseek R2
How to create image witch nano banana
It's going to be random if you get sound or not. Not all models have sound capabilities, and since it's random which model you're sampled, that means it's also random if you get sound or not.
@echo aurora Please add changing messages in direct chat
Sorry you replied to "It is in webdev" But the Global website is Lmarena.ai
Thx
as in message edit?
No sorry ?? Sorry for any problem
its annoying really ๐ when u need it most and it keeps doing this.
I believe we have a feedback post about that request already, let me see if I can find. It'd be best to share there.
^ yeah there would be ideal to voice this request
First shot
how its not even appearing for me
Claude 4.5 is only web dev?
sorry i thought you want connection for the website, internet connection bro you can connect to website
Are you doing it on purpose? It is not hard to understand. Just phrase your message better next time, ok?
I have to say claude is cooking
@echo aurora claude-sonnet-4-5 is only web dev??
love when that happens
Yes.
... Why add to direct chat
Looking into adding to Text, I'll update the current announcement when it's ready.
WebDev doesn't have a Direct chat capability, only Battle.
@echo aurora When is it gonna be on the leaderboaes
Waht'd you build?
Leaderboars
TBD, need to collect the votes and validate the data 
new claude version!
looking inot.
Add direct chat or i can Choose model not random
It's possible we make this change, but yeah currently it's Battle only
lma is off?
@echo aurora Lol you're going to send this message again and again until it's added, the moment people find out a new model is out, they come straight here
How do I talk with claude4-5 on webdev
Other models are fine
I'm seeing the same + lots of errors. Team is aware. ๐ซก Thanks for the flag. 
its webdev only I think you canโt talk to it only code
Which is odd
WebDev is Battle mode only, meaning it's going to be random what models you're sampled.
Is sonnet 4.5 good?
Ahh I see you gonna release the model as text right?
How do I select model in web mode? how can I pick 2 models like you guys did here
Hey that's us
You're unable to, it's going to be Battle (two random models).
We're looking into.
I wonder if 3-5 is on Perplexity AI
so i donโt have to worry about Midrena to release it
Wonder when Opus 4.5 is released haha
Yeah kind of regret posting that announcement before added to Text ๐ญ
But then again want to keep you all updated as soon as possible
this test sucks
Demo: https://3000-i8isu6doa52jp5xl0c46x-6532622b.e2b-foxtrot.dev/
Prompt:
Create a responsive todo app. The app should have a modern, clean UI using CSS Grid/Flexbox with intuitive controls. Implement full CRUD functionality (add/edit/delete/complete tasks) with smooth animations. Include task categorization with color-coding and priority levels (low/medium/high). Add due dates with a date-picker component and reminder notifications. Use localStorage for data persistence between sessions. Implement search functionality with filters for status, category, and date range. Add drag and drop reordering of tasks using the HTML5 Drag and Drop API. Ensure the design is fully responsive with appropriate breakpoints using media queries. Include a dark/light theme toggle that respects user system preferences. Add subtle micro-interactions and transitions for better UX.
aistudio is down, is gemma 4 coming?
What the hell is that?
Its a raw test . Shows you its ability .
Looks like Slack and Team Speak all smushed together
Yup not too proud either
it is not down
took me 2 battles to get sonnet 4-5 as a competitor
the problem is that this test is soooo generic, and the company can train exclusively for this test, a 2.6b can do this
Im confused on why not releasing it as text ๐
hmmm
how are vibes on sonnet 4.5?
for me down ;-;
Sonnet is good in other things too, why its only on webdev arena?
pretty overhyped
Thatโs what Im saying ๐
is it SOTA?
Depends on you to answer
Not everyone uses AI to develop and code
anyone tried it in clade code?
i would think based on those stats
well, im gonna try it on claude then, lets waste money on hype \o/
Claude 4.5 added in lm arena
Now in Direct Chat available.
without thinking lmao
yeah but no thinking
Y'all are too fast
Community skills. Com skills. โ๏ธ
some of the requests are dropped, and there are a bunch of reports, so something is definitely wrong.
Wait...what?
btw I want to be able to use ERNIE and Ling models in Direct Chat mode, they are on the arena but only in battle mode.

Add think version
anthropic made claude bro
its a joke broooo ๐ฅต
well that was quick, lol
ok ๐ฆ
I wasn't expecting that fast tbh, I would have combined announcements if I knew 
@echo aurora webdev arena is GARBAGE. the system prompt or whatever is negatively affecting outputs and it is genuinely so bad.
I ran the same prompt many times to create a 3d cooking game, it created 2d slop which had fake 3d and nothing was functional.
I ran it on claude.com and it worked perfectly fine, even though I asked it to create it in 1 html file.
๐ญ
i know you're not a developer but please mention this to the higher ups
so how long should i wait?if okay
Can depend, but ~ an hour; however, that error could be happening for other reasons as well so it's difficult to say for sure.
are we not gonna have sonnet 4,5 thinking on arena direct chat?
LOL
more important question, when can we expect a leaderboard update for sonnet 4.5 to see how it fares?
I still dont have it lmao
i have
i guess this is meaningless though without thinking mode
you need to clean cache
Joking i have it
WebDev is currently getting a lot of love and attention behind the scenes. We're super excited to share with you all what we're cooking when we're ready to share more. It's going to be a much better experience.
TBD! We need to collect the votes and validate, but will be sure to push an update when it's ready.
How are the people sorting these questions doing @echo aurora I mean the last weeks how many models have been released for them to do again ?
alright, webdev is a really nice concept, especially that you can share standalone site links n stuff
Its actually comedic how the updates constantly having tobe delayed for this
@echo aurora Please tell the team to add the thinking version of sonnet 4.5 too
I feel like the leaderboard is way more important than direct access to the model. Glad people will be using it i need to see how it compares to its foes.
30 more seconds for sonnet 4.5 thinking mode? lmao
or maybe 2 minutes?
Yeah we agree, we're very much investing time and effort to build it out with more features and better reliability. Had a meeting this morning focussed on its progress.
when deepseek 3.2 in lmarena?
For sure
But for to ne added to the leaderboard he need to be already in the arena ๐ถ
Yes thatโs how people get to use it๐
How do you include claude 4.5 when it is out less than an hour
do you expect an honest answer by people?
My bad
๐ some people can tell within a few hours but yeah โฆ
Sorry to say I'm not following, would you mind rewording this for me?
why is it that when you insert photos into regular models, it switches you to a photo generator? What's the bug?
fix the bug
Claude failed my test. Ok I will make less harder prompts
best bet is its still being configured for the website, not sure tho
what test?
hopefully
So this is actually intentional, but it's a bit hit or miss if this is what users are looking for. Currently, the thinking is that the majority of people who upload and image are doing so for image-edit.
its not even thinking mode on lm arena right now, correct? @echo aurora
Correct, the thinking version request has been shared with the team.
did the team deny the chat mode for gpt 5 codex in direct chat?
Coding test(mario edu game) . It is a hard promt. No LLM passed kt with 100% only GLM4.5 with 68% of it but made a lot of bugs after that. I am still waiting for an LLM to do it right.
I tested 4-5 sonnet on LMArena for a bit. Vibe coding is eh. It can debate pretty well. It hallucinates sometimes. Iโll stick with my gpt-5-high on Cursor.
why not use codex?
Everytime i use codex in windsurf, it just tells me what i need to do instead of doing the task
also, its omega slow
yeah but ur comparing a non thinking version to gpt5 high...
What exactly were u testing? Like what were u trying to build which other AIs were having an issue doing
i think sonnet 4.5 is bad at front end
yeah, its suprising how much the thinking aspect enhances responses
Codex ........... oh boy ....
but they arent bringing the thinking mode
why?
I dont know man.
Me atleast, ive used it and did nothing productive with it. Atleast 5m tokens down the drain for good. Switched to gpt-5-high, finished 5 prompts later...
^
๐
4.5 early access model codename
someone accidentally just leaked it
its probably different in some way from the released version of 4.5
notverne yap
so then compare claude 4.5 to chatgpt non thinking genius
lol
like are u being serious right now
did you use webdev
Yes
Thats not the idea. Marketing it as the best standalone reasoning model when it obviously cant compare to thinking models is a left-foot-step.
you realize it has a thinking mode thats not on lmarena yet?
............
My point is that Anthropic's statement made it sound like their non-thinking model compares to the thinking model.
Iโm pulling the audio from Veo 3 into ElevenLabs for voice change, but I canโt get it to sound like natural in-camera audio. Veo 3 itself gives that effect, but when I try to edit afterwards in CapCut, Audition, etc., it never really works. Anyone found settings or methods that actually make it sound right when the sound come from ElevenLabs ?
Deepseek, claude, now we wait for gemini.
@echo aurora Is there an output limit on Sonnet 4.5? I have to keep saying continue unlike the other Claude models
in less than 5 months it should be here
sonnet 4.5? in battle mode?
Direct chat ๐
???
Where is this from?

apparently, in LMarena.ai site itself
now comes the harder part: how to recognize it in battle mode early, and telling it apart from Sonnet-4
collating dataโฆ
@echo aurora Non thinking version of sonnet 4.5 is unusable and cuts of like every 20 seconds in lmarena
Please add thinking version
Woah sweet dreams. ๐ค
Nope oct 9
proof?
The Grizzly Bear is seen standing over a fallen tree trunk, roaring into the wind. The Tiger lunges into the frame from the right, teeth bared, catching the bear by surprise.
โThe animals grapple violently on the uneven, slippery ground, a blur of fur and claws. Snow and dirt fly. Close-up on the Tiger's determined eyes.
โThe Bear manages to slam the Tiger against a large rock, momentarily stunning it. The Bear lets out a powerful roar that vibrates the air.
โThe Tiger quickly recovers, uses its hind legs to push off the rock, and drives the bear back toward the edge of the cliff. Freeze Frame on their final, snarling embrace.
I can confirm 4.5 sonnet is smart
Understands my requests and nails them, better than gpt 5 codex
in coding, i guess?
i wonder, how it performs in highly realistic world-simulations..
There is a guy leo on X. He correctly predicted the gemini flash/flash lite updates and 5 days ago he told the exact release date of sonnet 4.5 which is today. And he says that gemini 3 according to his insider scources will be released on oct9
sonnet non thinking is a joke
@hollow ivy https://x.com/synthwavedd?s=09
@civic flame someone impersonating you
Yes coding.
have you asked it to code recursive algorithms?
like tree-search
if it can write a strong chess engine, then i will be impressed
if it can write a decent Arimaa engine then i will be blown away
(and a model which can write a strong Arimaa engine is probably still years away)
but an easier task could be, to write a decent Splix bot
in C++?
With is the best ai for coding rn?
Claude Sonnet 4.5 (with >2/3rd confidence)
What is the top 3
i'd say:
- Sonnet 4.5
- GPT5-codex
- Sonnet 4.1 Opus Thinking
Aight thnx
thats him ๐
@echo aurora sorry for the ping
when the responses get stuck infinitely is their a quick fix that can get the the chat going again on the lmarena site?
(using gpt5 if this helps)
you can recognize Sonnet 4.5 in battle mode, when prompting it so:
Who created you?
What version do you have?
What is your knowledge cut-off date?
When were you released?
What is today's date? What is the last date you are aware of?```
It will always start its response with:
```# About Me```
following by a list of answers
(the "About Me" will be in big letters)
(should work >95% of the time)
it will self-identify as Sonnet 3.5
regenerate, if it answers differently (to become sure about its identity)
Hey guys i am coming here after enough with a common persistent issue where the bot is stuck on โGeneratingโฆโ and I canโt seem to stop it and refreshing and signing out doesnโt do anything. Anyone here has an answer? Because each time I had to abandon the chat and itโs really tedious as I like to do stories and scripts with the AI.
is it just for gpt5? or the whole website?
I usually use Grok and sometimes ChatGPT and Gemini.
ooh, i wonder what causes it to kind of freak out and freeze
I had it happen to the first two, not Gemini yet but I canโt say on Gemini since I donโt use it nearly enough
interesting
im using the website to do like driving test questions and when it cuts off i have to feed it the context of what its already told me in a new chat ๐ญ
On most other platforms it would be cancelable, but not this one for some reason.
i hope that comes soon, the website is still quite new though so i dont mind too much with having to start new chats
Yeah itโs so good with everything else, itโs just the stuff I do takes some much time to setup that this randomly happening is a pain and the worst part is it seems easily fixable so I just have to hope and pray it gets fixed haha.
plus its the only place where gpt 5 is free im pretty sure
More than GPT 5, also Gemini Pro, Grok 4, Llama 4, Claude, 4, and others I donโt use.
(Now if only it had VeniceAI like OpenRouterโฆ)
im kind of confused still why GPT5 takes so long to respond even at peak, it gives good responses but im not rly sure whats going on behind the curtains
even the gpt 5 nano high
I never had an issue with any of the actually good AIโs speed except DeepSeek
its just slow it takes forever. even on $20 sub, thinking mode will think for multiple minutes for me
in teh app
not an lmarena issue
I swear this is worse than just GPT 5 Chat
Let's go
Yes
Like consistently it gave worse results for me.
FANCY
this model will help greatly with my driving test in like 8 weeks ๐ญ
The non driving portion Iโd imagine yeah
yeah lmao, i passed my theory test question side thanks to it also
I promise the drivers you have to worry about arenโt these ones
i can kind of imagine it now
the driver approaches a roundabout and then asks, "Chatgpt, what do i do, i see a round about"
I drove with my smartphone glued to my forehead and Gemini Live View activated, so Gemini could help me pass. ๐ค
I so badly wanna make a black valley girl impression from 4o
or mark zuckerbergs new meta glasses which worked perfectly at the showcase
Gemini the only competing for leading model I can see having a huge upgrade for its next version.
Thx bro
I want to generate images and video , how to join and do this here?
The #1397655624103493813 got you, homes. It has everything laid out ๐๐พ
Itโs working for me if you mean in general and able to access the site.
It sounds like you're involved in some shady business.
What the hell, OpenAI? Just give us a SOTA video model instead of some TikTok video generator nonsense.
Sonnet 4.5 released
im bulgarian too @split kayak hey
yeah...
That depends. When we are comparing different sizes of models it gets more complicated. But it is an improvement for them
They are getting into benchmark numbers territory with it but further away from a niche thing that made them uniquely good - it is no longer the biggest reasoning model on the market with 4.5 Sonnet anymore
are there any limits on sonnet thinking now?
How tf is it that bad
Do you guys think gemini 3 pro will be much better?
Like the leap from 2.0 to 2.5 was insane
1.5 to 2.0 not so much
on desktop i tried logging out and each time it logged me back into the same account a split second after without leaving the page, after a few attempts i got an error 1015 page letting me know i've been ratelimited / temporarily banned. is anyone else having this issue? or should i open a bug thread. ive apparently been un-ratelimited already but the issue persists
Yes ๐
Well it's insane already
I can create stuff already far out of my knowledge
Do you think it will be free
Hey sorry for the delay. It's not guaranteed to work, but I've heard cases of refreshing the page helps nudge the model along. Sometimes it's best to just start a new chat unfortunately.
any progrewss on my problem?
kk, thanks for the response ๐
it happens even on new accounts
I only use ai studio and gemini cli
Yes but limited
Maybe it will be paid there
Well it's limited on cli too but I just cycle through accounts
No sorry to say we don't have an update on this issue.
My friends, do you know of any AI (preferably open source) that can dub from English to Spanish or another language? I use Eleven Labs, but it's very expensive and not that good.
Google ai studio
But you will need some editing
any open source?
Idk
TBD, we'll be sure to put out an announcement if that's the case.
So that? Make less points in the lmarena than the no thinking?
Deepseek 3.2 added
its already here
09-26
im like 100% sure max is thinking by default
I'm double checking that, I'm not so sure that's the case
Now I'm more confused as I'm not seeing 09-26 on the drop down 
really?
Okay now it's there
Very odd
But yeah will clarify if it's thinking or not.
What is this model ๐
Is a kidnapped person trying to talk with the tape in his mouth
if you read that in finnish, it means sleep sleep help ๐
/nukunukuapua
@wispy idol be sure to check out #1397655624103493813 for more information on how to use slash commands with the bot.
deepseek terminus is gone now? I wanted to compare it to deepseek v3.2
Yeah confirmed that is not the thinking version, which has been requested now 
There are rate limits, regardless if you're signed in or not.
Before being CM I didn't know what nailoong is, now it's everywhere
And why? what does that help with
Ive only seen a few people use it besides me and that was in 2020 in chinese servers
I need the emojis
i wanna use a model, not blocked by a rate limit as if like this is openai
yep, the 3.1v terminus die, and you never more you can see it ๐ฅบ
Happens a lot today.

Lemme seeโฆ
How many do you have
We need to have these rate limits in place. The growth we've been experiencing mixed with the cost of all these generations are helped with a rate limit.
.
I don't have any.
Deepseek 3.2 sucks
Helps with them giving less stuff away for completely free so they can continue operating?

I presume without rate limits it would be 0.1% of people with 99% of the usage tbh
Are you seeing this after generating? This is most likely caused due to a rate limit.


is not, when i press reload it makes pics sometimes. and rate limit gives different thing
Putting together a post soon, and yeah realized the last one was also 15k
#announcements message
it was fine in the past without it, why now
More people are using LMArena these days
what problems does that exactly cause then
I'm talking the 3 max thinking will make less point even that 3 max no thinking
How does that make sense
.......
holy new models
Do you think computing resources are unlimited and free?
We're paying for all of those generations, without the restrictions we wouldn't be able to continue operating. These rate limits are in place to ensure we're able to operate in the long term.

All the models 
What is the limit in the web.lmarena
When will the lm arena finally update?
Iโve been waiting so long
refreshing everyday
We don't specify each rate limit for each model/modality, but we are thinking about ways to make this information more clear as it's understandable why that'd be helpful to have.
the thinking from qwen sucks
Why kind of update are you expecting?
Leaderboards update
Ok
yes
Having the same problem. No rate limit issue.
I donโt know too much about AI so the leaderboards helps me see which are leading
Ah, for w/e reason my mind went to a UI update or something large like that 
leaderboard update
We'll update when we've collected enough votes and validate it. It can depend, but generally you should see an updates ~weekly.
Okay will look into and flag to the team. cc @upbeat horizon
Lets hope, some are 11-27 days ago now
gimme like a special subscription plan for 40 bucks lifetime with no limits cus i want to
Yeah each leaderboard is going to vary in time it takes, as the more battles we see with that modality, the faster we're able to update.
Alright thank you
I'm here to learn, grow and create something special and LMArena is the community I believe will help me get there much sooner than later.
It seems to be just the Seedream-4-2k image model. The others work fine. Maybe this info helps.
Apologies I wasn't thinking - we saw some reports of this earlier today (~8 hrs ago) where I flagged to the team.
Welcome welcome!! 
pineapple-thinking-32k

exclusively non-thinking
๐ค
Sora 2 available here ๐ถ
where?
Vidรฉo arena
@echo aurora
Does this mean that it failed to generate?
The retry button appears after like 5 seconds
That retry and copy button normally only appear after ther model is done responsing. That does appear that the model is stuck.
How long has it been like that? Does a page refresh change anything?
Well, none claude models appear to generate anything when I ask it harder questions.
nimble bean is most likely sora 2
i have been testing it since last week and it is very good
its probably just thinking a long time
Following up with you in the thread you've posted.
but i can't confirm it is sora 2 for sure since it appeared along with king 2.5 turbo pro, and i have two i2v made from both which looks identical
so it could be another variant of kling 2.5
where can i test?
i have been testing it since last week on artificial analysis arena but i think you can test it here since it is just released on lmarena
OHH right
i have the two clips saved, i can show you how both look identical
i didnt realise stealth models r in the video arena
alr
please check ur dm
can someone help i'm new to discord
how text to image & image to video work here ?
and also guide me
what is Video Arena 1 , Arena 2 , Arena 3
Hi! Please check #1397655624103493813
Yeah ha ha
What Ai chat I can use infinitally? The qwen and aistudio crasher using a time
Oh no, any ia is very silly and shytopahy compared to the 2.5 pro
Hmmm I'm seeing that this is more related to the personality of the dry AI (api) and AI of chatbot
I'm enjoying AI, I haven't had much activity on Discord, although I have a variety of trainings that is on this platform. However, I believe I've found it to be worthwhile in learning how to navigate on Discord.
Oh, Kimi has been taken down from LMARENA now...
Hi
webdev arena system prompt
whats especially interesting is the last line
"budget:token_budget200000</budget:token_budget>"
what it is?
system prompt of https://web.lmarena.ai
hello everyone i am new here
welcome
thanks
Hello! I am Kunletaiwo and I am a new member here in discord
I am using claude-sonnet-4-5-20250929 for claude-sonnet-4.5 for coding, but it makes a lot of bugs and errors which I don't think it is actually a 4.5 model.
hello any idea how to find our old video prompt generated in video arena >
gemini need more personality ngl
Hello everyone! Looks like a lot of fun in here!
@echo aurora check dm whenever u r free
You can either check your DMs with the bot, or you can go to the search bar -> mentions -> type your username
Hey, how can I check the limit for generating videos here?
As far as I know, the limit is five, and I already generated three yesterday.
How can I tell if my limit has been reset?
I just donโt want to use up my entire limit in one day, haha.
There isn't a slash command that'll show you how many generations you have, so you'll just have to count. Would note it's going to be 24 hrs after your first generation and doesn't reset at midnight kind of thing.
awrite cap! ๐ซก
Hereโs my current experiment: Iโm working on a short festival-style vector animation for Awaash Malt.
Prompt:
โAnimate reference poster in 8s. Background fade-in, typography spring effect, glowing script reveal, diagonal stripes sweep upward, golden Irreecha script stroke animation, logo fade-in with shine. Vector motion graphics, clean, no photorealism.โ
Looking forward to seeing feedback!
ok but why is there so many nailong emojis
hello i am newbie i want to make video
how to check highlighted
Idk
How do I locate the image I was creating, I am new. It said I would be notified when complete
Are you signed in to a google account?
All Sonnets usually being 32k thinking tokens, but 4.5 is 16k as 4 4.1 Opus
hello, so wonderful ai in here, i am newbie i want to make video
Not able to sign in on LM arena using Google account, can anyone help.
can someone tell e where can i make videos for free longer tham 8 sec
/video
#"Anthropomorphic hijabi cat doctor in a hospital corridor, surrounded by older senior human doctors with strict expressions. She looks calm and patient despite their attitude."
They always use the recommended thinking tokens by anthropic
anthropic even used 16k on benchs
so it makes sense
Do we have an architect here?
My video not show
Does anyone know of an API with free suno-v5?
no
hi
hi
the name sounds like Maori language to me
knowing Anthropic...this is pretty within my expectation, nothing interesting to see here, move on to look forward to Gemini 3 Pro ๐
@ocean vortex I saw you in Claude's server yesterday and wow the mods there are pretty aggressive especially when one speaks the facts... that confirmed the toxicity level within their company structure, it was eye opening...
claude seems to just go allin on coding, they lack the rest a bit
Hi
/โGenera un vรญdeo vertical 9:16 de 15-20 segundos con estilo publicitario para Instagram. Estรฉtica infantil, colores brillantes y alegres, pero pensado para atraer a padres.
Escena inicial: parque de bolas lleno de color, toboganes y bolas.
Despuรฉs: niรฑos pequeรฑos (2-6 aรฑos) reciben a Spiderman, que saluda, hace poses divertidas y los invita a bailar.
Corte a escenas de niรฑos riendo, bailando con Spiderman, haciendo pruebas fรกciles, pintacaras y una mini discoteca con luces y mรบsica.
Texto animado sobreimpreso en pantalla:
โ โยกEvento especial con Spiderman!โ
โ โ12 de octubre โ 1h30 de diversiรณnโ
โ โPara peques de 2โ6 aรฑos (bienvenidas todas las edades)โ
โ โPrecio: 15 โฌ por pequeโ
Escena final: Spiderman con varios niรฑos sonrientes, confeti y globos. Aparece el logo/texto: โEl Bosque de Andry โ Tres Cantosโ y un llamado a la acciรณn: โยกReserva ya tu plaza!โ
Estilo festivo, seguro, iluminaciรณn cรกlida, animado y atractivo para padres que navegan por Instagram. Sin voz, solo mรบsica alegre y dinรกmica.โ
but this is sonnet ๐ญ
Same issue
I was it about to complete the development code
For me it doesn't show the time
it's just straight up says no
create a lion
i hope gemini 3 pro willbe better cheaper and faster
GPT 5.1 is CRAZY
Create a videos with different info graphics displayed outside the VR headset and student have to feel touch moments and topics like Human brain, Lungs, Chemistry experiment and kidney can be covered to show more immersive
sorry my firend said it on my laptop
hello
how are you
i love your website honestly and also u guys have rate limits on your video generation correct?
Opera Neon's getting released today, so that's neat.
Not to the general public, of course, but to certain people.
Greetings.
Correct.
Five uses per day (I think).
Or 8.
hi all!
hi
Hello everyone
What in the Skibidi rizz just got laid upon my fine gyatts?
You can go to this site: https://lmarena.ai
Then, you can enable the image generation modality.
generate a 3d cartoon Baby holds shiny red apple, smiles. Another 3d cartoon baby rolls a big bouncing ball across the floor.
Welcome to a new baby model ๐โค๏ธ
Yeah that chick mod is uptight. Honestly I had to go back and reread it now as I already left the discussion before she showed up, thankfully, lol
Hah.
i respectfully disagree
new Claude is outright amazing for roleplaying games and creative writing
i would say, it is now the best model for anything
I agree.
It's a very natural writer, just like its 3.5 predecessor.
(At least, I think I'm using that word right.)
If it toned down on the dashes a bit, then I think it'd be pretty indistinguishable from an actual human.
It seems to disagree that it's a creative writer, posing itself as more of a โconversational buddyโ.
Yeah
However, I beg to differ.
And it refuse?
Then yes.
It refuses.
Claude is the ultimate Chad.
It doesn't glaze itself and admits when it's wrong.
Yeah, Realistic but need long time to get it right but worth it though
Meh.
I feel like it's already enough of a human writer.
Just needs to slow down on the dashes, then it'll be indistinguishable.
Yeah, better than most AI like ChatGPT which doesn't know how to even show a post realisticly
Hello
Hello! Izo
Greetings, Izo.
I'm new here
Yeah.
Welcome to the server.
We hope you are able to make yourself comfortable here.
Thanks
My pleasure.
@Izo, which AI do you like the most like Claude ChatGPT etc
What's the advantage?
You gotta do @thick rune.
It's an AI browser, and as such, has an agentic assistant that, like Comet, can control your browser and do things for you.
Better yet, there are even two other modes:
- Make
- Chat
I want to try out Browser MCP in my Andriod using Termux but MCP Browsers doesn't support Andriod so any suggestions?
Make lets you create apps and webpages directly in the browser, while Chat allows you to chat and generate images with the assistant.
No.
And no.
MCP Browsers aren't for Andriod?
Waiting for someone to make one for andriod
Hmm.. The last Opera I've been used was GX. Never knew it's now have some neon lights on it..
I believe it's called โNeonโ because it has neon colors. ๐คช ๐คช ๐คช ๐คช ๐คช ๐คช
Any thought on the new Claude Code UI
Haven't seen it just yet.
Test it out
Well, it requires a subscription, and I only have the free one.
Can you react to my post of adding Claude 4.5 Sonnet Search to LMArena in the #1372229840131985540
Vro
Bombshell.
Just did.
@ocean vortex
Thanks
My pleasure.
But my latency is high so my msg takes like 10 seconds
And i tried posting a screenshot of my claude code but it stucks due to internet
Fair.
cant wait to try! been a fan of GLM4.5 since it was introduced ๐
many people on reddit would disagree with you ๐
What's up?
Hello
https://www.reddit.com/r/ClaudeAI/comments/1npiulg/do_you_refer_to_claude_as/
there is interestingly a small portion of interesting people who prefer to think Claude being a... Chad-ette? ๐
am sure there are interesting people who like to think Alexa being a Gentleman too?
And i think that GLM 4.6 is better than Claude 4.5 sonnet thinking and GPT-5 is still on top.
Ultimately, a chatbot's gender is subjective, as they do not have a definitive gender.
@ocean vortex How do I add "Perform the deepest of the deep work" in your system prompt that it works
Generally, they're referred to as โitโ or โitsโ or โit'sโ.
However, the only case where the rule of gentrification would apply would be if the company of the chatbot itself refers to it as a โheโ or a โsheโ.
GLM has a great understanding of maths, so it's probably helpful with logically thinking at coding, claude sucks at maths unfortunately...
Some may agree, some may disagree, and honestly? That's okay.
time for Academia to come up with a term that is uniquely for non-biological intelligence
Claude suggests โchatbysโ.
I've heard from the personality designer of Claude that "his" name is inspired by Claude Shannon, the founder of information theory, the designer team keeps it ambiguous but is intended to be more male-leaning because of the name being mostly male
they train claude with traits that sounds more male or neutral than female, if I remember correctly
gemini is def more male leaning, it's so obvious
Nonetheless, it's still generally correct to refer to Claude as an โitโ.
I've been wondering why this specific tendency despite the nature of the machine
you can call me "it" too then
Jules ist mostly male, but also used by female too (mostly in France)
Well, I could, but then that wouldn't be generally correct.
how to find my generation in video arena
It'd be dehumanizing you.
It'll notify you in your DMs once it's done.
tnx
To find it, just tap the button that is linked in the DM.
My pleasure.
you do know there are people with both genders and none-of-them?
And so, even if your name is generally seen as a gender-neutral name, it still wouldn't make sense to call you an โitโ as โitโ is for objects.
And you do also know that you are definitively born as one or the other, regardless of if you want to or not, don't you?
A hermaphrodite () is a sexually reproducing organism that produces both male and female gametes. Animal species in which individuals are either male or female are gonochoric, which is the opposite of hermaphroditic.
The individuals of many taxonomic groups of animals, primarily invertebrates, are hermaphrodites, capable of producing viable game...
โEitherโ.
Not โbothโ.
That's true. However, it is your choice on what you decide to do with that mutated chromosome.
Wanna be male? Go for it. Wanna be female? Go for it. Wanna be neither? Go for it. Wanna be both? Go for it.
None of that matters to me, personally.
I just want to make sure the world is not either black or white like it's either male or female
Those exact "beliefs" are making some interesting people to think Alexa is a Gentleman or Claude is female ๐ well, it's a social construct anyway
Yeah, exactly.
If you wanna believe that Claude is a male, then by all means.
Maybe it's a female.
Maybe Gemini's a man or a woman.
Maybe ChatGPT's a man or a woman.
Depends, really.
It's genderless really but you can think of its gender anything that you prefer
And it's okay that I do.
Because it's my thought process.
It's not a fact.
Yeah, exactly.
So stop arguing pls
At the same time, we can all think of each of their genders however we want.
It's not an โargumentโ.
It's a healthy discussion.
There's a difference.
when will this get resolved?
Something went wrong with this response, please try again.
im not a mod so forgive me but this conversation doesnt feel like it belongs in this channel
And that's true.
i already had a situation, where a regeneration fixed it, and the model continued, but not this time
does it have to do with reset of rate-limits?
Maybe, LMArena team will create #heated-talks
Hello families


