#general
Agreed
our feedback is constructive, not a complaint
That’s great. Just don’t be pressured. work at your own pace.
😁
They said they would do better. Let’s be respectful at least. 😅
yo pineapple you've been doing great all day, thank you for being patient with us as well
This is an overall issue we're aware of and would like to fix.
Mid-outage though that's going to be much trickier.

okay nvm the site only looks like it works... i can only read the conversations but not add to them or start new ones
Bbb
Well sorry if it bothers you. We don’t deserve what’s free. Just know you could, theoretically, not be able to use it anyday. Obviously that’s not going to happen. But be respectful. 😁
Honestly, it's the engineers who are working on this that are the true heroes with these outages.
@wanton vortex
the admin is pissed off ig XD
What time will it come back?
No not at all!
i suspect the site is going through feature transitions so some code may not cooperate properly
I don't have an ETA for you sorry to say.
@echo aurora Thank you for your response. I think attaching files is one of the fundamental features. I haven’t seen an admin like you replying to every user’s message. Hats off to you!
The chance to come back today?
Google appears free, but its true cost depends on your definition of "free." Lmarena also isn't free; users provide data to improve its models. True free means receiving nothing in return. Nevertheless, I am grateful to Lmarena for this service ♥️
is more than 24 hours down?
im glad its free in terms of no fees or subscriptions
Yeah I'd be very surprised if it wasn't back today.
usually the site isnt out of commission for multiple days
ig yaal have to open a site like status.lmarena.???
That's great, you do a great job, congratulations!
What happened
Site is down. Wait until it’s fixed
Thanks for saying yo me
Hey all! While the engineering team is working on a fix, please keep chat and tone respectful 🙂
Updates will be posted in #announcements.
Thanks for your patience and understanding! ☘️
hi guys
have you seen gpt5 image model?
it's pretty bad lol
there's a new model being tested on artificial analysis
it's not gpt1 image
It is ‘bad’ because it can’t edit an element of an image without changing everything else
it's "gpt5"
yeah
Nano banana is better at that.
exactly
site is up, but no chat history
is lma up again?
Yes
ok i have every chat
I actually used it 2 hrs ago but now 😭😭
We're still monitoring
are you logged in to the site?
ye
gg
I was not logged in and all previous chats are gone 🙁
I'm really sorry to hear that. Do you have the sign-in with Google option?
🙁 that should be fixed in another way, not only for logged-in users, that's bad 🙁
yes, but I didn't use it. Was fine without it.
Where do you guys make your videos?
what just happen?
Sorry to say the Video Arena bot is still not working.
i can access
i have a question pineapple
fire away
how do you get the ai's to work is it with api or how if i can ask or allowed to
Team is aware and looking into this btw.
Sorry to say we don't have an API that's accessible
He's asking how do you get all the AI models on the website to work (e.g. GPT-5-HIGH)
that too
Isn't it pretty straightforward? Just call the API for gpt-5-high or whatever and retrieve the LLM response.
better question does it cost money
Yes, it does.
ohhh
What is the best way to make AI videos for YouTube shorts for free?
@echo aurora qwen 3 max 😶
Is good or sucks?
Very good
im still getting rate limited after a single generation. of course im not logged in. is this intentional now or just a bug left over from everything that was going on? as this happened before everything went down originally
What model are you using
More intelligent than gpt 5?
its just battle mode, it is image generation but still just one and then it tells me to log in
Site is working for me
Might be a restriction.
I don't work for LMArena so I can't tell you exactly
Right now only the non-reasoning version is available, and yes, it's more intelligent than GPT 5 without reasoning
it rather competes with Claude opus
for what do you guys use the LMArena?
regular text generations are working normally it seems just image generation is having issues. weird
where did the google login and my chats go after the website down
nvm i refreshed site 3 times and all back
It's highly unlikely that it's an imposed restriction. Nano banana should be free
Your google chats should be saved. Else, it's not confirmed that you'll have the chats after a website outage.
nano is one of the limited ones but never a single at a time. i have no clue whats going on and will assume theres a left over bug still
dw i got them back
for what do you guys use the LMArena?
Research
:0
Yes it is. Scroll up.
glad its back
/video
@echo aurora sorry for the ping just had to ask. is the rate limit of only a few generations a bug (as it started before everything began to crash), a temporary thing while there were issues, or permanent now?
hello guys
gpt-image-1-high-fidelity 🤔
"high fidelity"
Hi
Actually gonna crash out the new release is CLOSED SHUT
i had a hunch
it was correct
hi
cinematic
/video
hello
how is the new Kimi K2 update?
You guys can't see this but I am seeing the most adorable bunny rabbit
Hola
Note our Video Arena bot isn't working at the moment, @undone flint
What is the new gpt image model? where
I'm not sure what you're referring to.
is video working in next 5 -12 hours?
Sorry to say I don't have an estimated time for you.
/video
You can't use the bot in this channel. Check out #1397655624103493813 for more info, but note the bot isn't working at the moment.
Let me check.
I hate Claude on LMArena. I can’t even make a proper Doom Slayer story without getting a Terms of Service violation warning. 😭😭😭 And Claude is so damn good at making stories too. My life is over
😭
/movie
In Russian, Claude swears more than GPT
New trend ? 💀
And Claude responds more humanly than gpt
use gpt5 high for story making
I didn't like it so much..
Models that deliberate respond less humanly than those that do not think before responding
okay
gpt5 medium to high wins in all fiction novel stuff
and where to find this version - medium?
api
are u new ?
use openrouter or openais playground api
and novelcrafter is the software where u write the story
where
Is it that good?
I loved that book!
What does High fidelity have to do with ChatGPT
It rarely appears in the lmbattle.
It's hard to find (
@echo aurora when will gpt image high be available

say hi
Yeah it’s great actually but it can be such an ass sometimes
It appears to me twice but it hasn't appeared again, but recently it appeared again to a friend
Hold on, do we have to login now? I don’t want to
Hi when will bot start again
Yeah so that's a new model that's only going to be available on Battle mode, you won't be able to find in Direct/Side by Side. cc @open mountain
Unclear, I don't have an ETA.
And when will it be public? direct
It's not certain if it will, so I can't provide an ETA.
Will Arena ever have an App for mobile
It's possible.
perhaps never?... I didn't understand
Hmm will it likely have in-app purchases?
I mean there has to be a way to keep apps alive no?
Hello, Just Join Here
Welcome
Correct
It'd likely be very similar to our site if we were to make one.
Then you would have the most popular app on the appstore
Most apps on the appstore shove in-app purchases down your throat which can be expensive as hell
@echo aurora so is this still just a bug to only be able to generate about 2 images now? or some kind of safeguard brought up when everything was crashing? or possibly a bug left over from the website going under as i was getting this same thing just before it all started
I also have this doubt, after all the site's proposal was not to require logging in.
Yes, I'm a little annoyed by this bad decision. I would like to remain anonymous.
sota - local ai
5
7
1
Qwen3 30B 2507
Hi everybody!! Why I don't have permission to write in # video-arena-1, 2 or 3? Thanks
I pick "or"
/video
Why can't I take photos and videos?
What happen
cant they add option to login with other email instead only gmail? like outlook, etc...
Why is LMArena not working??
what issue in specific?
@ashen plaza go to nani banana
you do understand you might have different issues right? say what the issue is
@echo aurora nice typing in announcements
yes
gemini 3 when
was it that anonymous-model-0514 thing?
or what was its name
btw @echo aurora you should have made some sort of announcement before you guys decided that it was a good idea to force Google login after 3 image generations
not everybody wants to make a Google account, you know?
Which model?
just because 99% of the world uses Google doesn't mean that you should ignore the other 1% that doesn't use Google
there is a stealth model in the image arena that has been there for a few months called that
I agree, we're crafting something atm
We are looking to expand User Login to not just Google
Noticed that you added the account feature lets goo!
I knew that with the implementation of login would come higher rate limit fees
vro is there a private video generator so i can make free unlimited vids for myself, pfft
Bruh, why say vro
What is this bastardized word?
What are detailed limits?
don't worry bout it unc
Am I really that old?
i would choose banning all new member who came in in the last few weeks
so i can do more usage in here
D:
What are detailed limits?
does lmarena have unlimited free uncensored private every single chat, image, video model for free with no paywalls
The bot can only be used in this server
yes but, dont want people to see it
We don't have those posted somewhere; however, this was flagged to the team as something we may do going forward.
ok
Sorry to say I don't have a solution for you
took me 1 minute and 43 seconds to realize you're a mod here
@echo aurora Are the rate limits for direct chat or Battle Mode now? I was only getting rate limited in direct chat even before the login was implemented.
Oddly specific
what is a rate limit
I believe it's just in Direct/Side by Side but will double check
Use limit essentially
Please do 🙏🏻 As far as I know Battle Mode is supposed to not have a limit
The other models were always rate limited before
Idk why it needed to be implemented if it was already a thing
theres limits now? chat limit? photo limit? for what
this used to be unlimited
Be grateful for what you have. We don’t deserve it.
Yeah I'm 99% sure Battle doesn't have these limits, as if you run into a limit with a model, it'll sample a different one.
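For anyone curious what "it'll sample a different one" could look like, here is a minimal sketch. It is only an illustration: `sample_available` and `is_rate_limited` are made-up names, and nothing in this chat confirms how LMArena actually picks models.

```python
import random


def sample_available(models, is_rate_limited, rng=random):
    """Pick a random model, skipping any that are currently rate limited.

    `models` is a list of model names and `is_rate_limited` is a callable
    returning True for a model that cannot be used right now. Both are
    hypothetical stand-ins, not a real LMArena interface.
    """
    candidates = [m for m in models if not is_rate_limited(m)]
    if not candidates:
        # Every model is limited, so there is nothing left to fall back to.
        return None
    return rng.choice(candidates)
```

With this kind of fallback a user would only see a limit once every candidate model is exhausted, which may be why Battle mode can feel unlimited.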
thats not what im sayin, im sayin it used to be unlimited, why is it limited now
Because these models are not free
Don’t take them for granted
No it was always rate limited before
Be grateful
Always has had rate limits. But it depends on the model
I’m very confused
Rate limits already existed before
So what exactly was added or changed?
There is not
but whats changed now if the limits still existed
Yeah I’m confused as well
Users who are logged in are going to have less rate limits
"you would ask me to pay to use the technology developed by world wide consumption of millions of hours of labor and research? how dare you"
see bro this guy gets it
This is why I love local generation 😊
mind games
I can use it unlimited
Both will still have limits, but if logged in you're going to have less. Does that make sense?
Why can’t people just be grateful that there’s a place where you can use LLMs for free? even though there are limits?
as far as I can tell if you arent logged in you get like 2 or 3 generations before being rate limited, even on battle
Whoa
Battle Mode isn’t supposed to be rate limited though?
oh, less limits for logged in people, oh yee i get it cus i already was logged in with an account on it
Be more specific. What LLM?
That was always unlimited
as far as I can tell all of them for image generation.
Still double checking on this. There may be a difference for image/text.
I think a lot of people are about to get angry lol
why cant i send a video game character being mauled to a direct chat to gpt with it constantly being flagged
guys what is the best web search model currently
After force login, now IP banning. LMArena was a kind a hero, until about... A week ago..
IP banning? elaborate
Yeah confirmed there are rate limits in Battle mode.
lads what happened to story telling lads, cant even send a pic of some game screenshot scenario on there without being flagged, certainly not REAL
the force login as far as it seems atm is just temporary, based on what i picked up from chat earlier there seems to have been a massive flood of bots on the website and discord and this is to prevent that. of course i could be wrong
guys what is the best web search model currently?
o3
thx
No, it should have stayed the way it was instead of catching users off guard and causing anxiety... if you propose to offer something accessible without the need to log in and get your users used to it, only to remove this feature later, then you should have done that from the beginning.🤷♂️
admin the censorship thing is so geeked rn
I think the restrictions were necessary though
The site must have gone down for a reason
at about the same time the website went down google had a major outage too. if they are linked who knows, could just be a coincidence
tbh i think one of the main reasons they did this is because of bots that use lmarena, and also to lower the load so more generations from real people go through. I don't like the Google requirement in particular, but they are coming up with other options, hopefully email login.
I doubt they’re linked. Pineapple said the LMArena devs specifically are working to get the site back working. If it was a cloud flare issue then they would just have needed to wait for them to fix it
I appreciate the feedback and will be sure to share with the team
yeah could be. my idea was google had a major outage causing the initial issues last night, but it just ended up becoming much worse due to said outage and possibly made a new issue. idk im just throwing out a random theory with literally 0 backing and im probably completely wrong
We don’t know. Regardless, if there were even bots on the site, there should have been a better way of preventing them.
didnt they have captcha going on for every generation?
You must refer to cloud flare
ig
Cloudflare captcha is dependent on other factors. It’s not a fixed rule that it should be every generation.
It can happen every generation. But it’s not fixed. It depends
i got that a lot before. guess i was unlucky
i realized that the most popular lmarena proxy solution is from china, i get constant errors when it's late at night, which also happens to be the time that chinese people are up. hmm...
also accounts really dont stop bots. people can and will make bot accounts after all. it really does seem the main reason accounts were added was for saving stuff and keeping your generations
@echo aurora What is the limit of generated images for people that have made the login?
Probably not linked. But if I had to guess, the bots would most likely be chinese.
That’s the mask. Underneath it was probably to stop bots.
We don't have this listed somewhere, I'm checking with the team 👍
makes sense to me. western ai companies are blocked in china, but they also have to find ways to access those models for benchmarking, comparisons, distillation, etc. so they may be using LMArena for those purposes. Google is also blocked in China, so they'll have to find a way around this.
let me know please
Good point. I also have some private reasons to believe they’re chinese, maybe using VPNs to mask themselves or whatever.
Oof…that’s a big no
Battle Mode isn’t supposed to be rate limited at all
That’s honestly ridiculous and should be reverted to how it was
DH3 is Seedream 4.0
It's official. LMArena is slowly becoming paid. The one service that was known to be non-biased and provide a good place to use AI for relatively free (with some restrictions), is now getting more and more restricted. Forcing you to make accounts ...
We all know what comes next. Subscription models. Brute-forcing users to login...
WHAT THE HELL
Where can I try it
Seedream 3 was already very cool
I wouldn’t say that unless there’s proof
Nobody knows the devs intention
Just agree for them to sell your data at the cost of using the service for free and call it a day 👍🏽
it's SOTA
GIVE EXAMPLES
How can we try this model?
https://artificialanalysis.ai/text-to-image/arena?tab=arena it's here, you cannot write your own prompts
it will come up randomly often
I will say that having Battle Mode be rate limited now is scummy and shouldn’t have been done
Guys anyone using piclumen
goodbye free gpt 5
Fr
hi
I understand LMArena . After all it needs to be monetized
they cant just burn their wallets endlessly
no, vc will give them more moni
@echo aurora All this stuff definitely goes against the mission statement of LMArena
They shouldn’t say it’s ‘free and open for everyone’
That’s false advertisement
I really don’t think the Battle Mode should be rate limited either because that doesn’t even make sense
Down again?
They’re just implementing nonsense against us now
All because the website got popular
That’s the only explanation
If you’re rate limiting us, we should get an explanation as to why and also what the rate limits are
Like…Gemini 2.5 Pro itself doesn’t really have rate limits to begin with
So how does that make sense?
There wasn’t an announcement about it or anything, just blindsided your users and expected them to be happy with the changes
Yeah you are right
The team is listening, I do appreciate the feedback.
The team shouldn’t have blindsided everyone like that
That is highly unprofessional
A proper announcement is what usually happens in situations where drastic changes are occurring
Yeah the site got popular, because it’s a great site
But the kinds of changes that are being implemented will make it a lesser site
The whole rate limiting Battle Mode really leaves a bad taste in my mouth
I know I’m complaining but if people don’t speak their minds and stand up against these things then people get walked all over
And it really does go against what the mission statement is of LMArena
Cry about it
Yh u should ask for refund
Let's be nice, it's fine for users to voice their concerns to us.
Do we know anything about the rate limit for logged-in users?
Cry about it? No, I’m raising valid complaints against a company that clearly wasn’t transparent about their intentions to their users
That’s called ‘holding accountability’
Or do you just let people run roughshod all over you?
I don’t typically get upset at free services coz I’m not entitled
Where i can use seedream 4.0
At the moment, we're not going to share the specific rate limits due to changes. I am passing along the feedback.
Has nothing to do with entitlement, has everything to do with their mission statement that is clearly on their website
No it doesn’t, it has to do with your interpretation of it
No
Yeah this is my same question but I think that is not actually global
Ok
😔
Yeah my same reaction
And seedream is better than nanobanana or comparable?
I am very curious to see how good is seedream v4
Idk, but results are good
I hope the use of artificial intelligence never gets monopolized. All this competition in the market is driving the creation of better and better tools.
I hope I get the sole ownership of the monopoly
Sorry I missed this, is the site down for you?
i agree
Not out yet
Up again
Please, may a billionaire use all their money to create an artificial intelligence that allows image creation and editing without censorship
grok sorta
I think I hit my limit using nano banana when using the website. I did not get a message at all though. When does the limit reset?
Generally, we're not going to provide details on if/when specific models are going to be added.
So lets keep it mysterious then 
But yeah like I said I won't be sharing details about upcoming models until we're ready to announce.
About 1 hour after the limit is hit.
Hello, excited to start this new journey exploring the world of AI.
Well, you arrived at a pretty bad time.
It keeps disconnecting.
😭
and here we go again
what happening?
The website's stability is worse than a DDoS, fr
Why do you guys restrict the models so much? Imposing logins? I've been using it since early lm-sys . The restrictions were not that bad. @echo aurora
I was seeing the same; however, it's now working. Are you seeing the same?
do we have any ideas about image generation limits? like how many generations am i allowed for gpt-image a day?
We've been seeing unexpected traffic and interest in the site, as a result raising the rate limits was necessary.
I am the only that can go on lmarena.ai without any problem?
Sorry to say that info isn't shared; however, it's understandable why that'd be helpful information so the team is aware of this request and are putting thought into it. Would note the rate limit resets ~ an hour after you hit that limit.
What happened to the Copilot leaderboard? it's 99 days old now
Also any plan to get some 🎶 AI Music generation leaderboard at somepoint? 🤔
Thanks
I don’t know how it was unexpected when you’re hosting all the major models
That was very well-known
😫
We try our best to get ahead of potential issues as much as possible. It's not always going to be flawless, but we learn from our mistakes and always pay attention to how we can improve.

I mean it says it all right there in the mission statement…
If the site no longer aligns with that, they should change what it says otherwise that’s misleading
How are models supposed to get adequately tested if they’re getting rate limited in Battle Mode?
That sucks. I get that there’s unexpected traffic, but requiring logins feels like a pretty harsh step for what was originally a free, open experience. It comes off less like rate-limiting and more like a step toward monetization, which a lot of users won’t like. There are lighter-weight solutions you could try before forcing logins: temporary IP-based rate limiting, request queuing, etc. Those approaches keep access open without tying usage to accounts. Part of what made lmarena great early on was the frictionless experience. Adding barriers like logins doesn’t just hurt usability, it sends the message that you’re willing to trade off the community’s trust. If you go down that path, don’t be surprised when long-time users start leaving. At that point, you’re not really offering the “open experience” you built your reputation on anymore.
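To make the "temporary IP-based rate limiting" idea concrete, here is a rough sketch of a sliding-window limiter keyed by IP. Everything in it is an assumption for illustration (the class name `IpRateLimiter`, the parameters, the in-memory storage); it is not how the site is known to work, and a real deployment would need shared storage and a way to handle shared or spoofed IPs.

```python
import time
from collections import defaultdict, deque


class IpRateLimiter:
    """Allow at most `max_requests` per `window_seconds` for each IP.

    Hypothetical sketch: state lives in memory, so it only works for a
    single process.
    """

    def __init__(self, max_requests, window_seconds, clock=time.monotonic):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self.clock = clock  # injectable so the limiter can be tested
        self.hits = defaultdict(deque)  # ip -> timestamps of recent requests

    def allow(self, ip):
        now = self.clock()
        recent = self.hits[ip]
        # Drop timestamps that have aged out of the window.
        while recent and now - recent[0] >= self.window_seconds:
            recent.popleft()
        if len(recent) >= self.max_requests:
            return False
        recent.append(now)
        return True
```

No account is involved here: the only key is the caller's address, which is the trade-off the message above is arguing for.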
Sadly altruism isn’t realistic
I agree
They probably have good intentions, but they’re going to end up being like all the other companies and charging premium fees for usage
And you’ll still get rate limited even when you’re being charged 🥴
I can see rate limiting direct chat on the image gen models, but on the LLMs?
That’s pretty much a slap in the face
Very well said.
The most frustrating thing is these generations that fail but don't give you a refund... Every AI gives you a refund for failed generations except this one... for only 3 generations this is ridiculous
Yeah, failed gens without refunds are rough. and with the new login wall already making things worse, it feels like the whole experience is slipping away from what it used to be.
Lol i have to agree with you on this one. It's changing from what it used to be even tho it built its reputation on the old system.
hello
Maaan fk the new system its some bs
This is gonna be a sheetty ai router if it continues like this
@echo aurora Listen to user reviews you guys damn it
We aint sign up for this bs
hopefully they would listen to what most people been complaining and find a better way... (kinda beginning to look like a cope thought sometimes)
How can anyone be angry at pineapple with that pfp
This become paid and i zoom the fk outta here
"Lets offer our users a free experience"
"Proceeds to add a login wall thats gonna be followed by some shetty restrictions soon"
Fk you man
at least for the battle mode, the original purpose and how it was working should never be touched
Thank you for sharing this feedback, it is important to us to hear how we can do better. All of this is being shared with the team. We are taking this seriously.
Every AI gives you a refund for failed generations except this one
Yeah this is a really good point. If a generation fails having it count towards the rate limit doesn't make sense.
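A small sketch of what "failed generations don't count" could mean in practice: reserve a slot before the call and give it back if the call raises. `GenerationQuota` and `generate` are hypothetical names used only for this example, not anything from LMArena.

```python
class GenerationQuota:
    """Count generations against a limit, refunding the ones that fail."""

    def __init__(self, limit):
        self.limit = limit
        self.used = 0

    def run(self, generate):
        """Call `generate()` if quota remains; refund the slot on failure."""
        if self.used >= self.limit:
            raise RuntimeError("rate limit reached")
        self.used += 1  # reserve a slot before calling the model
        try:
            return generate()
        except Exception:
            self.used -= 1  # the generation failed, so give the slot back
            raise
```

The error is still re-raised so the user sees the failure; only the accounting changes.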
I actually know some that don't and its stupid.
yeh the money gotta go somewhere after all. but i think there are options. maybe agree to sell user data if the user wants to use it for free, or have some sponsors or do something. after all u brought the community in on free models and unbiased ratings. if it steps away you're going to lose the community base.
we are talking ive seen some people complain they have like 70% of their generations used up on errors and didnt get them back. now it was a text based bot not an image generator but still
hi guys can we create 9:16 aspect ratio video?
I dont think we have any control over the aspect ratio
Something that has always resonated with me from this blog post (would recommend a read if you haven't already) is this section:
At our core, LMArena will always provide:
- An open, accessible platform for the community to participate in evaluating and comparing models through real-world prompts.
- Transparent, science-driven leaderboard policies that ensure every model is tested fairly, consistently, and with community input.
- Features designed for the community, from a better UI/UX to more ways to engage, vote, and contribute to AI progress.
- Research to push the cutting edge of AI evaluation and reliability.
I can assure you this will always be the case.
that can all change
people make empty promises all the time
nobody knows one way or another
maybe lmarena will do better
i have faith
^
it's too bad. can anyone give me a site to use the veo3 model for free, or a very cheap one, with selectable 9:16 aspect ratio?
veo 3? for free? GOOD LUCK! I know of a website with veo 3, cheapest around for 5 dollars. A GENERATION! veo 3 is much too expensive atm to be running for free anywhere outside of testing purposes like in this discord. I can send you a website you can generate videos in 9:16 ratio tho. it has 10+ minute generation times and 5 free priority generations a day tho. otherwise unlimited to use
nice i dm you dude
I assume there's limits on using the higher models right?
Let’s look at the bold words here on the LMArena blog:
It should be very obvious why everyone is angry
These new rules or whatever they are entirely contradict all of that
Battle Mode should NOT be rate limited
It wasn’t before
The only thing getting rate limited was using direct chat
The old model was beyond preferable to whatever mess this new thing is
“Commercial sustainability should never come at the cost of community trust or great science”
Well, I can guarantee you, with these new rules the trust of the community is definitely plummeting
I would hate to see LMArena become another “Open”Ai
Sam Altman really can’t be trusted either
"accessible to everyone" well, not everyone wants or has a Google account, Google accounts may be banned in some regions as well. I hope they add other options so more people can make accounts, but I wonder if LMArena will be able to tell which person did a certain prompt or generation, which I imagine they can.
What's up with the people as small figurines on a desktop?
something about a trend
Ah cool. I enjoyed that action figure trend, I suppose this is similar. Another good one might be to create you and some friends as miniatures in a tabletop war game
i am excited to be here, hope to learn new things here.
Welcome! 🙂
Flushing away good will is a full time job of the modern tech industry
how do i get the image generator on the website to generate 16:9 images?
hello
Did u try SeeDream 4?
What’s open router ?
No lol
Its not as performant as current top models, let alone upcoming gem 3
hey, if I ask to generate a video of someone that looks like Jar Jar Binks from star wars, is that forbidden, or is it just an error that happens sometimes?
What model would you say its ability is on par with?
Man, what's happening? What's with the rate limits? How many are these rate limits per chat?
🙁
The rate limits may be due to the fact that the models are being hosted from their respective APIs, and thus rate limits are likely to occur when using the associated model too many times.
Is it because of the traffic?
I don't think traffic would cause shortened rate limits.
The issue may be arising from excessive use of the associated model in such a short time.
2 mill context window? yeah that sounds like it could be gemini 3 dusk could be flash and sky could be pro?
nah, after testing it its just wayy too garbage and doesn't act like gemini does
So you think it'll be a smaller open-source model?
I'm praying for open source if it's not a frontier model
unless google was massively cost cutting this wouldn't make much sense, its worse than gemma
It’s bard
Perplexity lmao
How do they even get billionaires to invest in them
There is not a single thing Perplexity does better than the frontier companies
LMArena not working on brave, keeps repeating
😭
Is sky or dusk better
??
sky is better, a little at least
ill go back to open router for a while
sky is pretty much uncensored
idk grok models like grok 4 are worlds ahead of sonoma sky
ohh i see
People when they have limited free access to a stuff they would usually pay for:
I aint the only one
lmarena lists image-to-video arena and the veo model, but i don't see it on the website, how can i generate videos from images on the website? anyone?
on the discord itself
Perplexity: “ai”
VCs: 
I am unable to choose the model
yo chat
Greetings, Vova.
heyy
Hello.
anyone here used gemini cli
I have.
do you like it
It's pretty neat.
That's fair.
have you used openhands
I have not.
oh its also cool
I see.
they give lots of free credits too
That's nice.
Can you generate a video from an image you already have?
yea
do /image-to-video prompt:text
You sure can.
i think
That's correct.
How?
Thanks
Do you paste the image in to the prompt?
Well, within #video-arena-1 , if you use the following command, then you can input an image from your camera roll into the prompt, as well as the optional prompt for what you want to happen in the video:
"/image-to-video (image) (prompt)".
3 months
its just a guess
actually let me ask gemini when it will come out
Can i choose the model for video generation?
so long
its the best free model rn still
December 10, 2025 it said
2.5 pro?
o
yes
2.5 pro is kinda old ngl
yes its been here since march
yea
half a year of 2.5!
how good are you expecting 3.0 be?
idk, if it aligns with the jump from 2.0 to 2.5
then it will be insane
but i doubt it will
dang
OR if it will be they will make it paid, or other stuff
i think hopefully they will make flash free
hopefully they keep stuff like it is now
yea
normal people using the normal gemini website will keep paying
and i will keep having unlimited on ai studio
but is it gonna be only ultra for like a few weeks or months 🙁
hope not
yea
like veo 3 its locked behind money 🙁
🙁
at least they made nano banana unlimited free
it is REALLY good
i think so
:0
wdym
for 3.0 do they just work on 2.5
nvm im dumb lol (:
bye bye
bye
and it said February 25, 2026 available in ai studio
i wonder how right it will be tbh
Hey, came across this Twitter thread summarizing OpenAI’s new paper on LLM hallucinations:
https://x.com/LuozhuZhang/status/1964209351960514778
They highlight that hallucinations aren’t inevitable if you change incentives (e.g. penalize confident errors more than abstentions, reward calibrated uncertainty).
Has anyone here read the paper? Seems like this approach could significantly reduce hallucination rates. Any dataset recommendations to test this kind of “abstain vs. wrong” behavior?
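To make the incentive point concrete, here is a minimal sketch of an "abstain vs. wrong" scoring rule. Under plain right/wrong grading a guess never scores worse than abstaining, so guessing is always rewarded; once wrong answers carry a penalty, abstaining wins below a break-even confidence. The function names and penalty values are made up for illustration and are not taken from the paper.

```python
ABSTAIN_SCORE = 0.0  # assumption: saying "I don't know" earns nothing and costs nothing


def expected_score(confidence: float, wrong_penalty: float) -> float:
    """Expected score for answering: +1 if right, -wrong_penalty if wrong."""
    return confidence * 1.0 - (1.0 - confidence) * wrong_penalty


def should_answer(confidence: float, wrong_penalty: float) -> bool:
    """Answer only when the expected score beats abstaining."""
    return expected_score(confidence, wrong_penalty) > ABSTAIN_SCORE


def break_even_confidence(wrong_penalty: float) -> float:
    """Confidence p where p - (1 - p) * k = 0, i.e. p = k / (1 + k)."""
    return wrong_penalty / (1.0 + wrong_penalty)


# With no penalty, even a 30%-confident guess is worth making.
print(should_answer(0.3, wrong_penalty=0.0))
# With a penalty of 3, the model needs to clear the break-even confidence first.
print(break_even_confidence(3.0), should_answer(0.8, wrong_penalty=3.0))
```

For a dataset test, the same rule could be applied per question by counting correct, wrong, and abstained answers separately rather than reporting accuracy alone.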
@sly isle
@echo aurora
What's the rate limit
And remember, setting a rate limit won't be a good thing
is it possible to get desired image size output in Lmarena?
No, all of that is random.
I’m gonna ask a llm to summarise it for me
Looks interesting tho
Thank you. I found this Twitter post really well-written.
it resonated with me a lot and I feel I got a lot out of it.
Hello LMArena!
Greetings, Peter.
Can anyone tell me if LMArena gonna be paid or free as it is rn in near future? @robust yoke
Well, considering it's a testing benchmark for popular AI models, I don't really see a reason as to why it would be a paid service any time soon.
thanks sir, appreciate u replying 😊
It's my pleasure. I always try to reply with the most factual information I can.
one more doubt, are we gonna see video generation on the website itself in upcoming days?? or will it be limited to discord forever? and will we be ever able to choose the model? just like image generation on the website
For the time being, video generation is limited to the Discord server. However, people have requested for video generation to be part of the actual benchmark website itself. In the near future, we are likely to see a video generation feature where you can directly generate videos with popular video generation models like Veo 3, just like how you can generate images on the website with popular image generation models like Imagen.
thanks again 😀
My pleasure.
ohyh twitter summary is good
I’ve looked through his earlier comments, this man’s thoughts are quite interesting.
I can’t see them coz I don’t have twitter account
Funnily, OpenAI is actually the company not implementing their own research ideas from the paper…
I think this shows that anthropic is already aware of it and addressing it.
W Anthropic.
So who wins?
Claude, I'd believe.
True.
Oh, where does this come from?
How to generate 16:9 images in nano banana
That sort of stuff gets randomly decided by nano-banana.
It's not directly adjustable.
Hi, im new here. Nice to meet all
Nice to meet you too, Bimmo.
I make a post that we lack valuable insight into how LLMs function because we don't have good benchmarks
Suggest how to improve benchmarks
Nobody cares
OpenAI drops a paper literally with the same conclusion
Everybody and their mama loses their crap
Greetings, Tansri.
It's an obvious thing, no idea what takes them so long to discover these. All the money and talent going into this and they didn't think of that simple idea to reduce hallucinations, I wonder how many other easy things they miss
Let me share my perspective on this. While it may seem straightforward in hindsight, research breakthroughs often appear obvious only after they've been discovered and validated. The challenge isn't just identifying potential solutions, but rigorously testing them, understanding their tradeoffs, and implementing them effectively at scale.
The idea of penalizing confident errors more heavily than abstentions isn't entirely new, it builds on established concepts in machine learning about uncertainty calibration. However, systematically applying this to reduce LLM hallucinations while maintaining model utility requires careful experimental design and validation.
I think it's worth considering that AI labs are often exploring many promising approaches simultaneously. What might seem like an "obvious" missed opportunity could be something they've investigated but found challenging to implement effectively, or that had unexpected downsides that weren't apparent at first glance.
That said, I do agree that sometimes simple yet powerful ideas can be overlooked, especially when teams are focused on more complex approaches. This is one reason why having diverse perspectives and open research discussions in the AI community is so valuable.
What are your thoughts on other "simple" approaches that might help improve LLM reliability?
Are you capable of rational thought rather than just pasting LLM output verbatim?
Just because it's long and detailed doesn't mean it's an LLM output.
And even if it was, who cares? We're in a server where AI is pretty much normalized, even for creating images and videos.
Therefore, even if my response were generated by an LLM, there wouldn't be any reason to get angry since we're already in a server where that pretty much happens 24/7.
It is devoid of any meaningful information or point
Just like how many statements tend to be.
Actually from OpenAI 💀:
https://openai.com/index/openai-anthropic-safety-evaluation/
I agree though, many ideas proposed and implemented are fundamentally incredibly simple (not all of them, but quite a lot).
The hard part is usually scaling and really optimising and making sure they are the right ones though.
I still don't think it's that hard ngl
E.g. MoE is the most straightforward thing ever, yet it took quite a while for everyone to adopt it, because while the theory might be simple, implementing it is quite difficult and requires some experimentation
This idea is like reducing hallucinations to half or smth. It should be very visible even with small models
The thing is that stuff you try in small models / experimental papers always ends up being more complicated in bigger ones. That is basically what I am referring to.
I'm aware, I still don't think it's that hard. I've scaled ideas like this before; it is simply some adjustments, the core idea is the same
Just because you have a big model does not really change the dynamics of optimal data in most cases
Good data for small and large models remains largely similar
I would assume it has been published because they have realised themselves it is an obvious thing
So no point keeping it secret
What's up?
Hey guys
thats why i may have said "people"
google has been cooking with their releases, of course they will cook with gemini 3
Is there a way to pin texts I send ?? So I’ll see the replies later that day?
hello
This is a difficult balance to get right to be fair, especially with the way current evals are being done. Making it 'refuse' will also inadvertently make it refuse answering what it otherwise would have answered correctly.
So it all comes down to what cost you are prepared to pay to tackle hallucinations, and whether sacrificing overall performance is ok
if you're going for max performance/score on benchmarks that don't penalize wrong answers, making the model hyper confidently yolo everything is probably optimal, for that.
I do still think "hallucinations" are a fundamental problem with llms though.
This was for o3, and they did tackle this problem with gpt5 specifically. Though probably not to the extreme levels
https://www.reddit.com/r/singularity/comments/1mm51m5/gpt5_admits_it_doesnt_know_an_answer/
Ik, but I have yet to actually see a review that tried to quantify this o3 vs 5.
They also talked about using a classifier that tries to detect hallucinations as a reward signal throughout the post training phase (of gpt 5).
And I also personally noticed the change. + much of their system card report was focused on reduced hallucinations.
hellooooooo
Helloo
o3 said it doesn't know the answer in 0% of responses
gpt5 did it in 5% of responses (100-(40+55))
small steps
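The subtraction quoted above can be written out as a tiny helper: whatever share of responses is neither of the two reported outcomes is treated as "said it doesn't know". The parameter names are assumptions; the chat only gives the figures 40 and 55 without saying which is which, and the sum does not depend on the order.

```python
def abstention_rate(correct_pct: float, incorrect_pct: float) -> float:
    """Share of responses (in %) left over after correct and incorrect answers."""
    return 100.0 - (correct_pct + incorrect_pct)


# Figures quoted in the chat for gpt5: 100 - (40 + 55)
print(abstention_rate(40, 55))
```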
I have a slight question about the rate limit thing? Is this temporary or permanent?
react to this message if u want less restrictions
^
Hmm I do agree, but I also get it tbh, the increase in traffic is expected. I do get the rate limit thing and login thing. But I hope it's a temporary thing until there's a workaround, that would be great.
Oooh I see do you know the break time after that?
no
hmm
there was one message that told me to wait 48min but I didn't see any message after that
see? even that isn’t clear. with llms you get “use again after 50 minutes”. with image you don’t receive anything. seems censored to me
I did get a message to wait 48min but one time only
I guess waiting is a good option for now
i guess. but so much restriction affects user experience.
They said they're 'taking feedback' from what we've been saying
It remains to be seen exactly what they do
It seems like
i think lmarena will have a paid tier by the end of the year if it continues like this
but of course one can't really complain about it
That does NOT align with their mission statement. At all.
i'm surprised it has been free for so long
Funny they didn't include GPT-4.5, no doubt the best model on SimpleQA
As per their statement: "LMArena will stay open and accessible to everyone. To do that sustainably, we’re focused on creating long-term value through services that benefit the entire AI ecosystem and serve the larger community."
"At our core, LMArena will always provide:
An open, accessible platform for the community to participate in evaluating and comparing models through real-world prompts."
Limiting people who aren't logged into Google isn't only ridiculous, but just plain not right
In third party papers, GPT 4.5 scored ~25% higher
they raised $100m from investors recently on a valuation of $600m. where does that come from? obviously one part of it is selling the datasets of prompts and responses, but clearly that's not enough anymore.
and yes, nowhere in that mission statement does it say it has to be free
I don't care if it's trained on our prompts
Hell, every ai thing is trained on the internet
Things are scraped from all over the place
Also don't understand how logging in will help the ai ecosystem more??
So they can track your usage? Idk...
It's not mandatory, is it?
I think that might be the case
It was just a requested feature
the site is for benchmarking not data collecting...And also it has been free and unlimited for 3 years, what changed now exactly?
it's mandatory if you want to use the site for more than 10 minutes now lol
but yes, it was originally requested because of the history being wiped every once in a while
I've been using it for more than 10 mins today, seems fine
I'm not logged in
I feel like after the damn banana stealth model they got popular and so with that they saw how much money they could make
pure greed...There's literally nothing else behind these updates but GREED
well i got this after 3 prompts today
We can calculate refusal rate for all of those by simply subtracting. 4.5 has next to no refusal rate, but interestingly enough 4-turbo seems to be above 5.
Logged in users get 'higher rate limits'
Odd, how many convos have you had in the history? Maybe it's because I've already had lots of convos, so it doesn't flag it.
the VC money came long before nano-banana
7% refusal rate for 4-Turbo
no it happens both on new and old convos
Privacy issues side, it's probably because it has better verifiability
does the limit affect chats too or just image gen
They said it affects Battle Mode as well!!!
And direct chat
Oh well
If they just didn't force the login and gave anonymous users the same limit as logged-in users i would not have any problem (I WOULD)
but they added the rate limit to the battle mode tooo which is the whole purpose of the damn site???? Am i wrong?
Exactly.
Enough people need to voice their concerns about it so it can get resolved
Their paper showed that you need a certain number of conversations in order for adversarial (I forgot the term) vote detection to work
Being complacent won't solve anything
react here if u want less restrictions #general message
but yes, based on what i saw with basically every local newspaper in my area, the pathway to monetization is to first force everyone to use an account, and then put up the paywall after a while
jesus they making me hate capitalism
I don't think it's monetization, it's hard to detect vote manipulation for accounts with only a few votes
everything ...Every fricking thing in this world has to be ruined by money, i mean how much money is too much money
you guys react here to show your opposition to the new update #general message
They had a paper on it last year
This'll get drowned out in the sea of text
Also o1 seems to do very well for it at 14%. That's the main reason it doesn't hallucinate more overall than gpt5 despite lower accuracy
I personally haven't encountered it yet. Were you using battle or direct mode?
battle mode
Interesting, maybe it's more of a rate limit than an absolute limit
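For anyone unsure what the difference would look like in practice, here is a minimal sketch contrasting a rate limit (requests per time window, which frees up again as time passes) with an absolute cap (a fixed budget that never refills). Class names, the limit of 3, and the 60-second window are all hypothetical; the actual LMArena limits have not been published in this chat.

```python
from collections import deque


class SlidingWindowLimiter:
    """Rate limit: at most max_requests within any window_seconds span."""

    def __init__(self, max_requests: int, window_seconds: float):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self._timestamps = deque()  # times of recently accepted requests

    def allow(self, now: float) -> bool:
        # Forget requests that have aged out of the window.
        while self._timestamps and now - self._timestamps[0] >= self.window_seconds:
            self._timestamps.popleft()
        if len(self._timestamps) < self.max_requests:
            self._timestamps.append(now)
            return True
        return False


class AbsoluteCap:
    """Absolute limit: a fixed number of requests, with no refill over time."""

    def __init__(self, max_requests: int):
        self.remaining = max_requests

    def allow(self) -> bool:
        if self.remaining > 0:
            self.remaining -= 1
            return True
        return False


limiter = SlidingWindowLimiter(max_requests=3, window_seconds=60)
print([limiter.allow(t) for t in (0, 1, 2, 3, 61)])  # time is passed in as plain seconds

cap = AbsoluteCap(max_requests=3)
print([cap.allow() for _ in range(4)])
```

Under the sliding window, a user who spaces out battles would rarely hit the limit, which would fit some people seeing it after 3 prompts and others not at all.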
The whole point of Battle Mode was to be free and unlimited to compare models. They really s h i t the bed with this one.
it is the path to monetization...Also now that they have your complete data, be sure that they will sell it to ai companies for bonus benefits on top of the payment they get for putting a model in their site.. Or even worse
react here to vote to have less restrictions #general message
Stop
They said non-Google logins are planned... the data is supposed to be open source btw
Like the data from May or smth is on Hugging Face
I'll be completely honest. They're a big company. Don't trust big companies. How do I know? I work for one myself, sadly...
it’s clear people don’t want this new restriction system
https://www.prnewswire.com/news-releases/lmarena-secures-100m-in-seed-funding-to-bring-scientific-rigor-to-ai-reliability-302462025.html for those that don't know btw
/PRNewswire/ -- LMArena, the open community platform for evaluating the best AI models, has secured $100 million in seed funding led by a16z and UC Investments...
sorry to break it to you, but i swear to god that they will not change anything, they do not care, and they don't read these messages... im just talking so i can cope but at the end nothing changes
this was in may
But this isn't social media. Liking your random thing won't make them change anything.
Interesting, what website is this?
brings awareness
yea they do care their user base was built on top of the old system. if they change it they know nobody will use their service anymore
It almost borders on farming likes
I couldn't care less about likes or upvotes or w/e
no it’s not like that . just express your unlikings
Oh trust me I have
A lot of us have
At the end of the day what'll it get us
They're a company and they're going to do what's in their best interest
Whether that aligns with the mission statement or not
Companies lie. People lie.
That's just the way of the world
the data they will have access to when they have your google account is a very different and superior thing than just the data they get from the prompts you are putting on the site.
it’s not their interest to lose a majority of their user base. they know if they turn it into an ai router then many people will leave
what’s the difference between using lm arena and open router then
I have never used open router or heard of it
it is an ai router
once the investor money has been deposited, giving them a return on the investment is more important than any mission statement
On this third party benchmark, GPT-5-Minimal scores worse than GPT-4o: https://github.com/vectara/hallucination-leaderboard
GPT-4.5 beats both
you can’t give them a return on investment if the user base will leave
Yes, lying blindly to people. Thank you for proving my point.
You really wanna know how to get results?
Stop using their site entirely
many people will do that if it becomes paid
What permissions did they ask for? It depends on what scopes are granted. If it's only the openid scope, then all the site gets is a token.
If a huge number of people stopped using it, they wouldn't be getting their data for new models
they’ll stop when it’s paid
There was that guy a few days ago that brought up payment in their TOS
Let me remind everyone
It needs email and name scopes to access your name or email
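A small sketch of where scopes show up in a Google sign-in request, to illustrate the point about permissions. The endpoint and the `openid`, `email`, and `profile` scope names are Google's standard OpenID Connect ones; the client ID and redirect URI below are placeholders, and nothing here reflects what LMArena actually requests.

```python
from urllib.parse import parse_qs, urlencode, urlparse

GOOGLE_AUTH_ENDPOINT = "https://accounts.google.com/o/oauth2/v2/auth"


def build_auth_url(client_id: str, redirect_uri: str, scopes: list[str]) -> str:
    """Build the URL a site sends the browser to when starting Google sign-in."""
    params = {
        "client_id": client_id,
        "redirect_uri": redirect_uri,
        "response_type": "code",
        "scope": " ".join(scopes),  # scopes are space-separated in the request
    }
    return f"{GOOGLE_AUTH_ENDPOINT}?{urlencode(params)}"


# Placeholder values, not a real client.
minimal = build_auth_url("example-client-id", "https://example.com/callback", ["openid"])
broader = build_auth_url(
    "example-client-id", "https://example.com/callback", ["openid", "email", "profile"]
)

# The scope parameter in the URL is what the consent screen is based on.
print(parse_qs(urlparse(minimal).query)["scope"])
print(parse_qs(urlparse(broader).query)["scope"])
```

Checking the `scope` value in the address bar during sign-in, or the consent screen itself, shows which of these a site is asking for.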
I haven't logged in so idk what permissions it asks for
This is what it says in section 8: “FEES AND PURCHASE TERMS. Company currently offers the Service free of charge. However, we retain the right to charge for the Service, or any features or components thereof.”
their user base now is much larger than it was in the days of the legacy site, now they are so large that the thousand or so people who are in this discord are just a fraction of the actual people who use the site, which is now in the millions.
yup
However, we retain the right to charge for the Service, or any features or components thereof.
i hope im wrong but i don't think i am
holds the right to make it paid
maybe stealth models will still be free or with higher rate limits, since in that case the provider will pay for the comparison report. just like openrouter
we can’t know unless the site traffic is measured . take a look at the discord server members, we can assume that’s about 5-10% of all users
They're gonna have a dumpster fire on their hands soon
I mean they closed the whole chat entirely when the site wasn't working
I thought that was kinda sus to begin with
I think the allowance nowadays is more generous
3 in battle mode? That's generous?
You didn't use to be able to talk to Claude directly without hitting quotas in like 10 msgs
the nail in the coffin is the fact that they say “the service is free and we’ll continue to have it be free” but they have a rule that says “we can make it non free at any moment”. should be self explanatory
i agree, which relative to the profit they would make by keeping this system is not a bad sacrifice of the 10%
Or Opus, which is very expensive
I've had more than 3 battles today, so idk
Like I said: Companies can’t be trusted
This is the whole reason why I am so glad that I am able to gen locally
F censorship, guardrails, corporate handholding, etc..
Cheers, I linked a third party one above too
It’s pretty sad that the only person that we get feedback from is @echo aurora
There hasn’t been any transparency about any of this
how long does it take to gen a vid cause i have been waiting an hr now
For ppl who want anonymity, are you clearing your browser history? Otherwise it's not really anonymous, and prompts are logged...
?? if you clear the history the chats are still on their database
We keep getting told by @echo aurora that they can’t talk about the rate limit or how much it is
That isn’t transparent at all
If you implement new rules, people should be kept afloat instead of having to play the guessing game
Yeah, and your userID (or whatever is used to track) would be the same across all conversations if you don't clear your cookies iirc
Your prompts are already in the system lol Once you type them they’re logged forever
What permissions do they request from Google?
Doesn’t matter if you delete your history
Yeah exactly
Data collection?
Honestly I don’t care. I just get ads targeted at me.
I mean, does it ask for name, email, etc? Google will tell you the first time you sign up
I don't think oauth does that
I meant about the data collection
I use tor, clearing history isn't anonymous enough for me
Your prompts won’t ever vanish, though
Yeah, that's the best way for anonymity
how long does it take to gen a vid cause i have been waiting an hr now
Lol. This is a classic “technical requirement meets strategic opportunity”. The need of creating accounts to stop the bots is a mask to a restricted system that’ll be followed by paywalls. Not that hard to see what’s going on
Although if rate limits are an issue, I don't see why you can't just clear the session and establish a new Tor circuit
It should reset the limits, although I'm pretty sure that's against the policy
You can’t use a VPN because it bans you immediately
i don't care for my prompts because i don't use arena for actual work, i just use it to see what model is best for certain prompts.
Also im ok if the prompts are actually used for training, i want ai models to get better
Until you turn it off
hello! Any talk in Spanish?
I use a VPN, captcha is more annoying but I don't get banned
how long does it take to gen a vid cause i have been waiting an hr now?
We’re only supposed to speak English here
People will find ways to circumvent the rate limits but it won’t work
I mean 3 in Battle Mode is downright insulting
LMArena: “We have made the difficult decision to go Pay For Play.”
you can just use zen for that, but im really lazy to keep creating new containers every time i want to test a prompt
In Brave it's just one button
They say it’ll stay free, but the rule that they can flip to paid anytime and their silence on it shouldn’t be ignored. Even if we’re not a massive group, staying vocal shows them that losing core users isn’t worth the risk.
free user loyalty doesn't really matter tbh
Honestly they're offering really expensive models for free, so it's fine imo
Like it wasn't possible to talk directly to such expensive models last year
@trail creek @verbal nimbus I’d delete those last chats if I were you. You’re essentially talking about overriding the site limits 😉
They will not make it paid, but i think they are taking the yupp route which is the smartest way to keep something "free"
user loyalty would have been a concern if they were actually paying for it
if they make it paid then just use open router
Hypothetically speaking of course... I want all my chats to be stored 🤣
How is open router different?
they really don't read this i swear
Pineapple does
They’ve deleted a lot of things
and if too many free users leave the site, the rate limits will have to go up so that the paid datasets remain useful
So do other mods
so that's kind of self stabilizing i'd say
Sure, the models are expensive, but the value of LMArena isn’t just “free compute”, it’s the community that built the platform. If they cut the free user base down to scraps, they lose the very people whose input and trust make the site credible in the first place.
i mean if arena turns out to be paid, i think open router is the better option, more polished ui and free offers from time to time.
Again: Putting trust in a company is a fool’s game
There’s no such thing as an altruistic company
They all have ulterior motives
Exactly. and that’s why the only leverage we have is to make it clear that if they break the free, open experience, the community walks.
What, all three of us?
Yeah that’ll really teach ‘em!
yes.
You'd need to have a massive amount leave
Doesn’t matter if it’s three or three hundred if the core users leave, the platform loses the very people who give it value. We’re only 3 people here. You don’t know the lots that didn’t join the discord server
Most of them don't speak English and have no idea what we're even saying
If they did we wouldn't be getting spammed with /image
Maybe, but the admins definitely see it when the vocal part of the community speaks up , that’s why they already reacted to just a handful of complaints.
These complaints about the rate limits and stuff definitely go above @echo aurora and the other mods here
If it was a corporate decision, it's gonna be an uphill battle
I know , but pineapple told us he would redirect our opinions to the devs. I’m not saying it goes past an internal chat. But if pineapple really is a man of his word then the devs also know it shouldn’t be a good idea to impose such restrictions
Unless they got that sweet, sweet whiff of money
Money makes the world go 'round
We don’t know.
we've complained asking them to give back the legacy site and they didn't... I can't think of an instance when they actually listened to anything, they just give shallow excuses that they got from gpt and move on.
mmm..I'm willing to bet differently.
Yes you have been complaining but you think they’d actually listen? You need to take action and stop using the service
Many people will stop when it becomes paid
Hasn't happened yet, boyo
How can I try seedream v4?
i didn't use it for the past two days. But i don't think 10 people not using it is gonna be of any cost to them
Actually I made a mistake about earlier. I can think of one company that's altruistic and has continuously given people free things for years now: Hello Games
Again. 10 users here. off discord are thousands
They have continuously given free updates to No Man's Sky while other companies would charge
and there's some people here that don't even know what we are talking about and login to any site they enter without any care for security.
So a gem to say.
Listen, even if free users still wanted to use the service, they couldn’t
when it does become paid
But they're most likely not even native English speakers so they don't know what's going on
Yes but they won’t be able to use the service anymore when the time comes
Looks like rate limits are only for image gen as per #announcements ? That kinda explains why I haven't encountered it.
those people don't know what boycott is
That is true
They're having compute issues
Boycott will not be a thing. It will be “i can’t use the service”, not “ i don’t want to”
They're not a poor company
They have very generous benefactors and donors
All the major ai companies are represented on LMArena
They're all probably funneling money to them as well to continue training
they don't actually make models, they just get API from companies.
so how are they having compute problems?
It could be bandwidth or storage, we don't really know. They're probably saving the outputs.
Compared to text, images and videos are probably massive
I believe that stuff gets sent to the respective companies
They log the data too I think, otherwise votes can't really be verified in the future
I doubt they are saving everything though. When you generate it it is kept on the originating model's server for awhile lol
All I know is that we aren't getting any transparency about what's happening or why
We know they're saving text outputs, I don't see why they wouldn't save images