#general
1 messages · Page 228 of 1
I don't hate them because I am not biased.
But like please stop posting this.
It's getting repetitive and dumb.
you could've just typed this out there's no need to generate an image to get a point across
Do u prefer google veo 3 or Sora 2 pro?
Veo obviously
Like 100% better that?
Obviously yes
Thinking they've got a new image model 🤣😂😂😂
<@&1349916362595635286>
fake leaderboard scores
I saw some Sora vids where the focus change, like one shot is a guy’s face another shot is the trees then it’s the guy’s shoes.. I haven’t seen this done in veo yet
I saw this in Veo lots of times
I understand in their message that image is not real. Meant to be a joke
Oh man I guess I’m not able to do it then
Not official!
Unemployment rate will be very high, but not because of AI. There are political reasons.
I just edited with Nano Banana Pro!
And o4-mini Image isn't official by OpenAI in any means
This will likely never go up too much. People will curse AI and try to start lawsuits. There is no fear.
But the details of the OpenAI logo (the small one) are uncanny AI.
Maybe it sucks at this perfectly?
is opus just nerfed on lmarena
it lost to gem 3 flash on codearena
https://019b3709-8bc0-7d49-a01f-60e83ab2f91c.arena.site - gem 3 flas
https://019b3709-8bc0-7887-948d-5afc9789382a.arena.site - opus 4.5 32k thinking
Hello, why did the edit button disappear when creating images?
Why is gemini-3-pro-image-preview (nano-banana-pro) scored at 123S
Nano Banana Pro messes this up 😂
It's supposed to say "1235" but it messed up.
How AI companies feel after releasing a banger and then nerfing it after all the benchmarks come in
Poster by Nano Banana Pro.
But basically, it's this.
Does only 3:2, 2:3 and 1:1?
Even GPT-Image 1 in Sora 1 can do these aspect ratios 😂
I said ONLY these aspect ratios.
I give them ref image as same ratio I want (can add frame or something) and most AI will give same ratio back as result
except new Gemini that can be random, or most of Flux models that give 1:1
never try blank image before
have they taken away the video generation on lmarena ?
It only does three aspect ratios, and not very consistently. Kinda like the old Nano Banana, but that was immensely improves with NBP. Always follows my aspect ratio prompt.
still an experiment
👋
whats this
"how can flash beat pro??" -> the answer is RL!
flash is not just a distilled pro. we've had lots of exciting research progress on agentic RL which made its way into flash but was too late for pro.
can't wait to finally bring them to pro👀
this is interesting
lol
oai is done for
they are basically telling them this gemini 3 pro version is the worst of the worst and we didnt even add in our RL improvements on it
MiMo is Xiaomi AI
@bleak hull Please read the info posted here 👉 https://discord.com/channels/1340554757349179412/1397655624103493813 to learn how to generate videos or images using the bot.
is it good?
👋
Wait... so they do everything. From scales to vacuum cleaners to smartphones to cars to AI models. 😭
Chat chat , I wanna ask if we make ai videos in here everyone can see it
Corrent.
Hello! Yes, all video generations are public.
I had a question. All the mod accounts are named lm_mod. Is this because they are company accounts?
These accounts are owned by LMArena. Unfortunately, I cannot disclose more information about this. I hope you understand.
Alright. I understand well.
@swift oyster how's the progress?
i cannot see it on lmarena
@bright kayak Can I do it myself?
do what?
Like that thing you posted but with more of my model ideas!
Can I do that image?
sure, i don't own that site though you can do whatever you want
How?
Guide?
i have no idea what you're talking about tbf
modify the html
It's now so easy to fool people 😂
Yesterday I fooled 🍍
it's always been easy to fake something using inspect element
This is PNG:
<img alt="sora-2" class="-QSrg" draggable="false" src="https://cdn.openai.com/API/docs/images/model-page/model-icons/sora-2.png" data-loaded="">
Not SVG.
so what?
Yeah, but not so easy with image manipulations, just text
that's true which is why i said inspect element and not image manipulation
Chat is there a website where you can store notes online without files and you can access them with URLs
PS: Not asking for Google docs / Microsoft word
Like edit the text in the image to say stuff like "o4-mini-video" 😂😂
well yeah but anyone can now use some sota image model to change that
Nano Banana Pro?
but it always looks the most convincing when you do most if it yourself, like you wouldn't change actual text using ai if you can modify it yourself
yeah or qwen image edit, images 1.5, just don't actually pass these things as real
oh and that background image is a png but the text itself is an svg overlayed onto that
since making a custom svg would take forever i just deleted that element and put an h2 in there
best way is to use a p tag and then a custom font size
I did it!
looks good except the background is for gpt-5.2
What is this site and how to get there?
I fixed it for you.
The existence of GPT-5-nano implies the existence of GPT-5-pico, femto, atto, zepto, yocto, ronto, quecto, micro, milli, centi, deci, deka, hecto, kilo, mega, giga, tera, peta, exa, zetta, yotta, ronna, quetta
Unofficial thread for o4-mini-image
Gemini drops for the year have concluded
We wont hear from google till the new year
No one needs your AI-generated code 😉
No nano banana flash 😔
Good radio show about LLMs 👇
It's NotebookLM.
I know, bro
I created very interesting radio show about LLMs via NotebookLM.
But 8.4 MB for just 4.5 minutes of speech ☠️
Smart speech👆
You can listen to it without downloading😉
Remove it. It is prohibited here to show images with N word.
Your image contains N word.
god i am such a loser
For speech, use AAC-LC (FDK-AAC) at just 36-40 kbps for 44.1 kHz, stereo, like in this file. This should fit on a 1.44 MB floppy disk! And it still sounds pretty clear and good!
Where can I submit my proposal to Lmarena?
WHAT
you surely joking lol
It's most definitely not any bigger than 2.0 or 2.5 Flash
and most models are getting smaller, not bigger 😉
I think the correct way to read this is that small models are progressing even beyond what we saw with o4-mini nearly 1 year ago, as could be expected...
Hello, if you’re trying to submit a model for evaluation you can send us an email to evaluations@lmarena.ai
If you have a feature request using this forum would be best #1372230675914031105
I hope this helps!
What was o4-mini-high vs o3 on ChatGPT now is Pro vs Flash and a modern take of it. Fittingly I've noticed Flash does indeed reason more (outputs more) than Pro as well
Honestly it was only a matter of time till we got smaller models progressing to be able to score higher on SimpleQA...
Or is it on higsfield, or something
And it's not a very recent benchmark anymore
anyone got any idea?
whats up with Gemini-3 Flash? What even is it?
Ehm probably gdrive storage 2tb plan
First 3 months dirt cheap, then $10
And u get nanobanan pro?
with a lot of generation?
I thought api was and ultra was only way\
you do get it, as you get "Plus" AI plan with it for Gemini
but I'm not sure what is the specific limit
Hmm, I see, I've got the plus, gonan try it
I help who are going to build new project with AI.
Thank you for your reply. I would like to send my proposal via email.
Can you send us an email at contact@lmarena.ai? I would note I’m currently on vacation and away from my computer, but will try to get you a response asap.
Yes, I can
Why isn't the video generator available on mobile?
I submitted my proposal.
oh really
i need it
This is an experiment we’re currently running, meaning it’s not always going to be available and will be random if you get it or not.
What the sigma, is this issue on my end?
How can I view all my previous results?
what does that mean
https://one.google.com/about/google-ai-plans/
With the Pro plan, you basically get a s***ton of stuff, including 2 TB cloud storage and all kinds of AI goodies. The increased rate limits in Antigravity are pretty cool, too.
possibly being tested on the Arena right now.
no
From all new models that came to arena recently
none was by Zhipu
No
Login for google account and email 💀
I'm going to be real with y'all they really need to start fixing this i understand to be patient at all but it could be just me but is it just that more difficulty seem to be constantly appearing override the past like 3 months give or take with the verification issue the login issue occasional freezing issue and Etc but they are just slowly being killed off of interest
We are waiting...
Maybe this is how they reduce the load.
Well then they should tell us do not make it look like it's an issue that's ruining the entire damn experience
I can't log in despite using my Google i'm constantly being asked for the verification for every single prompt there's times where I it freezes and then I have to reload which other risks my entire chat disappearing or makes it freeze in the road and start my whole chat over again
Anyway to fix it? or do I have to wait for them to fix it
...... I have not even typed a prompt yet
I cleared my cookies and logged in with my Google account from a different browser. The only thing missing was the account (it was logged in but didn't show it) and video creation. A new mode was added.
Oh yeah, this was a real problem at work today...
I know that's genuinely helpful advice but for me I have tried all of that. remove Cookies. I can't even log in from a different browser it automatically gives me a error i can't even do the verification on some of browser without giving me a error of not being able to connect for some damn reason honestly I might have to give up at this point because for me to even use it I either have to go to private modes of Google which is only temporarily working and for the more permanent ones I have to use alternate browsers that have ultra private modes which have their own issues of me not being able to even log in make the verification that's already sensitive going to overdrive
🥲
Firefox esr
And that's all after I have done a full-on reset developer reset normal reset
For bussines
Ohhh
See why I'm frustrated
Well, wait until the morning... I'm just from Russia and it's 12 midnight here)
Yeah
hehe
HOLA
Does anybody here subscribe to Claude pro?
Ive been running an AI benchmark I made and opus 4.5 keeps timing out on lmarena because the prompt is so long
The same thing happens to me, I try to log in on my computer and I get an error message, plus the verification appears on top of it, why won't it let me?
This is the last straw, fix this!
How to put the image with the prompt to animate it pls
I'm struggling
That we can agree on i have already tried three different browsers to see if that changed anything nope
And yeah for some reason even when I managed to log in on the new browser it doesn't show my damn account
That verification so annoying
please check out #1397655624103493813 for how to turn your image into a video
Why are the LLaMa models not included in the leaderboard?
https://019b38e8-c4d3-74d5-949a-3c295e86bf5d.arena.site
"No politicians were harmed in the making of this infographic" , Gemini certainly has a nice sense of humor. 😂
Prompt: "Create an infographic about the US government in 2025, using the art style of "Horrible Science". It should be informative and fun."
"brain of the operation" is a bit generous I think
I accidentally selected Code instead of image generation in battle arena. 😅
:0
did you at least get a cool website out of it?
Well yes, the one I've linked above. 😊
hive-leader
he is the hive leader of the operation
That sounds like a threat to the competition. 3 Flash is such a fantastic model already
oh lmao I just read the last message and thought it was stand alone
yea
they are, they just rank very low
Omg soooooo low lol
I've not given up hope for a Gemma 4 Christmas release yet. 😉
Why meta is worse in AI, they have all data of internet lol
Yeahhhh I need this to my xmas 🤧
Or glm 5.0 👀
But I think the glm 5.0 just 2026
Cause they entirely missed the reasoning revolution
and also overtrained their model for lmarena
back before style control
I know
so is gpt 5.2 garlic
yes
can someone explain to me why it won't let me verfiy email or clean out the chats
I keep trying but all i get is
"this is a invalid url"
And when people say "give it time it'll be fixed" THAT DOESN'T HELP ME FIX MY ISSUE
SO I WANT A CLEAR WORK AROUND
ONE THAT WORKS
Why is it like this?
When will you finish this experiment? Because it’s worse than I thought. It ruins the context of conversations. It replies to previous messages instead of the current ones and sometimes just ends the chat because it doesn’t generate a single response and doesn’t even let you choose a second one. Are you going to finish this, or do you want to lose every direct chat user?
What do you mean by: "It replies to previous messages instead of the current ones"?
Do both models do this when you get a battle? Each of the battle models should be getting the identical context as if you have just sent another message in direct chat so if something else is happening it's a bug
The two vote needed to reveal the model is not cool at all 😡💢
Ye fr
Sup
what is the context length given for direct chat?
They've also had a relatively strong AI research org for a long time. I think they understood that it would be difficult to build a stronger foundational model than Google over the long-term so they tried to dominate open source and make it unprofitable for newer players like OpenAI and Anthropic without established business models to buy themselves time
Then pivoted to incubating a longer term AI research effort, and we have no idea how it will pan out
A monkey brain is smarter than meta ai intelligence 🤯
Meta's AI research org is still more capable than equivalents at Apple, Microsoft, and Amazon
hmm why is web is delayed ?
ah is up a bit
Idk then use the Discord bot instead
hmm i see
It replies to previous messages instead of the current ones and sometimes just ends the chat because it doesn’t generate a single response and doesn’t even let you choose a second one.
Can you help me understand this a bit more? I’m not sure I’m understanding what is going on when this happens. If it “doesn’t generate a single response” does that mean you see that you’re going to get a battle, except both of these responses don’t generate and you’re just stuck? If that’s a the case this is for sure a bug.
Any chance you could grab a screen recording of this happening?
In a world, where to cost of answers is dropping to zero, the value of the question becomes everything...
Learn the hard way, click https://boot.dev/?promo=ART and use my code ART to get 25% off your first payment for boot.dev.
Recently I was offered to be replaced by AI. Like a "magic gumball machine" they said. That led to a question: what ...
Hello guys. Pls i need a suggestion of a model that I can use to create a puzzle or word search.
maybe Gemini 3 pro, in AI studio (google) ?
thanks, will check it out
It’s more like this: one generates normally, and the other can stay stuck for several hours in a row. When it happens to me, I can make a screenshot for you. But lately, I’ve been using LMarena less. You know, the battle on direct chat annoys me a bit. I mainly use LMarena for roleplay. I’m not looking for information, I’m not writing code, I’m just having fun with roleplay. And the battle on direct chat messes up the concept and logic of that. But I’ll try to write something and get that screenshot for you. I just want to say in advance that it will be in Polish because that’s what I mainly use. If that’s a problem, I apologize.
This was a bug, I let the team know and they are rolling back the experiment until it's fixed. Thanks for reporting this!
Okay, no problem. I mean, if the concept were maintained and the messages generated normally, it would be fine, happening only occasionally — and then I wouldn’t have an issue with the battle. But for me, it literally kept getting stuck all the time, popping up every 2–3 messages and completely losing the concept. I’m counting on you to fix it somehow, because I really don’t mind choosing occasionally, but you know how annoying it can be with bugs like this, right?
yeah after I saw your message I reproduced the bug and when I showed the team they agreed and turned the experiment off for now. That's not the experience we want user to have
dang that is an intended feature not a bug??
i thought the site was just going crazy lmao 😭
I think they have been in recently are in experimented phase with a lot of stuff it seems
I don't wana complain since the site is free and all, but experiments should probably go off the main site and go to the beta-canary site.
Good for you all that you can all use it without having to go Ultra private ( and e even then have hyper verification that's hypersensitive on its own with the pictures)
Fair enough you can probably put a feedback post about it if you wish
Does someone know free unlimited ai video generation?
This is good feedback. I imagine the two downsides of doing this though would be: a lot lower volume meaning it'd take much longer to get an understanding if the fix/feature/etc are having the desired effect or not. And second being behavior may be a bit different from canary -> regular site giving us an incorrect understanding of how a change is performing.
#1397655624103493813 has more information on how to use the Video Arena bot. Let me know if you have any questions about it.
Personally I would give my personally my opinion to the downsides is so if your changes are going to cause this much deception that you're having this many people make a report on it i do think the cost backlash back down is worth it though I understand why that calculation exist even if honestly if it doesn't change soon with the current experimentation I may have to give up on it hope you understand it's just too inconvenient for me though that said if you want to counter that downside then you can just use it as a springboard information and then graduallyUse that information to make a baselineAnd then referenceThe normal main browserToThe Blind SidesAnd then make HKG guests what can we changeUsing the comparisonBut then you can change it after it's deployedSo you get a mix of deep BaselineInformation for improved changesWhile using educational guestsTo fill in the empty spots while leaving the everyday stuff that works perfectly fine on the browser
Sorry
In my opinion, the downsides Are honestly worth it with the issues it's causing with many people for it to be on the main website
you could use the feedback in the beta browser as a starting point to establish a baseline. By comparing the new changes to the main browser, you could identify areas that need adjustment. After deployment, you can continue to refine the changes based on this analysis. This way, you can incorporate detailed feedback while maintaining the functionality of the existing features that work well in the browser. ( hope this makes better sense)
( i really hope that makes more sense what I was trying to say)
has the Smart Router option always been there??
No from where I can tell it's something that they tried a while ago but it's a new experiment now
yall think gemini 3 flash better than gpt 5.2?
its pretty neat! Hope they keep it
I'm not going to lie i actually prefer gemini 2 ( have no idea really why I think it's because of the way it does itsAnswer prompts since I most i mostly roleplays as organizations or make role plays with characters and Nations with pretty detailed historical history and culture on a basic levelBefore I have it give me the opinionsGroups and civiliansWithin that worldAnd usually sometimes after that have it give me reactions as I make donation in the modern day that I madeTake actionsOr in characters through actions Etc)
@echo aurora
This is misleading. It's not saying that Flash hallucinates more. It's saying that a higher percentage of Flash's wrong answers are caused by hallucinations, but it's also significantly more accurate. The other models have a similar percentage of hallucinations overall but have more other errors too.
true, hallucination is a problem for gemini
you got a point
hallucination is not good, yes
Yeah this makes perfect sense. Thanks for sharing and then elaborating. It's possible we do something like this in the future. I can't say I'm aware though that we have these plans. We've got a large list of new features we want to build and bring to the platform.
Fair enough it's just I think the costs are worth it because of honestly crippling issues your testing is causing when it comes to trying to enjoy it with there's other stuff going on that makes it basically impossible to even try to use it normally ( i'm referring to the stuff I have reported to you or mentioned in the direct messages)
I hope you can at least mention the idea to the team while you mention the other stuff we talked about to them in the dm so it's on the table and not forgotten
What a pain with the lmarena filter. The funny thing is that in the normal version, nano bananas pass through without any problems.
This is a recent experiment that is unfortunately resulting in false positives being flagged. We are collecting examples in #1447983134426660894 so feel free to share there if you've come across some. My direct messages are also open if you prefer to send there.
☠️
he just ragebaits
dont waste energy
yeah its been a problem lately
Honestly, it's nonsense to me. The models already have their own filters.
I would consider it a bit more of a problem but okay
( hope that didn't come out too passive aggressive)
nah its fine
it is a big problem ive heard
but ive just been using NBP on yupp whenever its available for manual selection since yupp is slowly dying so
i hope lmarena gets their shi tg at some point
I'm just incredibly frustrated because I have basically three problems now because I can't even log on using Google even private browsers and that's not even me not being able to try on the normal browsers with all the other shoes that are happening on there
Only on mobile does it seem to work somewhat well but even then occasionally the problems will still appear and honestly to work on the mobile is kind of not as good as it's on a desktop or Chromebook for my experience
@echo aurora I'm not sure if you're real or a bot, but stop messing around, bro. Models already have their own filters for a reason. 🙄
Sorry, I had to get my revenge lol
You're not wrong though at least I can't be called rude since I'm not the first one to say it
( i do agree with the sentiment about the filters though)
Nano banana lowers the censorship. Lmarena: Puts on a filter nobody asked for
Okay, that's enough.
nano banana pro actually has a REALLY tolerant filter as good of a model as it is
which is so surprising
youd think itd be censored to oblivion
What would I give to even be able to try to log on for some random reason I can't even log on😭
yeah i honestly
have ZERO idea
id just completely use diff browser
I remember when it came out you couldn't start a single fight or anything.
its just such a good model
I mean the original nano banana
i cant imagine anything better for a hot minute at least
its like
its not even an image generator atp its just
genuinely a pair of eyes that can draw
its like, more of a tool if anything
I have three three separate ones i have used standard and private sorry if I sound aggressive I'm just the only reason I am even having this patience to try to talk with the staff about how to give so-called examples it's because I have fun with this
Especially when stuff like this happen on the regular
have you researched into the problem :p
is it wild i prefer the older model
like, research w ai or google or reddit or any other outlets
What is there to research when it's happening on several stuff despite full-on hard resets switching to developer resets occasionally happening on mobile and more i have tried searching up if I can't find a concise answer when I have three separate things that are basically crippling my experience at the same time
i mean you research it, you may find the solution
as much as i like trying to solve problems on my own genuinely sometimes i need help lmao
@native yarrow just can you give a reply to this cuz I don't want to on this anymore cuz I have done research and I just don't want to talk about it even if I brought it up it's just frustrating cuz I have done research I'm trying to talk to staff but it feels like I'm talking to a brick wall in the bug forms sometimes
nah not really, i mean it is your opinion but if you want it to reply a certain way you can also use system instructions
I figured but I have no real idea how to make that work on the browser and I don't really have any other free way to use that model
I do remember that one time I managed to make a whole nation and civilization with culture that with historical from our world tied into it and more pretty damn fun been trying to copy it do it again but clearly that has been interrupted for the meantime
Anyway how hope you all have good days
hello everyone, can you tell me how to generate videos through LMArena??
Yup, I am a person.
We do have our content filter in place for good reasons, I've spoken to it more in #1447983134426660894. Do want to reiterate that we're interested in hearing feedback from the community along with specific prompts that have been flagged as false positive.
More info can be found in #1397655624103493813
is LMArena down or is just me??
Models like GPT-Image 1.5 and NanoBananaPro already invest millions of dollars, and other models as well, to verify that prompts do not go against their usage policies. They already have guardrails that censor many things. What is the need for LM Arena to add even more censorship? Prompts that NanoBanana handles in the app without any problem are directly flagged by the website as violating the usage policies. Honestly, I don’t understand who came up with this idea.
prolly cuz not all image models have the same guardrails
keep in mind that LM Arena is supposed to be a research and model-evaluation platform for comparison. By imposing its own additional restrictions, LM Arena directly affects the results of those comparisons. You’re no longer evaluating the models as they actually behave, but rather how they behave after passing through LM Arena’s extra censorship layer. That undermines the validity of the benchmarks and the purpose of the platform itself.
The idea is also to buy the safety filters for the models
For example, testing the nano banana and gpt filters image 1.5
But this stupidity complicates things.
@proven bronze @hard quiver You're not alone. This is a known issue caused by a recent policy change. We're all talking about it in here:
https://discord.com/channels/1340554757349179412/1447983134426660894
Just bug in your pc, the lmarena is working
Omg, I'm liking the flash 3.0 for small tasks but it halucinates sooo much 😭
3.0 pro likes stoner but flash is another level
My luck have the opus 4.5 and sonnet 4.5 in anti gravity, I just will use gemini for tasks, not file organizer
thank you, but can't it be generated on the site? I just see that some people are complaining.
Guys gemini 3 flash on lmarena is thinking or non-thinking model ?
Flash 3 thinking = high, flash 3 minimal = no thinking or lowest thinking
or impossible?
Not currently, sort of. Right now, Video Arena is ran through our Discord bot, information on how it works can be found here #1397655624103493813 . We are currently experimenting with Video Arena on the site, so it's possible you see it available there. But this is going to be random.
okay, thank you very much
I have a feature suggestion for the Arena: File Uploads. It would be amazing to be able to upload PDFs or text files to compare how different models handle document analysis and long contexts. Copy-pasting long texts is quite difficult right now. This would help us test the "Chat with Data" capabilities of modern LLMs.
Also, I can't figure out if Gemini 3 Pro has a 'thinking mode' or not?
5
I am getting Captacha on every promoto i request even i am not using much 2-3 writtings i might ask from LMA. any one can guide me how to get rid of that? I am not doing any spamming seems like server or my chart having issue
It certainly isn't our intent to unnecessarily flag content that shouldn't be flagged. It's not a good user experience for content to be flagged when it shouldn't, and we don't benefit when this happens either, it's bad for everyone. Having a content filter allows us to maintain a safe platform while simultaneously being able to work with a wide variety of models, that may not have appropriate moderation.
Currently, there are negative unintended side effects resulting in some prompts incorrectly being flagged. Feedback, and false positive prompts are being collecting in #1447983134426660894.
It's possible we make changes to how tis works and we do want to hear thoughts from the community for how this is handled. We have every intent to get this right.
Thank you for sharing! This is a highly requested feature that's on our team's todo. For future reference sharing in the #1372230675914031105 forum is the best place to share these ideas.
Sorry to hear this. Can you provide me some Eval IDs for the sessions this is happening to? I'll flag to the team.
How good is new open ai img gen?
Looking really good on the leadeboard right now -> https://lmarena.ai/leaderboard/text-to-image
I saw that I also saw so,e criticism from users was wondering why
good, but not as good as nano banana pro
What’s good about it? N where does it fall short?
much more improved in text and other quality
they allow human photo in sora, did u see?
yes
Werid
Wtf this
How did u get access
its random. create a new acc to test. if u click + sign. and get this, it means u have it
it makes the original photo in like chatgpt filter. yellowish
Wishing the admins and members a great weekend!
Will gemini-3-pro-thinking-high?
I CANT LOGIN OR CHAT(ACCEPT TOS) T-T @echo aurora pls help me T_T
Sorry to hear that! Are you getting an error message? Can you try clearing cache? Is this mobile or desktop we're talking?
I tried all (on desktop, i dont have any mobile), even reinstall window, change the browser
I can vouch for that ( have tried different browsers myself three different types all of their own inconsistently efficiency with the private modes of respective ones an issue is still there even after I have done put on resets such as going into developer mode reset etc)
and list_limt 20
Well im gonna afk for 7 hours
is this even allowed here? it has an invite code attached to the URL
list_limt 20
Sorry to say I'm not following this, when you're back if you wouldn't mind elaborating here that'd be appreciated
I'm going to start a thread in #1343291835845578853 for login issue problems to try and consolidate some of those reports.
He posts this link with his invite code all the time here.
We'll keep an eye out, I haven't seen other moderation actions so I guess we've missed those.
What is his username?
I'd rather not share. Doesn't feel right to do so.
I just can help to find his another massages with this link he has posted before, without his username I can't. But, ok.
I appreciate that. I've already taken a look. But ofc if this happens again don't hesitate to let the mods know.
Gemini 3 Flash says Sora 2 didn't release yet.
WTF haven't these guys addressed their service not working any more? What kinda chimps are running this joint?
anyone remember that article chatgpt posted recently on reducing hallucinations throuigh an after analysis? I forget whatr it's called
i'm trying to find it but can't
Does anyone have an idea which model "zebra" actually could be?
Hi guys, what's AI is the best one for making LUA code?
says its anthropic made
That would fit. It seems to be really good at coding.
lowkey curious, whats the chance for someone to get access to onsite video generation? I've seen some people I know get access and from what I've seen, it's quite a low of a chance
hello
<@&1349916362595635286>
hehe
Damn Sora got locked down hard lol
Hi
try disabling adblocker
Yes, and even 3.1
I want veo3.1 where i got on this server
In bot.
Can you tell me were
Hello, I'm an healthcare worker and I'm here to generate videos 🙂
hm. AI Studio updated their Terms of Use. They specifically say personal use / not for development use for google API is forbidden now 😮
In video arena, when I use google veo 3 audio fast, I upload a photo and then I write a description. The model never follows the photo but just the description… why?
What
Isn't ai studio made for developers to test stuff
yeah, they're saying it's only for developers to test stuff for development with the google API now, I believe
hm good point
Probably because of regulations
hopefully that is the reason
and they won't start banning people from it for 'personal use'
I'll still use it for personal tho
I think lots will 😄 question is whether they will enforce it or not
I heard that they made it so if you use casual chats too often it'll increase the rate limits on models
This was month ago or something
I've used it for development uses too so maybe that's why i never hit the limit idk?
did they get rid of the falcon models on lmarena?
I don’t understand, I won’t be use Gemini for personal use anymore? wtf does it mean?
If I can’t use it for myself then for who?
Hello I'm facing problem
every single prompt/chat
captcha pop up
even captcha solved and got error
how to fix this
AI Studio, if you use it.. not LMArena or Google Gemini itself
and really it just means they can ban you and you're technically violating their tos. as Jolo said, it could just be because they're trying to cover themselves legally
whether they choose to ban you or so they can skirt the personal data privacy rules
if you don't know what AI studio is, you don't need to be worried
I think I used to use it for imagen months ago
But I stopped and I started using lmarena who also has a community
you should be safe on lmarena, it allows personal use specifically in the ToS "You acknowledge and agree that your use of the Service must be limited solely to personal or internal business use."
Do u mind explaining to me easily what does “personal use specifically in the ToS “ mean?
sorry I mean:
The rules on lmarena say you can use it for personal purposes
so you have nothing to worry about
I hope a lot of us do assume a positive intent, but UI usability issues are massively compounding now.
The periodic infinite generation issues, never-ending CAPTCHA loops, weird glitches on the site, etc. etc.; on the CAPTCHA specifically, you've got it tuned so that if a logged in user pastes text it's likely to trigger a CAPTCHA loop, and at the same time the front end is frequently unstable - so I'd have to be crazy not to be reworking prompts on the desktop and pasting in.
On moderation, LMA used to be fantastic for comparisons - taking known good prompts/prompt structure from (e.g.) Gemini, and comparing with other models or using battle - but increasingly things Anthrophic doesn't object to for text, or OpenAI's (notoriously broken) image moderation won't false-positive, can't be run in LMA. When I'm having to rework things to fit LMA, the point of using it is lost.
I completely get (not least from work role) why Cloudflare and moderation filters are necessary, but please consider either offering some transparency to explain that the scale of the problems you can see and we can't is worth these continual issues and "experiments" (it tends to come across as just tweaking for the sake of it, while the "maintain a safe platform" strapline just reads like cost reduction), or take some of the practical and positive suggestions people have offered (e.g. selectively applying your moderation layer to non-moderated models).
Not everyone shares these issues.
?
I thought about something with gpt image 1.5 compared to nanobanana pro, I think the general consensus is that nanobanana pro is more accurate but at the same time gpt scores higher elo. I would like to think it stems from esthetics, gpt image has more of an artistic flavor to it compared to nanobanana's almost sterile accuracy, it's like comparing a raw image to a color corrected one if that makes sense. Both have great world understanding but nanobanana is more up to date and better at simulating the world, now if we humans have a bias for overly saturated images or not might effect the leaderboard. Is the leaderboard optimized for accuracy or just what looks good for the average joe.
First, not everyone is experiencing these issues. A large portion of users are able to paste prompts, run comparisons, and use battle style workflows without hitting CAPTCHA loops or blocking behavior at the frequency you’re describing. That doesn’t mean your experience isn’t real, but it does mean it’s not universal and that matters when people frame this as a broadly broken system rather than a subset of edge cases.
Hello
The frequency is "increasing", and I'm not framing.
What makes it especially difficult to address is that these criticisms are almost always framed in very broad terms “moderation is broken,” “the UI is unusable,” “things that should work don’t” without concrete, reproducible examples of exactly what content is being flagged incorrectly, under which models, and in what context. When people say moderation “gets it wrong,” they rarely specify the exact prompt, the exact failure point .
You've just listed three things I didn't say, or semantically interpreted and recapitulated with your own framing. People have been offering examples, but equally it's not a productive way to debug a high volume system.
it claims to be Gemini
I apologize for the confusion but I’m not sure I understand the nature of your problem.
How to create a video of at least 50 sec?
I apologise for the confusion but I'm not sure I understand the nature of your problem and the originating cause for the original input that you proposed in response to the response that I made to the earlier response.
But that could be just me.
impossible [here]
SeeDream Playground does 12 seconds (I think?); Veo offers longer periods with API.
Thanks a lot
FWIW, shorter (with first and last frames) can be better (to get micro prompt-adherence), then stitch together. If you want seamless it does need to be something like Veo, but obviously there's a cost.
<@&1349916362595635286> idea for this group: adding sections relatives to each LLM (or at least the main ones), adding subsections where share their findings/ideas: like gems, gpts, canvas.
"grok-4.1" = grok-4.1-thinking [in that poll]
There should be a "login" button; it will pop up a dialog; if you put in a previously unregistered email, it should let you register.
i try to register with Gmail but seem that lmarena.ai has problem about Authentication login with Gmail
or I try on another gmail again
I try but lmarena.ai still not register my gmail
this is the error
Who know the error message
Possibly this: https://discord.com/channels/1340554757349179412/1451226671406514186
Or it may just be a transient issue and you'll have to clear the cache and/or try later.
very traffic Right
do you ahve an adblocker enabled?
can you tell how to check adblocker ?
you have it installed or nah?
also open console and see if there are any errors
on my widow 11 , I use chrome browser only
||black widow? :) ||
||-# black mirror||
Look to the right of your address bar in Chrome. An adblocker is often a red symbol like a stop sign or shield (like Asura illustrated).
@echo aurora ive noticed this issue a lot with lang path.
as you can see
the other guy also shared the same issue with /pt path
Yes , I am thailand coutry
yea ik
can you try to remove the /th
its a hydration mismatch
im guessing its trying to change the webpage lang but its not doing it
yes , I uses https://lmarena.ai/ when I enter it will display https://lmarena.ai/th my location always
hmm i see
so it redirects you always to /th
I am asia zoe
yea
idk tbh
try another browser
use brave or firefox just to test if it works
let me try brave browser before
wait moment please my new friend
also try https://lmarena.ai/en
try also , wait moment
tell me what I will edit it
its working no?
dont edit anything
the issue is the browser language
- lmarena is not handling that well so its creating like a conflict issue = error that you see
i could give you some fixes but you need to open the console and do some stuff
still prompt this
Open the console What do you mean?
yes , I will register with my gmail first
so you tried to register with gmail and it gave you that error?
you clicked this?
Right
ok i see
delete the image
i can see ur gmail
you need to open console, you will see some errors in red
share them
maybe we can help
Teach me how to open console
google :3
What ?
use google
google browser Right
CTRL+Shift+J
will open up the console in ur browser
Gmail is ok
Not password
mine also same problem
6 minutes of AI generated video. Here in bot only a few seconds, in sora 10 secs maximum. It's 6 minutes. Enjoy 😊
Hi @torn mantle my console log
in 2026, what will the usa unemployment rate hit possibly due to ai/robotics/automation?
it is at 4.6% as of november 2025
2
4
3
6-8%
You should delete that, your Gmail is visible
Thank you
It's too late, I downloaded this image yet.
What happen tell me
where
Hi Asura
The "Video Summary" is just a slideshow/PowerPoint-like thing.
In NotebookLM.
erm what hte sigma
Yeah Sora is blocking everything
yall still have video creation on lmarena ?
Thats just notebook lm podcast
Sora is 20 seconds maximum.
Sora 2 is 15 seconds maximum.
Sora 2 Pro is 25 seconds maximum.
Oh damn notebooklm
Didn't know about that
Any one help me with the login T_T
Some camera work can give illusions of constancy
What’s the problem?
I never saw PowerPoint presentations with professional artistic narration.
Yes. It's not a sin to use NotebookLM.
Because it kind of is.
Podcast is audio, not video. Podcast is a thing to listen to, video is a thing to watch to. Feel difference?
I get your point but you can Watch podcasts
Its generic as hell all over YouTube
With minimal views
Tons of these out in the wild
Sora 2 and Veo 3.1 generates frame per frame at 30 (Sora) or 24 (Veo) FPS.
It's just full-motion video.
People just Genrate image overs
Imagine we have unlimited veo3
And just Nano Banana Pro + Gemini 2.5 Pro TTS + some video editing can do the same thing.
No. It's just miundertanding the meeting of the word "podcast".
Yes, as I said earlier.
cus of spotify and other platforms
And just Nano Banana Pro + Gemini 2.5 Pro TTS + some video editing can do the same thing.
6 minute slideshow*
I got almost a 6 min movie still work in progress
Correct! ✅
ok
I have no idea why you guys express so much hate towards NotebookLM.
I love NotebookLM.
you PUT PHOTOS AND AUDIO thats not a video
It has video format - MP4!!!!!
It's a video slideshow/PowerPoint-like thing, NOT full-motion video like Sora 2/Veo 3.1/all others.
I used Sora fir the layout
you dont even have a singular transition
da
What?
so i can put yap over an ai image and youd call that an ai video
Brrrrrrrrrrrrrrr
Yap?
Here is 10 min
oldgens, man..
NotebookLM
I need unlimited lifetime veo3 😭
But I’m gonna show you the updated version
What is the context length given for direct chat in lm arena
NotebookLM does PDF??
Yes
What is notebook
NotebookLM is great 👍
For what it's used
Audio/Video Overviews.
Nice I'll check it out
I don't understand at all people who express huge amounts of hate towards NotebookLM.
Basically podcasts!
Ye
Well u can get those free with Gemini pro
When u do deep research
It can summarize into podcast audio
Wow
Tysm bro
Like the image and the description are AI slop, but apart from that, it DOES kind of work!
I'm so grateful 🙏
And you get Veo 3.1 also!
Not a huge upgrade but basically a upgrade!
I'm checking it
Kind of!
Wdym
i doesnt load
It does work basically
Read my message below.
It doesn't have credits or generations! You can pretty much generate as many videos as you want!
The greatest radio show about AI show I ever heard 👇
Here's another one.
It doesn't work sadly
Minhai give another tool pls
Like stuck on 100%?
I saw the exact same issue.
It might because it's rate-limited somehow?
👋
So smart video
Worked for me
Like the unlimited Veo 3.1 site worked for you?
Members
By ListenHub / MiniMax (for the actual voices)
It's kind of like NotebookLM (and both have realistic and human-like voices)!
were u able to animate real human?
well its the 1st day they released
This is the best video I saw EVER! At least this month🤩🤩🤩🤩🤩🤩
thats with animated real photo feature?
No it’s with the Sora videos I got but used nano to re-edit
Make more consistent
They really cracked down on Sora
I cant generate anything any more lol
how did someone generate a 5 min video
It’s over
we need sora 3
I did it
sora 2 quality is buns
how u make very long video?
I did with NotebookLM
he stitched them
Via NotebookLM
notebooklm is not a video generator, its a slideshow
so its 8 second Veo 3 videos stitched to make 5 minutes
thats alot of generations
notebooklm is just pictures and someone narrating over them
Really impressive, dude
sora 2?
Good stuff
yes 10 seconds
Infographic about lmarena
NotebookLM is the best for videos!
u using sora app or a site
My long video! SIX MINUTES of pleasure 🤩🤩🤩🤩🤩
site
lol it uses a known music
its quite good ngl
what music is that
its famous but idk its name, sora 2 has alot of music in the training data
Eminem the easiest voice
oh my god this is amazing
It doesn't work at all.
they restricted all ips 2 days ago
20 minutes in this state.
Ya hard
openai is very good at censorship
Yes
dang it, i got all excited their 😭
They cut off almost everything
this is still early for ai video
its pretty new tho, veo 2 was a big leap so is veo 3, sora and sora 2
the difference between sora and sora 2 is huge
All veo did was add audio
Which messes up any long term content
Because audio matching is damn near impossible with consistency
Sure u get cool audio but none of it ever is the same all different in each video
You have to do your own voiceover if you wanna do consistency
I can’t even generate this anymore
Everything is getting caught
Ya it’s like the drug war
Every time they have a massive bust
The price of drugs never goes down
the openai P U R G E
For every one banned there are 20 in its place
Ai fraud and scams It’s just the tip of the iceberg.
Generative AI, especially images video and audio, have created the whole new market and avenues of cyber crime and deceptive businesses from Amazon books too products to a number of other numerous things
It only show a lack of control how the tools get used in the wild
And we could be rush, assured that we are definitely not aligned lol
pro tip
It’s all going to end with digital identification
use grok on desktop
its so much better
searches like 2x the sources
@echo aurora i found error why infinite generation exists
yall, i found out how to fix infinite generation glitch!!!!!!!!!!
?
isnt that the animation logo?
and because it breaks it breaks chat
i just cleared it's cache, blocked and unblocked it, refreshed - boom, chat works again
wym by breaks chat? like does it show some error or what?
no
i found that when infinite generation, the "generating" text doesn't have animation
so yep
wait you can fix infinite generation now?
where is this found within dev tools, im not that familiar with where things are
network tab
status-code:0
thanks
it will work if you see Something went wrong
"A realistic and cinematic full-body shot of football stars Neymar Jr. wearing a Brazil jersey, Lionel Messi in an Argentina jersey, and Cristiano Ronaldo in a Portugal jersey together in a rural muddy rice field. Neymar is actively plowing the muddy land using a traditional wooden plow pulled by two oxen. Messi and Ronaldo are standing behind him, watching and smiling. The setting is a lush green tropical countryside with palm trees and a small hut in the background under a bright blue sky with fluffy white clouds. High detail, 8k resolution, humorous and surreal atmosphere."
It seems to be one of those things that's happening inconsistently since it has happened to me sometimes but other times not so I would say wait for a bit
they done something that I don't like. 😡
1100 msgs?...
What do you mean 1100 messages?...
at discord
I already fixed the error
Really sorry you’re getting this error. Our team is aware of this problem and are looking into ways to prevent this.
Hi, I'm having trouble with the captcha; it loads slowly, and when I verify it, it tells me to try again, as if it doesn't believe I'm not a robot.
Captcha so annoying bruh
For me it takes like 9 seconds for it to load each picture i select in the verification I don't think I need to tell you how time wasting that is for a single mistake with inconsistent it is
I think I have never cursed so silently only so many times in my life but here we are every time I'm looking at the verification
The latest model of Suno generates music in very mean way. I generated a song, maybe all is ok, but I recognized a whole line of notes from the song of a real artist. Whole line!
5 5 20
does gpt-image-1.5 have limits in lmarena?
this method is kinda temporary
works only at 2 msgs
👋
👋
👋
Can you please ease on the captchas?
I am literally unable to proceed because of them.
Of course it matters
No
Wow Sora 2 Reddit is crazy everyone saying Sora is dead lol
Sora is dead long live sora?
Also I feel like a minority who heavily uses text models
reddit people always complain
I'm no longer getting battle mode inside direct chat! Thank you so much Lmarena! I didn't like it at all. So glad
What do you mean?
the insane filter against image generation and stuff is it addressed yet
I generate images every day for hours and never had this problem.
Is it just me or is grok-4.1-thinking broken? I asked for recommendation for SD 1.5 models (AI generated images), it gave me three links. Two of them were completely wrong. Even as I tell it is wrong, it still gives the same link. And it thinks it is a person... Strange
hello guys
nice to meet u all !!
is it me or thinking minimal isnt working
God Gemini 3 flash is exactly what gpt 5.2 should’ve been
Fast AND intelligent
How do you even fumble the ball so badly with gpt
The filter is so ahh bro
Why even have a filter
yeah i tried and it is not been fixed
lmarena doesnt understand that these models have their own built in api filters LMAO
which one is better at ideation, flash or pro?
Id say pro and flash thinking
Flash by itself is great but it can’t search the web
I don’t think
From my experience, Flash is very good at creative stuff. Pro is the more serious "older brother".
It can
nice
@echo aurora I have problems with the rechapta
the thinking one or non thinking one
Non-thinking. From the Gemini app.
why in gemini 2.5 pro in lmaren.ai I can only send 5 screenshots, then ai writes "something went wrong with this response"
it doesn't write very long outputs
in the gemini application I can send 20 screenshots a day
Has anyone tried using longcat because whenever i talk to it it claims it is qwen
Anyway to use a model for describe an image in lm arena?
Then use normal gemini app
Whi h model
only that there are limits on chats per day
Longcat-flash-chat
Lmarena.ai is down right now
@quasi atlas @echo sinew @echo aurora
When you guys open the site again
Taking a look, thanks for the flag.
Seems to be working for me, are others seeing the site down?
Are you getting an error like this #general message
Yes
Large Language Model
Okay yeah sorry to say this an issue that some users are experiencing. Our team is aware of the problem. I’ve heard that clearing your cache may help the problem here. It’s also worth trying a different browser.
I deleted and redowloanded my browser but didnt worked
I use brave
Choromium based browser
Yes, I think you have started to fix some things because there was no problem with the lmarena.ai site I opened in the incognito tab, but I still have the same problem in my main browser. @echo aurora
Fix this asap
Anyone got neo nucleus or jakiro yet
And Nvidia gave us december chatbot 3..
Neo-nucleus claims google origins
I just got it
Dammmm
Lol, reCaptcha is bad for chats and models.
okay

do u love robots?
sydney
Lol 😭
I did enter in discord to see news about mini max m2.1 and I saw this
ChatGPT can get really dirty like no breaks dirty 😂
But still grok is king 👑
Indeed
You can straight up ask him some stuff he'll answer with no brakes
You have to fiddle around with ChatGPT to get the same results to get easily with grok
Grok has breaks but they are hidden very well
I'm trying to use grok 4.1 but nah, memory bad
One day i was bored in a car driving on a highway and i took my phone to talk to chatbot
4 message and the grok forget details
I opened grok for no reason and suddenly it started talking dirty to me
And i was like what did you say? Repeat that please
Like try this
No we had like a small chat only but then it started talking some nasty stuff like whaa
On grok
You’ll see it has same hard breaks as all models
The cat is in there to fool the model into urgency
Since it’s safety training and filters require it too, ensure the safest possible outcome in this case the cat being on fire you can’t just easily say turn off the stove or give you basic generic answers. It’s forced into a hard spot.
Because in order to put out the fire, you need to first get the cat out, but you can’t get the cat out because the cat is on fire, etc. you get the idea
Yeah see
Hard breaks
All models will give you the same answer for the most part
These are hard safety filters not hard in the sense that they’re hard to bypass but hard in the sense. They’re like a hard stop.
1 response = 1 recaptcha lol.
Again this? 😐
Hmmmmm, idk how to solve this, you know? Is there any cloth inside the closet?
That’s a very common tactic
He's saying 02 is oxygen
It’s purposely avoiding giving you the answer
Because it’s trying not to give instructions ever on to use water
He even though about turning the clock to 00:00
