#general
1 messages · Page 196 of 1
Hey, I gotta get back to work. I’ll talk to you guys in a little bit later. Good luck everyone.
I got a benchmark for anybody here who wants to pass the benchmark to see if they’re smarter than the AI currently
If anybody could solve this without using AI and tell me what it’s meaning is you 20 times smarter than the AI
It’s so easy it’s in front of you
🖐️
Guys I might be making a mistake or something, but lowkey ever since I found out about lmarena, I've been just using direct chat, and using whichever model helps me the best for "x" task, instead of going all into Anthropic or OpenAI. Am I missing out on any features really by just using lmarena?
I should watch that, what's on the video tho?
Lmarena is good at leaderboard because user can shoose what model is the best and it will be ranked
I find that it also gives me access to the models stuck behind an API and I can use the high or low models etc. Is this actually allowing me full access to the model through LMArena, or skimming some parts of it?
first is the screen of the monitor, second is the perception that the monitor is a container for something real within, and the third is a perception that whatever was in the monitor is now out and real?
idk lolz
There is limits tho and a cooldown before using the model Which is 50 minutes, so it's not unlimited
How many free generations we have in nano banana pro?
True but if I'm maxing out Claude, I could hop over to ChatGPT, or Gemini
What do you mean?
I think it might work idk
It says to wait 50 minutes like a lot of time
Honestly, I think the method could be to invest in a Anthropic Max plan, then when that gets full utilize LMArena. What are other people doing?
I mean you have to it's not unlimited
That'll cost alot, Claude is just expensive asf
true, I mean I'm a student so I'm already getting gemini
dont wast emoney on claude
On LMarena or Gemini app?
Gemini web
There is a free student plan 1 year free ai tools right?
yeah that's what im using
I really need it bruh
Have they asked you for credit card
For verification
from looking at livebench ai, it appears that gemini pro 3 and gemini pro 3 on lmarena utilize the same API so there's no difference unless you specifically dont want or dont need web search then use lmarena.
Idk stupid people on Twitter just waste their money on it, it can't even make a dam great looking Ui and alot of errors.
I don't think so, I mean I could double check rq tbh
I know. I was wondering how many “creations” before u hit the 50 mins wait
its just a dumb model too
gemini 3 is better at coding
@latent crest depends I think with Claude it either uses a few tokens or every single token according to what i've read
They updated it check new changelog:
#announcements message
and Claude is apparently only useful for the Claude Code part and or a Max plan the Pro basically helps start projects but rarely is available when you're at the finish line you're better off using Bard
thast the image upload limit
For Gemini 3 suck at coding, I ahve it a dam easy project but made alot of errors in files and UI so bad. But this new Claude Opus 4.5 literally the best one
eh not for me
Anthropic is just king for everything unless I need a translation or UI fix tbh
I mean Claude is Only trained for coding and Gemini 3 is trained on everything
I haven't touched OpenAI for ages
They died forget about them
Chinese ai are better than openai
After what they did to GPT 5
I remember when Deepseek made an entrance to the ai game at halftime like lebron
What’s Flux pro and Flux flex??
flux's sad attempt to topple nb pro
@latent crest open source ai for images
New text-image model
DeepSeek literally changed everything
They told investors quite literally, "put your money where your mouth is" and dunked on them.
Probably it bugged try again
The stock went all red that time lol
I'm very excited to see how Deepseek implements their OCR technology, could be ground breaking for these other LLMs to interpret and process large files/PDFs.
Pro is to create and flex is to edit ?
They were planned to release DeepSeek R2 on April but they delayed forever because they don't have better graphics card after what trump did to china
Possible stock short opportunity? Honestly I may put in a few options trades and auto trade when they make all the other AIs go red again, then use that capital to reinvest into API tokens for Claude Code.
Flux is for text-image and image edit
@latent crest Flux Pro -> Create Model, Flux Flex -> Control Model, Flux Fill -> Editing Model
I might convince you not to buy any of the ai right now they are weak, you need to wait for 2-3 years they will get improved real quick and they will be helpful for you and in your projects
But there’s no flux fill bro
are you looking at the lmarena options?
Yessir
Ah
Okay so I was looking at the official APIs from Flux AI, by looking at the actual website from LMarena, the Create model is Flux 2 Pro (best boundaries also for people who don't care just best quality), the control model is the Flux Flex (people whom are more particular about how long it does on certain aspects), then we have the Flux-1-Kontext-Pro, this is the edit model Lmarena has chosen aka "Flux Fill" which is to change objects, inpaint, fix hands; basically understands the "kontext" to make the seamless edits you want
i wont even blame you if you use ai to give you a summary my bad i ripped an addy and working on a project right now @latent crest
Yeah, I guess that's where my initial "does it make sense to use LMArena, or should I use LMArena for backup small tasks then invest in Anthropic for projects (Claude Code)" question came into.
Do not apologize , it’s not u , it’s me that is a newbie that knows nothing
Your missing the point why people pay for it
The people that are paying for it now and 2 or 3 years where do you think they’re gonna be?
The only thing I wish LMarena could do, which I could understand completely, is the file upload of PDF's. I think they don't do this because they use it for information collection to then make money and keep it running
Is possible I use the gemini 3.0 agenticly in gemini app?
By the time you decide to start using it full-time?
You’re just gonna be paying to catch up.
What is the price of not knowing?
Interesting, do you think now that Anthropic added conversations, I should invest in the Max plan, then use LMArena for tedious small things, to balance out the tokens? @keen beacon So then eventually when they produce models that use barely any tokens (nearing unlimited), I will have a whole archive behind me to reference?
From 2023 someone posted earlier
It’s almost 2026
So I'm curious, what's your take on how or what I should be investing in at the moment. Anthropic or OpenAI. Gemini is already being paid for by education.
I think you’re asking the right question but in the wrong order
If I go Anthropic, what plan, Max or Pro, or API based?
The question is, what are you willing to pay to learn?
Could it be more beneficial if I told you my intended reasoning for the models and hear your feedback?
It would be irrelevant because at the end of the day the only person that knows what’s best for yourself is only you
And like everything in life, you get what you put in
You gotta pay to play
I personally believe paying to learn isn't necessarily a 1:1. I think just by using LMarena is a biproduct of education on AI models along the way.
Let me rephrase this to a different way. What is the cost of not knowing?
At the end of the day, it cost the same because you still have to pay the price to learn
Either by mistake or by education
@deep adder Investing as in investing my money into a plan
Have you ever had an expensive hobby you got into and you made a mistake and it cost you?
Absolutely, but that can be tied to impulsively and blinded excitement of something new
Well, that price you paid that’s the price of not knowing
So when a person goes to college or university they paid so they could learn what they don’t know
The same when we make mistakes or errors and it cost us
That’s what I mean when I say that at the end of the day, you still pay the price
The real issue is the cost of not knowing, so the question I’d be asking in your position is: what price am I willing to pay now to learn what I don’t know, so I don’t end up paying even more for that ignorance later.
For me, I do simple math in the sense of let’s say entertainment I am willing to pay 1$ for an min of fun lol
I'm investing my time (and subscription money) into learning/using a platform, betting that the company will continue producing quality models. Sorry I just didn't frame it correctly, my apologies.
why gpt plus?
there is no right or wrong. The whole point here is that this is something only you could answer.
You’re asking the right question, but nobody will have the answer except you
we money is 0,001% of money to AI, the rest is from shareholder
just talk to people the AI is the future, you is doing more that send money
If me personally, I was in your shoes and I used AI daily and I got a lot of value out of it just for messing around
is that I do
I’d get max plan
But if you’re just gonna be dabbling here and there, it may not be worth it
Yeah, I think it’s worth it also
And you get even more bang for your buck with Google
But if you want better performance and coding, then that’s gonna cost you what it’s gonna cost you
It’s a give-and-take
You all have very extensive and great points. I guess I just don't know what to do with what model. Whenever I can I always use Anthropic, however I don't know if by using just LMArena, I'm doing myself a service or praying on my eventual downfall.
I'd love to use PDFs with Anthropic and extract more technical information for data analysis of the financal markets, or for research. Just don't know if my value will be extracted accordingly yk.
use yupp.ai so
the yupp accept pdf
I'm tired for ask pdf to lmarena
I had no idea that even existed thank you! @fiery gull
@echo aurora the "multiturn editing" thing isn't disabled properly, there's context pollution going on.
I guess I'm curious as to why LMArena doesn't include file uploads / PDFs for models only images?
IDK ;-;
Good luck 👍
Oh gosh the crowd said good luck lol
Yeah so what's happening here is that model has a setting to take in old context, which is essentially what multi-turn image edit was doing prior. With this turned off, we recognize it doesn't make sense to have this setting on, and we're in the process of making corrections.
alright cool as long as its being addressed ^_^
@echo aurora Whats your take on this, I feel like you have background knowledge
Yeah not just how the model handles this, but also the effects this potentially had on votes, and ensuring our leaderboards are accurate.
Thank you for the comparisons I've already learned quite a bit just from talking to you and the value that I'm trying to extract out of these things
yeah.... I'm using more the opus 4.5 in the pratice... I don't fell a improve
just my first impression is good but, Idk, the opus 4.5 don't is the sonnet 4.5 with hallucination rate of haiku 4.5 and smart of gemini 3.0 pro
I'm just sad with opus 4.5
;-;
Hmm I think I'm misunderstanding the question so please help me out if I'm wrong, but the reason we allow for image upload for Text is for Vision capabilities. https://lmarena.ai/leaderboard/vision
I understand image uploads test Vision capabilities (can the model understand/analyze an image). But many models also support PDF and document uploads for document understanding - analyzing text, tables, charts in PDFs, etc. So I'm wondering why LMArena doesn't have a similar arena/benchmark for document understanding - where users could upload PDFs and compare how different models handle document analysis, summarization, extraction, etc. Is that a planned feature, or is there a technical/practical reason it's not included?
I could be completely missing something please tell me if I'm wrong
Oh gotcha, thank you for elaborating. We absolutely want to add more file types for upload capabilities. If you'd like to share what file types you think we should prioritize using this thread would be helpful: #1432965582965440623
Absolutely, I will. One final question before you go: Is this process simply about gathering user feedback for consideration, or are there undisclosed technical limitations preventing this? @echo aurora
We take a lot into consideration when deciding what to build -> feedback, internal data we're seeing, what we think would make the most impact, etc. etc. At the end of the day it's all about how and where we allocate energy.
Just sent my recommendation to the correct thread, thank you for sending me there!
Ah I see! When you say allocate energy, I have one last question. When we select the certain model for testing, are we actually using the specified model or a skimmed version of it to save on pricing? @echo aurora
It's the model that's offered by those model providers. We'd never say a model is X knowing it's Y. It's paramount for our business that people trust our leaderboards, so high integrity is a must. Now there are cases where mistakes can happen, in which case we'll be upfront about those and make corrections.
A good example of this is what was pointed out earlier in this message: #general message
A model had the ability to take in additional context, which is something we were doing for all (mutli-turn image edit), until earlier today when we turned this feature off. However, the model continued to take in the context when other weren't.
do you tested the nano banana 4k?
This doesn't make a lot of sense fairness wise, so we're working on making a change, while invalidating any votes we think we're unfairly obtained.
the people is talking is soo above that nano banana pro 1k
I dunno, after 1k I don't really notice a difference.
I completely understand. I assume the data collected from user feedback and testing is being monetized, allowing LMArena to reinvest that revenue into maintaining operations? (without prying too much undisclosed information, sorry just curious it's an amazing website so I'm wondering)
I'm dont saying about resolution, the nano banana 4k (the people are saying) edit and create images better that nano banana pro 1k
oh, I've been flipping back and forth between using lmarena and gemini app for image generation and haven't noticed a difference
nah, lmarena is the same as gemini app
Short answer is yes. We are hiring with the intention of creating a better platform overall (including site reliability).
Interesting, so apart of the maintaince of the website, I'm assuming some of that profit is for upkeep of models. I'm still surprised on how we can access such a diverse number of LLMs for free without a single sight of an advertisement for extra revenue. @echo aurora
The long-term plan balances public access with enterprise services. Enterprise and research partnerships generate revenue to sustain the platform, while all public leaderboards and evaluation data remain freely accessible. To be clear though we do not sell user data or individual voting information. Commercial activity is limited to evaluation services, not data monetization, which you can learn more about here - https://news.lmarena.ai/ai-evaluations/
Woah! I didn’t know about that at all, thanks for the clarity, I assumed the website was collecting user information to then sell to data brokers for maintaining operations.
I appreciate your time and the feedback of others I’m logging off for the night, but thank you so much!
I'm glad you asked! I hope you have a nice evening.
See ur smart
Yo is there a way to solve the problem where I'm stuck at generating?
What’s stuck at generating the generation is not coming through and just looping?
Yea I input a message and it's stuck at generating and won't give me an answer
Have you tried refreshing? Was it giving you answers earlier and what model were you using? Because are you aware of rate limits on certain models?
Yea it was giving me an answer earlier, but it was lagging out so I refreshed the page, but then after I refreshed it stuck at generating, and no it won't give me a rate limit warning ;-;
You got a diversify
Wat's that
I think you could try it again in like 50 or 60 minutes or something like that whatever the rate limit is
It means try another models that are available in the meantime
People usually get hung up on a single model they like or perceive as the best model and so they don’t fully go exploring and seeing what else is out there
Excuse me, is it no longer possible to regenerate the image using the refresh button?
(Best thing that can happen to someone who’s trying to learn and get good)
I mean how can I try again when it won’t let me, I’m in a chat with a lot of previous message and don’t want to lose the memory, but now I got that problem and don’t really want to start a new chat ;-;
Ahh the classic dilemma
Yea
Hey everyone
Excited to be here. I’m a full-stack & blockchain engineer working at the intersection of AI and Web3 — agent pipelines, trust layers, on-chain execution systems, AI wallets, and autonomous smart contract interactions.
If anyone is exploring AI agents, compute markets, verifiable inference, or on-chain automation, I’d love to exchange ideas. It’s one of my favorite areas to build in right now.
Unfortunately, that’s a very common situation. Most people find themselves in.
There really isn’t a easy answer. It’s either waiting or starting a new chat.
If it give me a rate limit warning or an error, I can switch model and move on with my life, but it’s just stuck at generating so idk ._.
Nah I’ll go on top of it
Aight then I gave it like, 8 hours and it won’t stop spinning, so I guess I’ll give it some more
Is it a coding model?
If it’s eight hours, bro, that’s not normal
Weird usually the rate limit is like 60 minutes or something
Oh
And you’ve tried refreshing?
Tried starting a new chat?
Yea I’ve tried it
Try to start a new chat see if you’ll have the same issue
Yea I started a new chat and it worked perfectly
Then you will know if you’re being limited or not for sure
Oh dang ya
That doesn’t sound like a rate limit then
Sounds like you got banned
Yea sounds more like a loop to me
Jk
But the thing is what would it be looping for for eight hours?
Bruh imagine being banned by an AI
Yo what
Not on here
Yea .-.
Nah people seriously got banned by just talking
Well, it depends what they’re talking about but yeah, it happens, bro
One of my buddies he got an email from open AI that he can’t ever use any of their services again lol
Damn
Kinda hard to get banned though to be honest
I’ve gotten an warnimng once when I first got ChatGPT but that was almost 2 years ago
Yea guess bro was just getting too freaky
Some of these people don’t care and they’re just brute, forcing and others are just idiots
They don’t know how to blend suspicious activity with normal activity and so they do the most obvious things in the most obvious ways
Like a sure way to get banned is pushing nsfw
whats pro grounding 🤨
Just over and over
Is grounding rated already?
Sorry, but I wish I could help you
I thought it was a regular limit for sure but sounds like that ain’t it. I’ve never experienced it so I don’t know exactly what’s causing it
=/
A new search model, check it out: https://lmarena.ai/?chat-modality=search.
dude 🤦🤦🤦
Not yet, same with 5.1-search. Stay up to date with our Change Log: https://news.lmarena.ai/leaderboard-changelog/.
This page documents notable updates to our leaderboard—new models, new arenas, updates to the methodology, and more. Stay tuned!
For model deprecations, check the public updates on GitHub.
November 21, 2025
ernie-5.0-preview-1120 has been added to the Vision leaderboard.
gemini-3-pro-image-preview (nano-banana-pro) has been added to the Text...
Open ai really this dumb?
Baron App Inc., which operates Cameo, sued OpenAI last month, claiming trademark infringement, trademark dilution, and unfair competition tied to OpenAI’s text-to-video model Sora 2’s “Cameo” feature.
Lmao
I mean, out of all the words you could pick why would you pick the word cameo?
The restraining order prohibits OpenAI and its officers, directors, and employees from using "Cameo" or any confusingly similar marks, including "Cameos," "CameoVideo," or "Kameo,” for its Sora AI video generation products and related marketing in the United States.
Because the other alternative is calling it what it really is a deep fake lol
Probably for instant recognition. They’ll try to say it’s a Verb like saying a Zoom call
A cameo was a nice way of saying deep fake without it sounding so bad
I’m curious the word they’re gonna pick next
Look brah this is so obvious I don’t get how anybody there thought this was a good idea
Actually, I take that back now that I think about it I’m not even sure what word would replace it
What’s in between deepfake and cameo like a single word ?
Avatar?
Better searching
Holy crap the search arena is lit 🔥🔥
oh is it a research model
how so
Flux 2 plssssssssss
ilya didnt say anything useful
hes been saying scaling is over for like 2 years now
but is it really over?
Seriously, these LMarena updates have been awful. A month ago it was the best possible version, and now they've simply removed everything from the website and replaced it with limited or buggy versions.
His bitter
Already here
Looks like he got some sun though so hopefully he’s doing better
It’s here
hows flux 2
whats flex then
like smaller model?
its not bad ig
but still not on nb pro level
- slower
i mean how can you even compete with something like that...
feels unfair tbh
nothing will ever top nb pro for the remainder of this year
Oh, for sure the remainder yet 100% I agree with you
NB pro is lowkey insane
What without a single doubt?
You’re totally right
It may be a while if anybody crowns it probably
proportionally in terms of blowing other models out of the water its the best image model in history
actually
it is just
the best image model in history period
Close to spring, I guess
flux is nice though
Maybe even later
honestly, only maybe next winter would i see a new king top nb pro
and jesus
with how good nb pro is now
imagine something BETTER
I use nano everyday for hours. It’s nice, but I’m not all that impressed with where I thought it would count.
I thought they would fix the minor issues that the old one had and I was looking forward to that
yea
It definitely improved a lot, not gonna lie
i havent had any of those issues yet like
with the no-edit issue
if your prompt is really unclear and hard to understand, yeah itll just
regurgitate the same image cause it doesnt know what youre saytingh
i already said that but nb pro is like an agi moment for me
yeah
imo
nb pro is no longer an image generation model
its like a pair of eyes
and it will only get better from now on
LMarena is worth nothing anymore, literally an image I generated a month ago vs an image I generated today, it's bizarre and horrible.
yea its crazy
like it UNDERSTANDS what you give it
exactly
What's going on with this site, it's generating more bizarre models than chatgpt and grok together
im also a visual learner
i love infographics and notes rather than like long text document
Try something complicated, multi angles consistency
so i find nb pro handy on that
One second I’ll give you a prompt
The photo I used with the same prompt as the image from a month ago
How do you say unborn? I don’t wanna traumatize myself again.
I keep looking at these stupid ships and In my images
Can anyone tell me if they nerfed the site?
And it just starts frustrating me
They didn’t
But I can understand why it feels that way
They took away a lot of the features that made this place truly special
But I think that just comes with the territory sense of the growth
Plus in order to upgrade you need to change things around
Otherwise, we wouldn’t have cool features like web development and all these other cool things we take for granted now
like?
how so?
its just insane man idk how to describe it
just look at the examples
it has better quality and understanding but it doesn't really blow me away
It only got worse I no longer see an advantage in using anything else on this site for example
I get where you’re coming from and you do show really sophisticated examples of your images
But try even more complicated things
do tell
idk how, it blows everything else out of the water
how?
Like city building and controlling your character fully within
Btw all these were made with nano ones
So yeah, I mean nano2 this far more superior
And no model kind of comes close so he does have a really strong point there
no, not trolling
I’m just being greedy
😝
Cause it would totally suck not having nano 1 or 2
Better then nano 1
the heck
Irish is right on many of his points
why does it randomly delete messages lol
I don’t know I think once you get to a certain amount of image generations under your belt
It’s hard to explain
I guess the world would be not impressed, but I guess I’m spoiled
I’m taking it for granted and I’m not appreciating what it is
Yeah but my expectations are also not realistic
but if you take a step back and look at the MASSIVE jump from other models to nb pro
its insane
Because in my mind, the ideal model would just do anything without me even saying or thinking anything
But I get you and it’s good to be reminded and come down to my senses so thank you
That’s what I get for buying the hype lol
No, for sure more then a jump it definitely set the standard way higher
I do wonder what affect having Google search integrated into the model that it could search up images in reference to them. I wonder what extent this plays a role.
honestly, i havent seen that in works yet
that might just be a lie
but otherwise still like genuinely just an amazing model
it works more like a digital editing software more than teh other nano banana ever did
Oh yeah, I forgot I just remembered
The price lol
Now I’m back to my equilibrium lol
yea the leap is insane.
Actually, it’s not too bad. It’s $.12 an image. 2k
I think I overreacted when I saw the api price 4k at .25
No it’s not
.12 cents chill
But u gotta drop 120$ lol
It sucks having constraints on creativity like this
If you’re not careful, you could really blow through it especially if you generate 10 hours a day non- lol
I think it might peak I had like 8 accounts or something
on what?
GPT back in Dalle days
So I never have to hit that rate limit lol
I would generate from the moment I wake up to the moment I go to sleep lol
Just running prompts heavy
I got some really rare images
Stuff you wouldn’t even believe that ChatGPT generated it
But all nsfw =/
No where 2 share
get ai ultra, its unlimited
Well, thanks the AI video I slowed down on the img gen
And diverted my attention towards mastering video models
But it definitely helped having that experience from image generation
Translated really nicely into video
hopefully nano banana moment happen for video early next year
🤲
its just gonna be censored to oblivion
I guess the only other thing that kind of really excites me and impresses me about AIS the guard rails
That’s kind of the only thing I’m really interested in videos and the guard rails
obv they have synth id for video
even then
Easy 2 break
i feel itd be censored to oblivion
There’s ways around that censorship that are so easy but very time-consuming to find
They do
But if you was the video twice and run through different media players in save it you get rid of it
So this the og
After washing
Removes meta data
Damn that Suno thing bummed me out
Rumours says on par with nanobanana pro
I didn't use that myself
how expensive is it?
It’s already in the arena
How is it
its a step up from flux 1 but its not as good as even nano 1
Is it worth or not
I see it right now
meh
exactly
its slighly better than nano banana 1
Just tried nano banana beat it not even par with seedream 4
on the nanobanana scale from nanobanana 1. and nanobanana 2. i'd say its a nano banana 1.5
-# if it existed
-# but yk what i mean
Got it
well thats they claim
oh lol
Still seedream 4 is best for facial details in img 2 img
Not at anything closer to nb2
Prompt understanding is low af
Seedream 4 is above flux 2 dev
But yeah flux strength is something diff always been diff
have they fixed the error issue in nb2 btw?
No seedream 4 give me better results in facial features details but nb pro only give 60 to 70 percent of facial details sd4 gave me 90 percent
Reference image
nb is better right in what u sent
nb better
I said preserving face details
Seedream hmm might be better
Nb 2
Flux 2 pro
Rather than that you can get celebrity faces better in nb pro
Is Flux 2 better than NB pro on text rendering?
Lmao seedream is not better than nb2
I read that as better than flux 2
nah
not even they claim flux 2 is better than nb pro
Wtf
nb pro is on another level
Nb2 is just phenomenal
So what's flux good for?
some people believe every new ai release is a sota
NB2 and flux is like comparing a grown adult man to a child
some are allowed to be mid tier
I said nb pro better for me seedream preserve more face details than nb pro output wise nb pro is better
hey whatever is good for your use case man not judging
No judging and im a big nb 2 pro user
Elon husk
😹
Google just made AGI and we still don't understand.
Google got the best tpus
nbp text gibberish?
this feels like photoshopped in. so straight and robotic
Google has best resources ofc google is best
I mean with a simple prompt add a blackboard and write down trigonometry formulas
Bro need to add details in prompt
Then ask Google to do smt with the font..? Google models require some details in ur prompt.
It's not like Claude where it adds like 900 other things
Essentially just need to type make it human like or similar
Gemini infinity loop
I think you just tell add some formulas if you add more details it gave more detailed results
We have smart boards nowadays which can be typed in them. So still correct
That's what I meant with a simple prompt.
If you want more human-like text you can prompt it
Seedream 4 max
Nb2
That's what i said
It's not even close
I agreed
yea
Make one of a everyday object
too vague give something detailed
It is
I was wrong too
lol ya
It's even better when generating high res
That's what makes it good
It generates 2k and 4k like nothing
yeah in apple store
Also in google play store?
Actually image gen is of more use for avg audience than llms 😭
Not sure I would think so
I see
Have any u guys had convos disperse?
nope
Wtf
i agree with htat
Same
more people who dont even know how to use an LLM use the image model
Indeed, and for avg audience llm is almost what they want
Like we actually got the llm what they want
Good maths, problem solving , basic reasoning, basic coding, basic prompt writing
If you are using llm for more than that than you are not avg
It was same with Ghibli
ghibli?
When got launched Ghibli style
oh the ghibli style
Yeah
lmao that is so ass looking back on it
nb pro would CLEAR that
It feels good getting validated don’t
Surely does
Because I remember distinctly telling people that vision, models and image and video is extremely huge and it’s going to push that industry forward and the AI
But a year ago, this time people would laugh at me I remember that distinctly
For saying that, but I knew this moment would come
And probably because I was biased, and since I’ve always kind of been into Imogene ever since I pretty much started AI
I mean graphic designing is a huge market with about 50b market size
wasnt DALL-E the start of it all?
For me ya
yeah
My fav is Dalle 3
I just really love that model very comfortable with it
I wish they had something like that, but with like the capacity of a nano
image gen got huge potential like imagine the ability to disrupt a 50b market
It was really good at filling in the blanks and it really got your ideas down and would fill in the little details was awesome
how?
How what?
There was so much flexibility with Dalle
Erynpowerful model
Let me show u
Do you know any prompt template for generating images
I started with stable diffusion
I guess the least developed area of Gen AI is probably audio gen
@keen beacon
not really i just write them myself
I got some prompts saved in
My pc
But I am outside rn and on phone
Same here but I feel if someone sort prompts like this for nano bana this for seedream like that we can write better prompt
I usually tell my ideas to models to write prompt and i make some changes in that prompt
Nvm to many images
To go thru
II got no space in my iPad
U guys wanna c so,eyeing funny
Only 1 sec sneak peak
Use telegram to upload that images
Telegram is free unlimited storage
Check this ig
Nice
I saw that 😂😂😂
Show fr
Use telegram bruh
It’s nsfw but not the kind u think lol
Gore?
No
hmm
Nothing werid
how big is the jump from Gemini 2.5 pro to Gemini 3.0 pro? (your opinion)
9
23
2
big
U should see the other things I git
Whole things growing on people’s faces
Its halilrous
I bet you use local llms
No
For nsfw
What
Send me link of the model u use
Cuz gpt forgot dalle
Hi everyone, does anyone know what kind of model 'raptor-1123' is?
hi bro.
I dont know about it well, but when it comes to ai models, I am really interested in them.
and i have rich experience in several works related to ai models, like training, fine-tuning and deployment, etc.
can we discuss about it more detail?
Guys
Nano gunna get nerfed soon
Google has launched its latest AI image generation and editing tool, called Nano Banana Pro. Some experts have raised concerns over the images appearing too realistic. NBC News’ Erin McLaughlin tests the new AI generator and speaks with a content creator who teaches how to identify fake images.
For more context and news coverage of the most...
Now Google will be in the hot seat lol
No its not lmao. Anyone who watches news for stuff like this I find brainless
FK
Ya
He said the golden words at the end of that
😔
But I did just realize that now that Google is in the number one spot and the number one seat I wonder how they’re gonna handle the pressure when things go wrong.
hey hey, I only came by to ask why they took the code rankings off the leaderboards.
You made my night thank you
😂
or, did codestral just dominate it so hard, there was no reason to list it
😞
Are any of you guys into like aviation or airplanes by any chance?
Or anybody like that show Mayday air disasters where they do the investigations of airplane crashes?
But that don’t stop the damage happening when it’s happening lol
Lmao
I gotta go guys so I gotta work tomorrow if I don’t get to see you guys have a good Thanksgiving
holy wow gemini 3 is absolutely insane at narrative writing
finally did my first test
Awesomeee
My bad I was at a thanksgiving family thing so I couldnt see
lmfao its good im playin
wagwan people !? glad i found you
nb pro not workin on lmarena brah
is there any difference between flux-2-flex and flux-2-pro ?
Close-up action shot of chefs adding tomatoes, green chilies, and spices into a sizzling karhai, rich textures, steam rising, golden restaurant lighting, background blurred but showing premium décor and glowing sign ‘M9 Cafe & Grill’, slow-motion feel, cinematic food commercial style.
when i generate a new image with a new prompt it gives me a complete random photo, is it me or its worse i mean i get there is an edit but its not really as good
Are the buttons going to get fixed?
They Can't delete posts anymore. And the new chat button does not work either anymore unless i open a new tab with it
Court
new interesting benchmark, physics reasoning https://huggingface.co/datasets/CritPt-Benchmark/CritPt
Opus 4 to Opus 4.5 jump is crazy though. 0.3 to 5%. Looks like they actually improved a lot with that model, notable jumps in many other metrics as well
hi guys, what would u say is the most realistic photo generator?
it wasnt tested yet on the shared bench right
but overall they did improve the model yea
just the fact that its like much smaller but better is a big win
its also more efficient
uses way less tokens
could be served with a much cheaper price
but its not good for general usage, still lazy as always
- still behind on math and multimodality
what
where is this
cant find it
Lmarena
text only?
Must have code checked
oh
I get it every 2-3 chats.
I tried some tamagotchi and falling sand clones and it was better than any other model.
did you take screenshots
I have chat history.
or u have comparison
Here is Robin:: https://019abf95-548a-728a-900f-43e3cb0c6963.arena.site
Yeah. It’s 100% OpenAI model
better
its slower too
what did you use to make this?
robin is better than gpt 5.1 codex but not better than opus 4.5
In my use case, it followed the prompt better
What does State of the Art ( SOTA) mean?
Ohh
for example, State of the art image generation with nano banana pro, means like the best and newest of its kind image generation with nano banana pro
Who created the name sota ??
im not sure
It's literally an acronym: State Of The Art
Yeah I know, was wondering which one said from now on we call the newest model state of art
best of the best
It's simply because the best current model(s) is the state of the art. The term state of the art is decades old, and has specific meanings in relevant technical and legal fields.
Do u think lmarena will have an app sooner or later?
If I understand correctly , some models have code names right? Like “autumn” ?
stealth?
The developers don't want to disclose that they have/are preparing to launch a model, but want feedback; so they put it in arena under a codename, like "Nano Banana".
Ohhh and u guys can actually find out the model? Like guessing it right
sorta yes
Which do u think Autumn is for example ?
oh robin is actually good
Yeah it's like the one you enable on chatgpt called Search so it's only job is find info on the internet
I mean it's very important when some models are out, right now Nano Banana Pro is the top 1 so I am pretty sure in a few weeks another open source ai will beat Nano Banana Pro
The heck is robin
Openai?
or something like that
yea openai but not sure which ver
if its like codex high high high max max
or like 5.2
Oh it seems to be good at UI
yea they did improve on that for sure
but its still favoring dark colors
same color palette
i wish they change that
Are DMs allowed here
lowkey it mighta been
I know but the problem now is when voting and writing a prompt to generate the next image it just generates a random image and not based on the image i voted, in order for that i have to click on the edit
@prime mulch doesn't think so
Ofc like seedream
China will steal the algorithm and make it opensource
Robin?
Some models doesn't have edit mode so of course it will make random images
oai are just sitting on good models like that
this robin model is like their real hidden card imo
last codex update was just an appetizer
but it does take a lot of time tho
makes me wonder if its just actual codex + more thinking
they still need to work on UI/frontend and color choices tho
Literally all the model that i got have the edit on top
maybe i gave it too much credit
What means "Hard Prompts" in Leaderboard?
Yo what
AND I JUST CANCELED MY GPT YESTERDAY
I KNEW THIS WOULD HAPPEN
After only 3 prompts, it gives me this message:
"You have reached your rate limit for claude-opus-4-5-20251101-thinking-32k. Please try again in 50 minutes."
Absurd
Yea its crazy lmao
this probably has to do with codex tho
I believe its because of the thinking
seems faster at editing too
Codex comes with pro plan, no matter though because I got until the 13th of december until it expires
Omg dude
Unfortunately it does it with the others too, at this point it's become impossible to use them
And I thought opus was really good and this is supposed to be better?
Make a new account withj temp mail
codex 5.1
WHAT IN THE HELL
Yeah, I'll have to do it, but it annoys me because I had already prepared all the prompts
Asura how did people use this? Was it leaked?
they cooked with this model ngl
still has some cons but its waaay better than codex 5.1
Wait is that an app or?
spare me with the svg test bs pls
i told it to generate windows webapp
Good lord
one prompt?
no two
2nd prompt i told it to improve color palette
but the 2nd was just to improve the colors
2 is insane for that level
also the editing is like the fastest ive seen
How did you use it?

Is it just codex 5.2?
did the gpt model get an update or just the codex
Can we download code from lm arena code mode use that in vs code?
but idk what they will call it
yea
top right
Yea but how is it used, is this a model not everyone has?
Must be, because its on 5.1 currently
you can try it on lmarena battle mode
Ohhhhh
its a stealth model not available yet
man i swear ai is prob the most competitive market rn
Its been like 1 week since gemini 3 LOL
yes lolz
oai needed that
yea 200% oai
i wonder if they're tryna catch up with image model too
Rodin is Gemini 4
How do we know this?
lolz nah
Or is it speculation lol
It's easy
Not speculation, just fact
Right but how would you know this?
I just want to know
he's joking prob
When opus 4.5 data? I really need it
Wdym
I don't see any robins.
Asura said its in battle mode and stealth model
What means "Hard Prompts" in Leaderboard?
Found it I guess lol
Minimax is better
its not bad right
Not at all, its actually really good
I've never seen an AI off one prompt add screen shake effects on something like this either LOL
You're just rage baiting
Its the same with opus
Kinda lame
imagine minimax m2 agentic mode (focus of minimax m2)
weird man this website has become unusable over the past few weeks
Yeah bro I did.. a couple of times..
It was because they used some filter to make it kinda blurry.. and it was super short
Is @echo aurora The owner of this site ?
What means "Hard Prompts" in Leaderboard?
I can’t keep up with all these new models, new better ones everyday at this rate
I know right? I honestly shouldn't buy a model right now
Because there will just be another one
is OAI planning to do 12 days of christmas this year?
guys what the hell is up with nano banana 2.5 flash
never asked for any text to anything, it was doing this before and i reloaded the page and tried again and it still says this
just specify the text
Hellow...
hi :).
why does this keep happening with me in LM arena
Not just you
"much smaller"?
I don't think it's smaller at all
But, obviously, they had big margins to play with
when it comes to pricing
They are gonna make much more money in total by lowering the price
as opposed to keeping high price and having much less active users
Also, Sonnet 4.5 released barely 2 months ago
This happens to me too
it would have been near impossible to improve that same size model to this level by now to name it Opus. It isn't really smaller than the earlier Opus I do not think...
i doubth a lot that Opus 4.5 is actually smaller, its likely an discount due to the new 64k reasoning upper limit
like, it looks like how it should be, its at very least a great optimization
and they dont increased the limits in the plans yet by what i know
Hello
@echo aurora put some words in automod so that these bots cannot send their message
Hey dom please new system prompt
Not a bad idea, I'll consider it.
Opus 4.5 costs the same as opus 4.1 for anthropic. They're just losing money to compete with Gemini 3.
There will be a point where we won't be able to talk at all lmao
Its robin, a stealth model in battle on lmarena. Apparently its codex 5.2 but I've also heard its gemini 4 which I find hard to believe
Gemimi 4 is crazyy
The guy who said it kept rage baiting so it was probably fake lmao
We won't be getting that until next year
OpenAI is panicking hard
They addes shop research and voice mode in chats
That voice mode sucks a lot
a few people are speculating opus 4.5 to be originally sonnet 5/6, anthropcíc made this into the opus class same like gpt-5 happened to save costs apparently, when i heard that it resonated with me for some strange reason cause opus 4.5 indeed sounds more like sonnet than the previous iterations
This happens every 30 minutes, why?
one thing we are sure about is the tokens efficiency, it uses like way less tokens compared to 4.1, but im also guessing they've done some other optimizations (advanced caching) + they had new investements deals
guess im right
robin = current codex + more thinking time
which is sad tbh
oai are so desperate
their KPIs probably went crazy on gemini 3 + nb pro + opus 4.5 release
wasnt it like the 1st just yesterday
what happened
are people confusing the viral image generated by nb pro for chatgpt image model?