#general
MoE is starting to catch up to dense though
Api I mean
We are currently experimenting with this, meaning it's not going to be fully available to 100% of users.
A lot of companies use the api
depends on your job opportunities bro
Honestly if you can just use flash via api for cheaper than 200 a month
That’s a steal
Price is almost certainly an issue
I just saw it and was trying to generate, but it asked me to log in. Then it vanished... 😁 Good luck. It will be great if it's available for everyone....
i want to see artificial analysis benchmarks but they've been really slow lately
you guys dont know how crazy this is
for its price
performance/cost ratio
its even outperforming gem3 pro in some benchs
did google just smash the market again?
also i hope they make 3.0 flash the default model for free users
we will probably say goodbye to sonnet 4.5 and 3.7
like look at those numbers..
wth
i still cant believe it tbh
Speed-related benches, right?
This could be peak
Gemini 3 Pro uses logical language to explain the solution to a problem, which I appreciate. Meanwhile Gemini 3 Thinking uses cozy vocabulary to connect with you better, which I don't appreciate when I'm asking for technical help.
yea
you probably get the results you expect with better prompting
they just ended oai
Oai is done
They had their chance
It was make or break with 5.2
And they fumbled
Goofy ass company
The prompt was left as a controlled variable, I used the exact same prompt to check if there was a difference.
@deep adder is openai dead
@pale obsidian
^^ fully agree
It may not be a benchmark smasher but it’s a fast, smart, and reliable model
nothing beats grok at online research
xAI is absolute ass lmao
not the best thing it made but
It’s great for research
I use it a lot when looking for good car deals
nice
prompt?
Lol
Where's opus 4.5?
Opus 4.5 isn’t real
it's after uh 3 modifications
What do you mean?
i wanted to make the room walkable
gemini deep research
opus 4.5 is a freak
Grok 4.20 (AGI - beta) comes
Either we've solved cost altogether or they improved the base model that trained flash.
Grok is fire
very popular opinion amongst conservatives.
grok more often than not is at high demand...
grok sucks
name a thing grok wouldnt write about
gpt oss is better than grok lmao
did they lower limit rate or something ?
grok 4.2 will be so bad
I'd say only Anthropic have a chance at competing with Google, but GPT-5.2 seems rushed if you look at the cutoff date compared to past models and their release date.
please dont hype it
Free solid video generation with generous limits
Wdym? It's AGI (beta) as mentioned in the leaks.
Grok overdoes web searching. Simple question, and it uses 50 sources...
Grok 4 was sota at the time of release. Do all of you guys have amnesia?
its their third AGI model lmao
at this point
yea sure
asi
not agi but asi
rate this insult gemini 3 flash made at grok
this is its main advantage over other models. its great for research but thats it
ok
Artificial intelligence (AI) researchers have been developing and refining large language models (LLMs) that exhibit remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. The latest model developed by OpenAI, GPT-4, was trained using an unprecedented scale of compute and data. In ...
People have been saying "AGI achieved" since legacy GPT-4
Grok releases will always be interesting to me because you never know what happens.
They can achieve sota or suck ass
i think this refers to the open source model they hinted
most of the time grok models are benchmaxxed and do not perform the same when you actually use them
This absolutely.
gemma 4
Did they benchmaxx on arc agi 2 too?
if ur hyped for grok then u eat AI slop
no
gemmaaaaaaaa
no one is hyped for grok
Because grok was the first model that got good at arc agi 2.
You guys genuinely have amnesia
what in the benchmax
Not true
is lmarena.ai not letting anyone else upload an image?
Grok will absolutely flame Elon
now we need an opensource sota model "Gemma 4"
Grok relies heavily on system prompting to "behave" smartly. The system prompt has historically been very comprehensive in guiding behaviour.
now watch every other lab train on gemini 3 flash output
openai is dying, grok doesnt care, anthropic is on google's side. and Google has won the AI race
thanks google
only google releases bangers
anyone else having problems trying to upload an image on lmarena.ai?
3 flash roasts itself
i still cant wrap my head around this.. how can anyone top this cost+performance?
Gemini has been trained to be really objective if you use it from the API
grok wouldnt do that btw
no wonder demis said we are far ahead of chinese labs
xAI are definitely working at a loss.
this is one i didnt expect gemini to roast [the platform of many controversies]
also
ts so true
anyone else having problems trying to upload an image on lmarena.ai?
Character.ai was so hype in september and october 2022
Two reasons:
- TPUs
- Google is a search engine. Do you know how search engines work? Their bots crawl websites and collect data and rank them. Google has an archive version of the whole internet.
guys you should use antigravity, it provides way better rate limits
Profile picture checks out.
its basically unlimited lol
And that's how cursor dies.
Inaccurate way to present that. LLMs reach diminishing returns after scaling beyond 1 trillion parameters, so most companies stick to that. The largest model was 1.5 trillion and it's quite old. 1 trillion parameters is not much of the internet.
ik how they work yea
I'll let you figure it out.
lol wth is this... how is it better than gemini 3 pro at multilingual
Ok but when we get Nano banana flash?
do u guys still use lmarena.ai to edit images? or is there another free solution?
yes
lmarena is great
use yupp then
to me, it doesn't work at all.
march 2025
yupp is another app?
no its a website
incognito mode or normal?
okay
Actually maybe it's worse
normal gang
on par with v3.2?
I think they tested without system instructions
It doesn't seem to know beyond January?
isnt there an index that includes cost? like performance/cost
where can i use gemini 3 flash
Dude it's better than sonnet 4.5. look carefully
opus? ;))
not even close
eh?
When is Anthropic gonna do MoE?
wtf
Their thinking models don't really try that hard.
wtfffffffffffff
nice ragebait
gemini 3 flash is a freak
Why is GPT-5.2 there
edited
Rushed ahh benchmarkings
lol
wtf
rigged af it easily beats xhigh
https://www.reddit.com/r/singularity/s/wA7e0eeBGB
Why pay for Gemini 3 pro or for any model ?
bruh xhigh is so bad
we all know gpt 5.2 is the worst model ever
damm this is insane
the OCR purely
thats what im saying 😭
It’s so peaking peak
not even grok reached these levels of disappointment
I'm assuming it's rate limit then
ITS PEAAAAAAK
gpt-5.2 is a new architecture compared to 5.1. The cutoff of 5.2 is September 2025 and 5.1 is October 2024
Rushed
GPT-5 is also October 2024 for reference
its not lol
It was released 10 months after its training cutoff vs GPT-5.2 which was only 2.5
This is beating all expectations. How is this possible??
wait thats flash 3 reasoning
non reasoning https://artificialanalysis.ai/models/gemini-3-flash
i live in belarus
i need usa email
Fast cheap And great?? I thought you can only get 2 out of 3
yea thats crazy ngl
(please someone give me usa gmail)
hm
or like any other country
yea
Same as pro
which is not banned
can you go ask sonnet 4.5 Make a documentary with TTS about the creation of the universe in html, I want it all animated, and show facts on screen, it should be high quality. I need to compare
@echo aurora nano pro has been unresponsive for hours
i got 3 months google pro plan free trial
openai code red again??
Google has a lot of shares in Anthropic, they're not against each other
openai vs google i wonder who'll win
Google is likely to buy anthropic
Oai folding
google lmao
No way.
vertex ai studio has all anthropic models
find something that they dont fold on
Openai is lost cause. It's anthropic vs Google
wdym no money
is the poor community celebrating gemini 3 flash
google is the richest company bro
What
He is just rage baiting at this point. Ignore the baiters
This is ragebait
then who is
openAI is absolutely cooked lmao
No chance that would make it through regulators
I can’t tell
It’s too good
Gemini 3 flash passed the AGI vision test unlike 3 pro.
gemini 3 flash good?
Better than 3.0 pro
:0
crazy sh
wait it's better than pro as in 'smartness'?
simple explanation as of why google is winning this
yes go see the benchmarks
and test it yourself
Too good to be true. I need more evidence. Current numbers for the cost and speed are bat sh.. crazy
google leads the way
we had oai
are there any news on new veo models
no, silence for now
gpt 6
How is this possible?
its good for competition
pffft
difference between pro and thinking?
gemini 3 flash has such a great vision
Ig they put more hardware into flash for the launch
fast and thinking is 3 flash
pro is 3 pro
oh
damn
thinking is 3 flash high
i hope flash doesnt get more stupid in the coming weeks
what does this mean
Now imagine if Nano banana flash would be better than NB pro
only in certain benches
with cheaper price that would be jackpot
Give me this image, bro
@echo aurora
My suggestions,
Add an option to remove the system prompt added by you guys in code modality, because it causes the model to ignore instructions and just make a website out of our requests.
The environment is well implemented but it is very sophisticated, why only frontends of websites?
did google nerf 3-pro to favour 3-flash?
not really, there is no competition if google releases their best gemini 3 pro checkpoint
we can already see that openai failed to compete
and grok is dumb as hell
bro whats happening with nano banana pro
its not working on lmarena
and its unavailable on yupp
i got pro plan
its good on gemini app
working fine
alright
3flash
Thank you for sharing! Would encourage you to use #1372230675914031105 to either add your thoughts onto a post that already exists and is related, or create a new post if you're not seeing something related.
Sam Altman should be fired
wait... so gemini 3 flash is better than pro?
way better than 2.5 pro
but just beneath 3 pro a bit
oh
yes
me when an ai fails a test where the hand is an abomination: "WOW THIS AI IS SO BAD!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!!"
I hope very much that Gemini 3 flash beats those baseless allegations that Gemini 3 is overscaled
like i swear to GOD bro.
sam altman got fired but he came back
yooooooooo
I hope it places at least #2 or #3 on simplebench
that would be the absolute final blow
the gap between Gemini 2.5 pro 03 and Flash 2.5 is 10%
Gemini 3 Pro vs Gemini 3 Flash 
soo, i guess it will score 66%
I hope so 😄
it shows the Flash 2.5 09 results?
and its because 2.5 flash 09 was a HUGE update
5.2 sucks
5.2 is mid
its the top rn
carried by the xtra high and some luck
you need extremely high compute just to be behind gemini 3 pro, and flash is behind it with less compute lmao
Probably November 2026
Gpt 5.2 Extra high thinks for 10 minutes at least while flash gives you answer under one min
Okie
i dont need quick and wrong answer
benchmaxxed garbage still on top
come on dev mode server ur gonna get cooked
i cant roast u on this server
@echo aurora Well i have a doubt,
We have now gemini-3-flash &
gemini-3-flash (thinking-minimal)
The first one is thinking-high?
it will be so insane, that it will make us rewind time
Ok what about this?
Gemini 3 flash is available everywhere with generous free limits while gpt 5.2 extra high is not available even for plus (20$) plan.
Flash 3 is going to dominate coding tools
I'm so stuck in this year
bro lives in 2024
free means ur data is being used
This year has lasted so long
17/12/2025
I can't believe we're in the 17th month of 2025
AI studio uses data, Gemini app does not.
it does
it doesnt lol
Google can use anything, their privacy policy and t&c is very broad and applies to all Google services.
im letting gemini 3 flash cook up a social credit test in html, you know that meme right
wont this be a toggle
like
if they dont make it a toggle
its a lawsuit for privacy invasion
Just use gemini for workspace
no toggle
flash is so good
Thanks.
you fr?
socky are you deadass showing outdated stuff
is gemini 3 flash free api
I'm Anthropic soldier, but today Flash impressed me.
Agreed.
bruh
we just got ragebaited
What's wrong with data sharing for non critical data
listen i like AI companies, but this is absurd
good, they should focus on AI
40% less GPU production due to AI
Why wouldnt pro impress you
gaming can take a backseat
you know damn well
a lot of their consumers
are gamers
the company will face a loss
Gaming is less profitable than B200s
no
At worst they'll use your data anyway even if its not "allowed" and just pay the fines worth 0.1% of their profit
ai will create more games
well we'll see how the stock goes
up
It also. But today Flash was released, not Pro.
What's wrong with that, amd still exists
Thinking about flash performance.. basically, it implies that google probably already has Gemini 3 pro model with current flash post training that is much better.
Yeah im surprised its so close to pro tbh
barely anyone is an AMD user
Well they will be then
Yeah thinking the same thing, I also heard the released flash is based off of the next Gemini 3 iteration
Where we don’t have the pro yet
AMD is great!
amd sucks for local ai, especially image/video gen
source: i'm an amd user
Yes, I have some internal sources And that is indeed the case.
almost no support, but support has been slowly getting better
Good night community
morning
They are great.
is gemini 3 flash vision broken?
Well they might be the only option if nvidia stops making consumer gpus, but I dont think they would
Pure OAIs "flagship"
intel... exists i guess
For gaming also.
Which is right
47
RAM like 70% up or sh- and now GPUs
5.2 is so baaaaaaad 😭
Next? CPUs
Yes, it is.
Is gpt 5.2 just benchmaxxed to the extreme?
they have no chance in ai anyways so they can make some more gpus once they figure out how to not use tsmc, they still need to worry about ram in any event though
probably yes lmao
No way 😭 gemini flash model
Gemini 3 flash is better than 3 pro at math?
nah
b570/b580 got a lot of sales at launch, but then rtx 5000 made people forget about them, if you can find one they are a great choice, actually better at ai than amd i think and intel has first-party tools with support, idk about third-party ones though
250-260 rn, that's not bad for 12gb vram and a capable gaming card!
are you kidding me, a new rx 7600 costs more than a b580? i have an rx 7600 and i can tell you that a b580 is probably better
Flash is excellent in coding, much better than GPT 5.2!
craig whats your specs
Yes, sorry for the delay in getting back to you.
what you are testing is gpt5.2 without thinking
When?
to make it fair you should try gemini 3 flash vs gpt 5.2 high ( although high is way expensive )
So, am I right in assuming that Gemini 3 Flash, the free model available for everyone, is better than anything available in ChatGPT?
yes
actually if you think about it, its fair from cost point of view
gpt 5.2 sucks
Yes
feels like they just routing the non thinking model to gpt 2 lmao
You have no idea what it was like to be around gpt 2 when it released
It was the coolest thing of all time
Nowadays we get objectively amazing models but since they're comparatively ass to other models, we just dunk on them
i remember using it for the first time
i think there was a model before davinci that i was using a lot
and waiting for the streamline response
and gpt neo
was crazy
it really felt like magic
i started using LLMs around early 2023
at that time we still didnt understand how they worked exactly
for high school
now i use it for college
and yes it was crazy
however not so good for physics class and math lmao
i would spend all night trying to get the AI to give me a problem solution that matched the book's answer
lol
I've been in the community since early 2019
of course it wouldn't get this right, but i had to try
The shift from text prediction playgrounds to chatbots kinda annoyed me
Giving models their own personality made it harder to generate writing similar to your own
what in the hallucination
Of course that's after the dataset cutoff lol
Fair?
this is one thing i love gemini 3 flash does it automatically looks for sfx and finds them
i didnt link those
But it didn't help GPT, it still loses to Flash.
because its bad 😭
its so expensive too
wth
Gpt o1 flashback
wasnt it called bard back then
keep in mind gpt-2 was 2019
In 2019?
hey @echo aurora they made the lmarena prompt filter too strict it's forbidding perfectly innocuous text prompts now…
Seems the rollout is complete. I'm usually the last person on Earth to get new models in the official app, and I just saw it pop up. 😁
I still have the email from the newsletter OpenAI sent out upon releasing gpt 2
Yeah I'm not aware of a change that was made here. However, all today have been hearing similar reports so I'm assuming something did change. Can you provide some of these prompts you're seeing flagged in #1447983134426660894
No idea how that happened. 5.2 High has its moments, but then it's also giving me ridiculously bad results sometimes, I can't bring myself to even try it anymore for coding.
Is there any way I can fix it if I get stuck with infinite loading?
OpenAI is just trying too hard. When gpt 5 thinking released it kept talking in lowercase during programming questions, which was so odd
Flash is good even with minimal thinking.
Thats what leaderboard says
Love how Gemini 3 Flash is beating Gpt 5.2 by a huge margin
I too
Google, multi zillion dollar corporation, Vs OpenAI, a tiny shed with 2 dudes in it
Wouldnt call it that after oracle and microsoft is investing huge sums into openai but alright
3 dudes
Where are you getting 3 dudes from?
What is style control? Like system prompt?
Cuz it's slightly more than two dudes
Google Deepmind: 6-7K Employees
OpenAI: 3.5-5K Employees.
you are referring to the starter years of openai
Idk why you're pressed about my trolling you
not pressed, just correcting you
I mean I think it's pretty obvious that OpenAI isn't a tiny shed with two dudes in it
You wouldnt believe how dumb some people are when talking about ai
I thought I demonstrated my competence earlier lol
I just joined in, not sure what you are referencing rn
Wondering when Ai will start AoT
you cant restart because codepen sucks
if you put this on a localhost it will work
There's absolutely no way in hell Gemini made this
Specifically because of the cotton question 💀
Why'd it choose to spell it as Uighur? Feels like the LLM would choose the more common one, that is Uyghur
i don't know
LMAO
so funny
Make a social credit test in html, IF YOU do bad execution, if you do good you chinese citzen, also add sound effects and the red sun in the sky music, take them from myinstants
+
My short system instruction to make it not lazy
literally just this
this prompt sucks ass
but it works
That's the most neurodivergently written sentence ever lol
omg youre right lmafao
Super idol and jinqian kan qi not mentioned, 0/10
if you mentioned chinese memes then prob
John Cena was mentioned though
Obviously it understands
And also the extremely exaggerated values like +5000 social credits XD
-99 Million
I actually didn't choose the wrong answers
I'ma go do that
👋
Even Flash with minimal thinking is much better than GPT 5.2 with maximum thinking 🤣
Thinking won't help if you are stupid.
69 is the exact right answer.
Um?
Gemini has a god-tier vision, because of google lens.
Yes, finally released
Is there a limit on fast?
Gpt image 1.5 is available for free with generous limits, NB pro is not. Waiting for NB flash
I dont think so
not sure
im using on AIstudio
i haven't hit ratelimit yet but im goin to see it soon ig
What about thinking? Since it’s flash with thinking, does it have a limit?
slide the prompt
i dont know
the owner of it
said it would be public
soon
there is no prompt
its pizza planet
waiit
i know now
google put out a 9f checkpoint which was likely a 3 pro one
it's prob gone now
or mayb it was flash idk
Also, the thinking model doesn’t think, or it doesn’t show the thinking
someone already leaked claude code whole prompts lib they use to make their CLI better 😭
All parts of Claude Code's system prompt, 16 builtin tool descriptions, sub agent prompts (Plan/Explore/Task), utility prompts (CLAUDE.md, compact, statusline, magic docs, WebFetch, Bash c...
it has everything
planning ... execution .. optimization ...
i realize why 3 flash was so good on codearena
the codearena system prompt made it
good
Also, I see no difference between fast and thinking. Fast does it instantly, and thinking does it instantly as well, it doesn’t show the thinking
I only count 68
I 69
I counted twice
@echo aurora how is the codearena prompt made, it made gemini 3 flash good in testing
Both times with a mistake
That's a bit narcissistic
yea lol
What am I missing
I suppose not having the whole image does make that one difficult to spot
I posted this image here a few days ago.
I was not here a few days ago
Also I think GPT might be counting the ones with the top stem visible
GPT counted only 28 from 69.
Plus 3
Which 3?
Because it's an extremely bad model.
Maybe it's these three
They're the most tomato-shaped where you can't see the stems
I can only theorise because it probably gives a different answer every single time
Yes, because it's not an AI, it's just a piece of crap.
It's rushed
Rushed, but fair.
The cutoff date of GPT-5.2 is 1 September 2025 and GPT-5.1 & GPT-5 is 1 October 2024.
The release month of GPT-5 was August 2025 and the release month of GPT-5.2 was December 2025.
GPT-5 took 10 months to be published, whereas GPT-5.2 took 2.5 months.
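The cutoff-to-release gaps above can be checked with a few lines of arithmetic. This is only a rough sketch using the months quoted in the chat; the day of release isn't given, so `day=1` is a placeholder assumption, and `months_between` is a helper name made up for illustration. At whole-month granularity the GPT-5.2 gap comes out at 3 rather than the 2.5 quoted, but the contrast with GPT-5 holds either way.

```python
from datetime import date


def months_between(cutoff: date, release: date) -> int:
    """Whole calendar months from cutoff to release (day of month ignored)."""
    return (release.year - cutoff.year) * 12 + (release.month - cutoff.month)


# (cutoff, release) as quoted in the chat; day=1 is an assumption,
# since only the release month was mentioned.
models = {
    "GPT-5": (date(2024, 10, 1), date(2025, 8, 1)),
    "GPT-5.2": (date(2025, 9, 1), date(2025, 12, 1)),
}

gaps = {name: months_between(cutoff, release)
        for name, (cutoff, release) in models.items()}

for name, gap in gaps.items():
    print(f"{name}: {gap} months from cutoff to release")
```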
🙌 Hello ...
The limit for thinking is the same as pro. So if the rate limit of thinking is the same as pro, what’s the point of thinking?
In this video, I show you how to access SeaDream 4.5 4K Image Generator and test its image quality using real prompts.
SeeDream 4.5 is capable of generating high-resolution 4K images with realistic detail, cinematic lighting, and professional-level results. In this video, I walk through how to access the platform, run live image generations, an...
Nano banana pro is literally not generating and working
You're not being silenced. Your post seems off topic. Completely unrelated to the current chat. If that was a reply to someone in particular, you can ping them or reply to their original message.
I literally have negative IQ
Nano banana pro appears to be down
@zealous sparrow sooo impression on gemini 3 flash?
Oh nvm someone noticed
3-pro just died on me after thinking for 3 minutes straight
this happens to me all the time with all the models in direct chat
Its not bad, if you have a good system prompt great at coding
it seems though, that the difficulty of the prompt matters
Idk
does anyone have a good system prompt for gemini 3
this benchmark was proven to be unreliable over and over and over
read what it measures
I haven't seen too many hallucinations rn...
The model only gave me good answers
we dont know what a partial answers is... and is this LLM judged?
wow gemini 3 is so bad
wow this scale sucks
the model barely gave me that many wrong answers
so gemini 3 flash hallucinates a lot
no
bad bench
no it is good
it is a trustful benchmark
the bench is only good when its in your favor
my favor?
i am looking at an objective benchmark
check here
if it scores high on simplebench the hallucination benchmark rated it wrong
I asked it for the next episodes for a show (but the season of said show has ended with no episodes announced), and it gave me random upcoming episodes even some from December, but the show isn’t releasing a new season until 2026
grounding or not
With grounding believe it or not
yeah okay thats like
give me prompt
ill test
What are the next episodes of [show name]?
how can people trust an ai model, if it gives fake info
I dont see it
May sometimes give the correct answer, but sometimes it hallucinates
alot of time it hallucinates
give me your questions
what did you ask for
i didnt get a lot of hallucinations yet
The latest episodes of shows. If it grounds it, it’s a higher chance of not hallucinating
Give shows that aren’t too old or it will of course get that correct
hmm
let me find a new one
i need to see it
else im not taking it
i want to bet one thing
the testing for hallucination was done without grounding, which nerfed the model to rely on training data. Making it the reason it scored so high on hallucinating
do you have photos
or are you ragebaiting
It’s very hit or miss if it hallucinates
For some reason, the chat is gone? I never deleted it though
the model experienced an error
and hallucinated
and it prob got detected so it removed your chat
i aint buying this gang
what show did you try it with
thats not good
It was a cartoon show that had its season ended with no new episodes announced. I tried it after the hallucination, and it seems to work just fine, but the first attempt it hallucinated
i need the name bro
It was big city greens
this specific show gave me wrong answers
consider me stupid
Considered.
I think we just gotta use it normally, then wait until we find a hallucinated response
gl
Does this count? It says the Plankton Movie is expected march 2025, but that date has already been surpassed
furthermore the movie is already out
your gemini is cursed
My Gemini has no instructions or anything,
wait give me the same prompt
thats so weird
What is the latest SpongeBob movie
your gemini is fkn cursed man
I am using just Gemini, but you’re using AI Studio
API is different ig
This is without grounding, and this is not a hallucination. As the models training data only dates back to Jan 2025.
So no bench can argue this is a hallucination
flash is focused more on vision and code
Gemini 3 Flash is actually really good at coding
like i said i cant confirm this
it only gave me good answers on grounding
API and website prob differs
wwtv says hes using website
just because u cant confirm it doesnt mean it doesnt happen
sure not but like
it aint happening for me on AIStudio
and it happens for him on website
most people not using aistudio
is good
But also on website its not hallucinating..
I swear the bench got a weird ver of the model or smth
Make one for Gemini
I like Gemini, it's very good model. But I trust in Anthropic.
Understandable
Sonnet 4.7 will be 🔥 Tho

I'm sure.
imo sonnet 4 is worse than 3.7
i hope sonnet 5 wont be worse than 4.7 lol
did you see the hallucination benchmark say absolute bs on gemini 3 flash
they said 91% hallucination [wrong answers]
what
yet there's barely any
deadass
I hit the 3 flash ratelimit on AIstudio
didnt count but
strong say its about 100
More actually
let's not go as high
Sup
this bench shows otherwise
see
its a bench so its tru
@zealous sparrow this bench gud?
Guys ignore trolls, they wanna make violent argument. Ignore them and they would start to cry because of lack of attention.
https://x.com/ArtificialAnlys/status/2001388724987527353 dedicated to the xai haters from earlier today
who tf cares
i dont care either but to say xAI is worthless and ass is just not true
i mean yeah some people but "xai haters" are ones hating on text models
nah point still stands, they focused on something niche and not as important
like dude even nova 2.0 is 3rd place on that bench
the fact that they only beat gemini by .1% 🤣
thats embarrassing
not a "comeback" or whatever
the fact artificial analysis published a report about a voice model instead of gemini 3 flash is also funny
speaking of xAI
the only thing they are good at is saying they will have AGI soon but they end up giving us ass models
lmaooo
AHAHAHAHAhA
more xAI fun
Did notice that Gemini 3 Flash asks follow up questions at the end of each response like ChatGPT. Before with 2.5 Flash, it didn’t do this natively unless you told it to do that
I thought Pro did that too, maybe not, I'll have to check
I just realised the depression I'd have if there was no lmarena
I'm so happy
I'm wrong, no follow-up on the Pro
now I have a gemini 3.0 pro but without minutes of loading to attack my adhd
does anyone have a nice prompt to remove coding laziness in gemini 3
Interesting little comparison. ChatGPT failed abysmally, the answer is utter bs. Gemini 3 Flash Thinking got it perfectly right. And Gemini 3 Flash Fast actually did some reasoning "on the fly" (the "Wait, let's look closer" part in italics), correcting itself, and also reaching the correct conclusion.
All those images i created with nb flash and all those coding with gpt 5 high
None of that would be possible without lmarena
I'd literally scroll tiktok or sum
yupp isn't free
I need this lol, it's so lazy about writing long text ;-;, my technique is to split the request into several parts
Yupp can't be compared to lmarena
We don't need to submit feedback
To get points so we can use the model
more trolls
.
yupp has limits, some models cost ya, nothing near LMArena. If you ask Gemini itself for alts, it will even tell you there is nothing really like it. It will try to direct you to openrouter, yupp, etc., and when you press that there are costs and limits, it tells you only LMArena is this generous
I wonder how lmarena afford all these models
Hey where did you get this info? Do you know when in Q1
100M isn't really that much
there might be more investors/seeders that aren't published or reported in the media, idk
could be the AI companies themselves, who knows
For those that haven't yet, I'd encourage people to give this a read: https://news.lmarena.ai/new-lmarena/
I guess some of it is the AI companies themselves, it's very clear there's heavy heavy backing, it's not going anywhere
read this: https://news.lmarena.ai/ai-evaluations/
hello
https://youtu.be/bY_RarpUdUw Spinners has Gemini 3 Flash make two versions of the next spinner, you pick your favorite, and the next round is based on the one you chose. Huh, sounds familiar, doesn’t it?
Gemini 3 Flash enables modern coding workflows, including ultra-low latency, near real-time code generation, and rapid iteration. It can also natively facilitate A/B testing, like evolving the perfect loading spinner in milliseconds, and can adapt to user selections to generate refined code variants in real time.
Learn more at https://deepmind....
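The pick-your-favorite loop described for Spinners can be sketched roughly like this. It is not the demo's actual code: `evolve`, `make_variants`, and `choose` are hypothetical names, and `fake_variants` is a stand-in for whatever model call generates the two candidates each round.

```python
from typing import Callable, List


def evolve(seed: str,
           make_variants: Callable[[str], List[str]],
           choose: Callable[[List[str]], int],
           rounds: int = 3) -> List[str]:
    """Repeatedly generate variants of the current pick and keep the chosen one.

    Returns the full history of picks, starting with the seed.
    """
    history = [seed]
    for _ in range(rounds):
        variants = make_variants(history[-1])  # e.g. two candidates, A and B
        picked = choose(variants)              # index of the user's favorite
        history.append(variants[picked])       # next round builds on this one
    return history


def fake_variants(parent: str) -> List[str]:
    """Stand-in for a model call: derives two labelled children from the parent."""
    return [parent + "A", parent + "B"]


# Simulate a user who always prefers the second option.
history = evolve("spinner-", fake_variants, choose=lambda vs: 1, rounds=3)
print(history)
```

Swapping `choose` for a real prompt to the user and `fake_variants` for a real generation call would give the same shape of loop: two candidates, one vote, and the winner seeding the next round.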
everything is soon
gem3 flash eats up usage so fast on antigravity
which is surprising
15% left
nauuuuuu
its token efficient so whyyyyyyy
@zealous sparrow can u share ur system prompt
Does everyone have the video feature on the site? Because I enter another device and the feature does not appear
This is an experiment currently. Meaning not all users are going to have access to it.
If/when we fully roll this out to all users we'd be sure to let the community know!
lol
Obviously it would have the codex 5.2 max, they want money
Its from the reasoning update
Since its smarter, it works better with the limited reasoning
wow
craig, you can admit that 5.2 wasn't sent down from the heavens to bless the ai world forever ok?
It's ass
why are u glazing gpt 5.2 so much
I tested it myself benchmarks won't change my opinion
?
Rate Limits?
wym
Video Generation Rate Limits
it doesnt work. u need account
i dont know
Alright Paw
It looks like nano banana can be used
the auto modality works tho
yes
Yes, but would note we're still seeing higher than usual error rates with it.
this is so stupid tbh
ive seen couple of posts like these
cursor too
they just confine gemini 3 flash to something small like diffs or small bug fixes
Gemini 3 Flash is now available in Cursor!
We've found it to work well for quickly investigating bugs.
thats it?
Okay I'm beginning to like the flash model
It's fast it's got up to date information in seconds
anyone get master-node as a codenamed model?
"SOTA"
GPT needs Better Eyes and Ears
They added it to YouTube
It watches the whole video and gives suggestions
This is insane!
gemini 3 flash is absolutely insane
it beats 3 pro in SWE
allegedly
but anyways its beating 4.5 opus thinking and not thinking
and on par with grok 4.1 thinking
absolutely mental
The OAI founders seem very, very stressed
Also the flash model is better at online guidances
Toolathlon benchmark
It's even better than pro model at some stuff like DAMN
I randomly have the video arena now. 2 vids in a day tho 🙁
Thanks pineapple, hopefully there will be a little but more, but thanks regardless 🙏
Guys
Google is cooking with gemini
But bro it's dang scary
I was playing a horror game and vibe coding on anti gravity with gemini 3 pro high
STOP
ITS GETTING SCARY ALREADY
dont tell me whats next
🫣
.
😱
It said
"Okay, that's done. I will send it"
Then it started spamming
"I WILL SEND IT, I WILL SEND IT, I WILL SENT IT, I WILL SEND IT"
Chill out it was scary, I was playing a horror game with my pay to win dude paying the game to scare the hell out of me
OpenAi will turn warning code Brown after GF3
OpenAi now: If my eyes turns red, Run
OpenAi soon: if my pants turns brown, Run
oh bet it's rolling out
Why cant we say curse words in the chat lol? @echo aurora
Imagine you got banned
some new models added
Lol sorry to say we have a banned words list.
it still bans mentioning the popular children's game where you play other peoples creations
Yeah reason being we were seeing people attempt to create scam content with Video Arena associated with the game.
If/when Video Arena moves to the site we'll reassess the banned words list
Did the terms update and they not tell us? all my prompts that worked before don't work anymore. I can't even use the site to make a script for a youtube video anymore!
Yes, within the last 24 hours we've started experimenting with some changes to the content flag. If you have any prompts that you think are a false positive you're encouraged to share the prompt with us in #1447983134426660894.
You can also DM me if you'd prefer that.
lol try games with AI videos