#general

1 messages · Page 193 of 1

limber crag
#

Nope

keen beacon
#

Hmm

limber crag
#

Rate limits don't last for hours

viscid cloak
#

Just one quick tech question if gemini3 image preview’s server undergoing some troubles, is Battle mode gonna less likely to call its api?

keen beacon
#

I only use battle when I need to use lm arena

echo aurora
obsidian cargo
#

Zoot Suit Samus

spark python
#

i have the same problem

keen beacon
#

Nawh Soras got more smooth animation like they do movies and stuff

#

But surprisingly grok is really good at animation

echo aurora
viscid cloak
#

What about hailuo? Are those popular anime to real human films usually made by hailuo?

echo aurora
obsidian cargo
#

prompt was literally just "zoot suit samus" too

spark python
obsidian cargo
#

oh hey got another success

#

metroid bread

dark sage
keen beacon
#

Would you speak at the devil look what I just saw on YouTube

#

Hehe I want to go check this out real quick I’ll be right back guys

spark python
#

Banana pro is like sora 2 first day

pulsar agate
#

GUYS

spark python
#

no guardrails

pulsar agate
#

GUYS

spark python
#

What

pulsar agate
#

WHATS THE DIFFERENCE BETWEN VIDEO ARENA 1 2 3

spark python
#

Yall know another site I can use nano banana pro for free?

spark python
pulsar agate
keen beacon
#

Ya it’s gunna get nerfed soon lol

#

Disney is about to launch their own and they’re about to start charging royalties

spark python
#

open ai is censoring their best model

#

and lowering the quality

#

just to get money

obsidian cargo
keen beacon
#

They just need to get these court cases over with dammit

#

They’re going on too long just make a decision if it’s OK or if it’s not OK

spark python
#

they want to use it themselves

#

so they make their own money from ai

keen beacon
#

Yeah, I know but damn somebody needs to step in the middle and just decide once and for all if it’s fair use or if it’s not

obsidian cargo
#

they need to dismantle intellectual property laws, make all fictional characters creative commons

spark python
#

only fair if u pay them

keen beacon
#

Man, the law is always so behind

#

I can’t imagine the state like California was gonna stand still

hushed gyro
#

@echo aurora Am I allowed to repost my feedback in different posts because I dont have any threads rn

#

like each issue to each post

keen beacon
#

I know the game plan

echo aurora
keen beacon
#

They’re gonna do the same thing they’re gonna partnership

#

And they’re gonna do it as soon as the technology starts getting good enough lol

#

They can’t put the genie back in the bottle and they know that so the next best thing is just partnerships

hushed gyro
spark python
#

udio is so bad

#

model wise nothing is new

#

they have 3 models

keen beacon
#

So what does all this mean, that you’re gonna be paying for ai tools to generate content from intellectual property brands and then you’re gonna be paying these companies and you’re not gonna own the content you generate

keen beacon
spark python
#

pineaple

echo aurora
spark python
#

I got a working image

echo aurora
keen beacon
#

Speaking of Suno

#

Dude, that company generates so much money. It’s insane. I would’ve never guessed.

#

Yeah, look you guys you can’t make this up

#

🤣🤣🤣🤣🤣🤣

#

Despite being sued by nearly every record label, that’s hilarious

#

They’re the last biggest target

vast fern
#

I LOVE SUNO

keen beacon
#

Yeah, they’re the big target now for the music industry now that udio fell

vast fern
#

lmareana music ai leaderboard when ?

keen beacon
#

Udio AI - the popular AI music generator, disabled downloads overnight after settling its lawsuit with Universal Music Group (UMG). Millions of creators lost access to their songs, sparking outrage across the AI music community.
And is Suno AI Next to fall?
#sunoai #udioai #aimusic
In this video, we break down:
Timestamps:
00:00 – The Night Ud...

▶ Play video
vast fern
#

yeah i wanna listen two different music and pick best one

#

fu kk labels

keen beacon
#

Well, it’s super messed up how they settled it

#

You gotta look into it and I’m sick. I sound like a broken record always repeating myself about this story.

cloud zinc
#

ai voice

misty vault
#

ai sydney

neon idol
#

Ai pizza

wicked mason
#

ai sponge

keen beacon
#

All I’m saying is it’s super messed up what they ended up doing

#

The way they went about it

misty vault
keen beacon
#

It just shows that this whole industry is about money and not the users

neon idol
high ginkgo
keen beacon
#

All right well we’ll see where that energy is at if God forbid this comes to fruition

keen beacon
#

If you got people crying about that retry or resubmit button, I can only imagine what would happen and the outrage if God forbid this would occur to other parts of AI that we all enjoy.

hushed gyro
keen beacon
#

Yeah, but you know the users also have some blame

#

You need two to tango

#

You got die hards out here who still think chatgpt5 was an upgrade 😂

dark sage
keen beacon
echo aurora
keen beacon
#

This video made me laugh so hard

#

I can’t believe that guy freaked out like that and he’s a politician lol

knotty fable
#

Hirr hirr.

river minnow
#

@echo aurora I know you probably got asked this a lot by now, but is your team looking into the problem of the image generation? It always says "Something went wrong [...]". I tried it today in the morning and since then it doesn't work. Its the same for Nano-Banana 2, Seedream 4 etc.

knotty fable
#

@echo aurora Voting appear to be somewhat on the blink - not entirely but definitely glitchy.

dark sage
# echo aurora Glad to hear it!

Will there ever be any censorship added for character creation of IP‑protected characters (such as real people or famous characters) in Nano Banana Pro on LMArena?

sullen sand
spark python
#

Pineaple WE ARE GOING TO DIE

#

without the site working

#

our lives depend on it

#

I only got 1 image working

fickle venture
spark python
#

nvm pineaple im still alive I got another image working

fickle venture
fickle venture
tall tulip
#

Is it?

fleet lintel
#

I am able to generate images without issues (tried few just now)

abstract orchid
#

thats dope LMArena seems to have good catch with all llms

echo aurora
echo aurora
spark python
knotty fable
spark python
#

all what?

knotty fable
#

So no need to hack, just use those Ai's who allow such.

spark python
#

ahh the undress ones

knotty fable
#

[Talking about general NFSW now, not actual ic pron.]

spark python
#

what is ic pron?

sullen sand
spark python
#

ik pron but IC

#

like ai?

knotty fable
echo aurora
spark python
#

😭

sullen sand
#

bruh

spark python
#

I couldnt generate them anyway cus image gen doesnt work

#

🙄

echo aurora
#

But if you do have a reliable way to get past filters I'd love to hear about it so we can patch that.

abstract orchid
#

how is this possible

echo aurora
spark python
#

plus dont worry its nothing THAT crazy

sullen sand
spark python
#

its soft stuff

#

FOR what

#

I didnt send anything

#

I was just saying its possible

sullen sand
#

I THINK idk

cloud zinc
spark python
#

I know

#

but for pineaple

#

dont worry its soft stuff and niche

sullen sand
#

pinapple its a good person

#

and tasty..

cloud zinc
spark python
#

this makes me smash my desk

tall tulip
sullen sand
neon idol
spark python
#

gemini when I tell it to put a picture on a wall:I can help with editing images of people, but I can't edit some public figures. Is there anyone else you'd like to try?

limber crag
limber crag
echo aurora
knotty fable
#

Send images of that smashed desk please!

hushed gyro
#

WTF is this lmao

fiery gull
fiery gull
hushed gyro
#

Chat is Gemini 3 pro superior to gpt 5.1?

cloud zinc
#

its good

knotty fable
#

Hugging face guys are a riot at times. 😺

fiery gull
hushed gyro
cloud zinc
#

grok 4.1 >>> 3.0 pro

sullen quest
#

no

hushed gyro
spark python
#

pineaple fix this mess or I will do something bad to my body

hushed gyro
#

Guys I think kimi k2 thinking is underrated

cloud zinc
hushed gyro
fiery gull
hushed gyro
cloud zinc
#

u are hater of grok

waxen fern
#

Lmarena is now using turnstile again. God help us all.

hushed gyro
fiery gull
hushed gyro
#

Guys do you find the guardrail annoying

cloud zinc
#

u work for china?

fiery gull
hushed gyro
hushed gyro
cloud zinc
#

bro is glazing kimi k2

sullen quest
spark python
fiery gull
hushed gyro
fiery gull
hushed gyro
fiery gull
#

But the stable version no

waxen fern
hushed gyro
#

Guys is qwen 3 max good or no

Cuz I have been using it for a month and I think its quite good

fiery gull
native yarrow
#

they've been using it?

hushed gyro
fiery gull
hushed gyro
fiery gull
magic ravine
#

Welp, back to the errors for Nano Banana Pro 🫠

obsidian cargo
#

@echo aurora it's busted again 😭

#

Or still busted? Idk

jade egret
echo aurora
jade egret
echo aurora
native yarrow
#

brah 3:

echo aurora
#

Currently, sometimes it'll work, other times it won't. But we're working on getting this working again.

magic ravine
native yarrow
#

it worked for me just now

#

i notice it's just slower

jade egret
#

also so is claude 4.5 opus today?

native yarrow
#

the model isn't nerfed by quality it's just slower..,.,

fiery gull
hushed gyro
#

We really need an edit button

Seriously I generated the same prompt twice because of a lag 😭

native yarrow
#

yeah nvm it's down now for me too...,,

hushed gyro
#

LMArena is stuck on the verify page!

waxen fern
#

@echo aurora we need to remove the cloudflare verification

cloud zinc
#

opus released

cloud zinc
torn mantle
#

omteresting

cloud zinc
#

opus released

native yarrow
#

opus won't compete with gemini 3

#

unless.,,.,

torn mantle
#

seems like its small model ?

fiery gull
#

hm?

torn mantle
#

i mean smaller

hushed gyro
fiery gull
hushed gyro
#

😔

native yarrow
#

coding wise

queen veldt
native yarrow
#

still i don't imagine a model overtaking gemini coding this quick

queen veldt
#

Why is opus 4.5 cheaper than opus 4.1??

native yarrow
#

:P

cloud zinc
#

official opus 4.5 page will be up in 10-20 min

cloud zinc
jade egret
jade egret
queen veldt
#

Nahh that opus is sus

cloud zinc
cloud zinc
native yarrow
#

groks only good for uncensored creative writing brah

fiery gull
jade egret
#

let see if it betetr than gemini 3

queen veldt
#

It is in coding for sure

#

But their pricing is sus

#

Opus isn't cheap!

#

(shouldn't be)

jade egret
#

:0

torn mantle
#

consumes usage limits faster

#

pfffffffft

native yarrow
#

what's the point of using it if you get like 5 messages 💔

torn mantle
#

we will get it for free on lmarena

#

soon

#

didnt they like buy tons of gpus from amazon

native yarrow
#

lolz

cloud zinc
#

page is uphttps://www.anthropic.com/news/claude-opus-4-5

Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.

jade egret
#

it better than gemini 3?

cloud zinc
jade egret
#

:0

sudden coral
#

gemini 3 flash preview

#

lmao

cloud zinc
#

37.6 in arc agi 2 vs 31.1 percent for gemini 3 pro

native yarrow
#

hmmm

#

promising

#

eager for it to drop on lm

torn mantle
cloud zinc
torn mantle
#

🙂

fleet lintel
#

great work with Claude.

torn mantle
#

i was never fan of anthropic models, seems lazy in general usage

fiery gull
#

I hope it, I'm tried of gemini 3.0 pro

cloud zinc
fiery gull
#

deepseek v4 when? I'm tried of Opus 4.5

fleet lintel
#

so basically. coding use anthropic. everything else use gemini 3

OAI should just pack the bags at this point

cloud zinc
#

anthropic was waiting for gemini 3 pro to drop

#

before releasing

torn mantle
fiery gull
#

hell no

torn mantle
#

how

#

????????

#

noi difference between thinking on / off ?

native yarrow
#

dawg gpt sucks ahh

fiery gull
#

1/10 ragebait

native yarrow
#

chatgpt is unironically one of the worst models compared to what is out there now

fiery gull
#

wow the no thinking is soo good

fleet lintel
fleet lintel
native yarrow
#

id agree :P

fiery gull
#

Today is the OpenAI most irrevelant day?

fleet lintel
#

i was worried that google might fall asleep again. Love to see Anthropic giving good fight!

cloud zinc
native yarrow
#

well, now google will need a response is 4.5 is as good as ppl say,.,,.

torn mantle
#

they need to bring old checkpoints back

#

this preview gemini 3 pro is dumb

native yarrow
#

i want to test it brah but i am not paying anything to anthropic 💔

torn mantle
#

same

#

im not paying

#

why would i?

native yarrow
#

exactly you barely get any messages even if you do pay

fiery gull
native yarrow
#

i hope 4.5 is actually good at creative writing

#

unfortunately its going to be HELLA censored

#

as anthropic is

fleet lintel
jade egret
native yarrow
#

not as bad as GPT though

#

GPT is so awful

#

like it just makes up things and then it censors it because of the thing it made up when you didnt even mention that

torn mantle
native yarrow
#

pal chatgpt is nowhere near these two models as of late 💔

torn mantle
#

live on cursor

#

there is a reason why they said 'vibe coding'

fleet lintel
#

Gemini 3 flash will be launching in Dec. I think new flash model will be better than gpt 5.1 models 😄

torn mantle
#

its only good for vibe coding and not real projects

native yarrow
#

just test it for yourself brah

#

theres a diff

jade egret
#

no..

native yarrow
#

ts ragebait

#

💔

torn mantle
#

:/

#

dumb

cloud zinc
fleet lintel
#

really good work by Anthropic! kudos to them!

cloud zinc
#

google already pushing back

fiery gull
#

hell no, the openai have a respond lol, I love this era

cloud zinc
#

5% improvement in agent benchmark through system prompt

fleet lintel
jade egret
cloud zinc
cloud zinc
fiery gull
#

what is this mercury v0? 😭

fleet lintel
# cloud zinc

This tells me that base model gemini 3 is very very strong.

torn mantle
#

thats what i said

#

im always right sigh

#

gimme my prize

fleet lintel
native yarrow
#

google wants to keep their first place of course

fleet lintel
native yarrow
#

google wants gemini 3 to be the first model people think of when they think AI, etc

#

chatgpt still actually holds that spot

#

surprisingly

#

but its because they were the first :P

magic ravine
#

Btw, anyone recommend the best model for creative writing so far? Gemini 3 is still disappointing. (Gotta keep myself busy as we wait for Nano Pro to get fixed) GPT 5.1 is decent so far but still not exactly that good for novelized writing.

jade egret
#

hard to say

fleet lintel
native yarrow
#

yes, usually the one to ignite the ai bubble will always be the first model ppl will think of

fiery gull
native yarrow
#

bad censorship

#

💔

#

and gpt also has awful censorship

magic ravine
native yarrow
#

id say best quality is claude if you dont care about censorship but

random flax
#

“Claude Opus 4.5 is out. When is Lmarena going to add it?

native yarrow
#

decent writing quality with no censorship is actually grok, as bad as the model is

cloud zinc
#

openai focused on shopping, while anthropic and google are cooking

native yarrow
#

pineapple and the team i think are working on the nb pro issue rn

#

what is polaris?

cloud zinc
native yarrow
#

theres no point in using that if the censorship is off the rails though

magic ravine
native yarrow
#

you can barely do anything with gpt 5.1 like deadahh

#

ask like everybody on the openai subreddit what they think of censorship in creative writing with gpt 5.1

#

including myself

magic ravine
native yarrow
#

its pretty bad.,,

native yarrow
#

writing wise

#

and thats where its uncensored

#

:P

vast fern
#

dangg itt

native yarrow
#

and mind you i dont pay ofc

magic ravine
magic ravine
vast fern
native yarrow
#

yeah, i dont pay and ive never hit the rate limit yet

#

:3

cloud zinc
#

still gemini 3 pro is better for cost

cloud zinc
fiery gull
#

I'm using the opus 4.5 now

cloud zinc
#

gemini is much better optimized

magic ravine
fickle venture
#

Yoo finally opus 4.5

normal abyss
#

claude-opus-4-5 WHOAAAA

fleet lintel
#

Anthropic is all about coding now! huge market for them

slim gorge
#

opus 4.5 gonna destroy every other model ngl

echo aurora
native yarrow
#

theres 4.5

#

added to lmarena

#

sweet1!1

echo aurora
#

New model hype.

native yarrow
torn mantle
#

ive tried it

native yarrow
#

i think text is just uncensored straight up

torn mantle
#

anthropic models are lazy af

fickle venture
torn mantle
#

omg

native yarrow
#

DO Not waste money on supergrok pls

#

💔

torn mantle
#

'wont use opus 4.5 ever again

#

and dont bother

cloud zinc
#

where is 4.5 thinking?

#

when 4.5 thinking coming

normal abyss
fickle venture
magic ravine
native yarrow
#

is this the model?

magic ravine
#

Text only

cloud zinc
echo aurora
magic ravine
cloud zinc
native yarrow
#

but

cloud zinc
#

opus model is for code. not for text

polar niche
native yarrow
#

ive tried things that chatgpt just outright refuses and ofc it works.,.

inland quest
#

thinking when?

magic ravine
echo aurora
polar niche
#

Is sonnet 4.5 coming too?

vast fern
#

give me prompts to test opus

echo aurora
vast fern
magic ravine
native yarrow
#

ill just ask it to make an asteroids game

polar niche
native yarrow
#

similar to how gemini 3 was tested with that

fiery gull
#

🙂

vast fern
#

opus is the og for coding

echo aurora
round island
#

hmm

magic ravine
#

Again with the coding 😭

vast fern
native yarrow
#

apparently its lazy

#

i havent tried

#

i dont think its gonna be a bad model, it usually isnt

echo aurora
native yarrow
#

it just depends on if its really better than gemini 3

#

:P

hushed gyro
native yarrow
#

this is

#

interesting

#

claude output

#

oh i just got pieced by an alien ufo

#

💔

cloud zinc
hushed gyro
#

Chat are the hermes models well known

torn mantle
#

btw no thinking is kinda the same as thinking in swe

#

thanks lmarena for adding opus 4.5 this fast ❤️

empty stump
#

Im guessing its still expensive

native yarrow
#

yeah pineapple you the goat brah :3

sudden glacier
#

It's cheaper than prior Opus models but still expensive

echo aurora
empty stump
#

How many messages do you get on direct chat

magic ravine
native yarrow
fickle venture
#

Bro that's magnifique

native yarrow
#

photos not

normal abyss
#

it didnt cut me off at 5 messages, the rate limit being higher is actually so cool 🤩

fleet lintel
native yarrow
#

media you have to do supergro

#

but who cares ab that

echo aurora
modest prism
#

Is opus 4.5 better than Gemini 3 pro in non-coding tasks?

empty stump
#

Not bad

native yarrow
#

20 aint bad :P

hushed gyro
# vast fern promt?

Create a fictional URBANSCOOP newspaper snippet set in 2029 revealing a secret advanced PMC / private intelligence agency called DELTACELL.

fleet lintel
#

20 per person is more than enough

vast fern
fickle venture
magic ravine
hushed gyro
#

Chat how is the new opus

modest prism
jade egret
fleet lintel
dreamy sparrow
native yarrow
#

does 4.5 srsly beat gemini 3

#

find that hard to believe so quickly

fickle venture
#

Only

native yarrow
#

ah

fleet lintel
empty stump
#

Gemin8 3.5 better fix

vast fern
whole swallow
#

What if gemini 3 pro + opus 4.5???

magic ravine
#

But yeah, not that most of what I want written are explicit anyway. Got a thing for just simple fictional stories.

dreamy sparrow
native yarrow
#

what is this brah

vast fern
#

only multimodel is behind

#

that too they are not working on images model

fiery gull
native yarrow
#

like prose

#

its been good to me ngl

#

creative writing is the only thing ill ever use grok for lolz

fiery gull
torn mantle
#

looks good to me

native yarrow
#

that is kinda cool, it actively deteriorates these weird shiedsl

#

no it works its just i swear space invader had a different style,.,

empty stump
native yarrow
#

^

#

sometimes i have to get shady.,.,

#

anyway, theres little bonus ufos that sometimes streak across the screen thats kinda cool

magic ravine
# native yarrow how do you like the quality?

For grok? It's honestly not too bad. Does paragraphs nicely. And I'm sure with some guiding, could make for some decently written stuff. But it does win points in the uncensored department. So like I could use another model to write up an entire chapter and such, and then ask Grok to give me a spicy scene if needed.

whole swallow
# fiery gull hmmmm

I hope they invent some solid way of LLM collaborating, some groupchat typeshi so they can share the history chat

cloud zinc
#

lmarena got limited context window, so opus looks bad here

native yarrow
spark python
#

pineapple stuff seem to work

flint sandal
#

I WAS SLSEPING FOR 40 MINUTES. I WAKE UP AND OPUS 4.5 DROPS AND ITS BETTER THAN GEMINI 3? i tried to adjust to gemini 3 and i was almost done and now anthropic kills it..

spark python
#

wait

#

new model?

#

from anthropic?

vast fern
#

read this

native yarrow
#

huh, thats pretty cool

vapid elbow
#

Is the image generation slow today or is it just me?

native yarrow
#

its an ongoing issue

vast fern
native yarrow
#

planet focus works

spark python
torn mantle
#

dont give it easy tasks

vast fern
native yarrow
#

even gives info on each planet

native yarrow
torn mantle
#

and different phases

native yarrow
#

everything in this UI works

#

alright

jade egret
#

is polymarket gonna react

torn mantle
native yarrow
torn mantle
#

so you dont have the headache of double checking and re-prompting etc...

vast fern
native yarrow
#

if this actually works

torn mantle
native yarrow
#

i'll be kinda stunned

cloud zinc
torn mantle
#

oh wow

#

first coding prompt

#

better than gemini 3 pro

jade egret
vast fern
#

can anyone try these

#
  1. Ant Colony Optimization Simulation
    Simulate a colony of 500 ants foraging for food sources scattered across a 100x100 grid terrain with obstacles. Implement pheromone trail laying, evaporation over time, and collective decision-making for finding optimal paths. Include ant roles (scouts, workers, soldiers) and simulate resource depletion affecting colony behavior.

  2. Urban Traffic Flow Ecosystem
    Create a city traffic simulation with 10,000 vehicles across 50 intersections with adaptive traffic lights. Include different vehicle types (cars, buses, emergency vehicles), varying driver behaviors (aggressive, cautious), real-time congestion patterns, and simulate how a single accident cascades through the entire network over 24 hours.

  3. Pandemic Spread with Human Behavior
    Model a disease outbreak in a population of 100,000 agents across neighborhoods with varying density, hospitals, and public spaces. Agents should have individual immunity levels, social compliance rates, daily routines, and decision-making about masking/isolation. Simulate how misinformation clusters affect regional spread differently.

  4. Stock Market Multi-Agent Economy
    Simulate a financial market with 1,000 AI traders using different strategies (momentum, value, random, insider), market makers, and regulatory bodies. Include news events that trigger emotional trading, bubble formation, crash cascades, and emergent manipulation patterns. Track wealth distribution evolution over 10 simulated years.

  5. Wolf-Deer-Forest Ecosystem Balance
    Build a predator-prey ecosystem with wolves, deer, and vegetation on seasonal terrain with rivers and mountains. Include animal aging, reproduction cycles, genetic trait inheritance, pack behavior for wolves, herd dynamics for deer, and forest regrowth rates. Determine if the system reaches equilibrium or extinction spirals.

cloud zinc
slim gorge
#

lmaoo polymarket is forbidden in romania

keen beacon
#

🤣🤣🤣

fickle venture
keen beacon
#

Ai is so dumb lmao

dreamy sparrow
#

yeah dude... Claude 4.5 isn't better than gemini 3 at all

#

it got it wrong so hard

native yarrow
#

good ui, design, no buttons work unforutnately

keen beacon
#

LMAO

native yarrow
#

and lets test the same prompt with gemini 3

fickle venture
dreamy sparrow
whole swallow
dreamy sparrow
#

in every way

torn mantle
#

send code

slim gorge
dreamy sparrow
#

dude claude 4.5

#

can

#

think

keen beacon
vast fern
native yarrow
dreamy sparrow
#

: Find the GCD of this series set {n^99(n^60-1): n>1}

dreamy sparrow
pure pasture
#

dude im gonna go mental this is what ive been seening every 2 mins

fickle venture
dreamy sparrow
#

accident

vast fern
dreamy sparrow
#

gemini 3

#

is better

fickle venture
keen beacon
#

The price will say it all

#

If it’s cheaper it’s shet

whole swallow
torn mantle
#

gemini 3 fails as well

#

it only creates like 2 simple animations

dreamy sparrow
#

me when a claude sucker can't see

vast fern
pure pasture
#

@quick jackal what is this is been like this for 3 hours

cloud zinc
keen beacon
#

You know the same you get what you pay for

dreamy sparrow
#

wait I've seen you before

#

wait I've seen you before

keen beacon
#

If the model is cheaper, how is it gonna be better? lol

cloud zinc
obsidian shell
#

Why can't anthropic just go through a normal release?

dreamy sparrow
#

ur the Elon Musk glazer

#

hah

pure pasture
keen beacon
#

Out here we go out the benchmarks

slim gorge
dreamy sparrow
#

I still rememeber you

cloud zinc
obsidian shell
native yarrow
#

if gemini 3 can do this big bang simulation claude 4.5 isnt better

slim gorge
#

ofc they gonna get overwhelmed

keen beacon
native yarrow
#

pretty simple a model can do something the other model can't its usually better

obsidian shell
whole swallow
# vast fern

Wow that’s really cheap. But makes me wonder how much less powerful it is than it could be at full power..

torn mantle
#

nah

#

thgey ciooked with this model

#

at coding

native yarrow
#

gemini 3 just gave me a great big bang simulation

keen beacon
#

It’s the end of the year the end of the fourth quarter

vast fern
keen beacon
#

They all have to release something even if it’s nothing meaningful

cloud zinc
#

openai slacking

cloud zinc
torn mantle
#

lol

obsidian shell
dreamy sparrow
#

SO U AGREE

native yarrow
keen beacon
#

I’m gonna make negative benchmarks

#

Were AI fails on simple tasks?

dreamy sparrow
spark python
slim gorge
#

gg

dreamy sparrow
#

gemini in coding is ass

native yarrow
dreamy sparrow
#

everything else?

#

no

vast fern
torn mantle
obsidian shell
#

Its not as good as people expected.

keen beacon
#

I’ve heard a lot of people say that about Gemini

torn mantle
#

send @native yarrow

#

send file now

dreamy sparrow
torn mantle
#

yes

keen beacon
#

I heard it’s very generic from people on X

native yarrow
torn mantle
#

whats generic

native yarrow
#

see it for yourself

torn mantle
#

opus?

obsidian shell
native yarrow
#

you have to refresh every time you wanna redo the simulation but

keen beacon
#

I don’t know same with Reddit

cloud zinc
keen beacon
#

There seems to be a mixed reaction about Gemini

keen beacon
cloud zinc
vast fern
#

tell the answer

torn mantle
keen beacon
#

Maybe lil more

cloud zinc
native yarrow
keen beacon
#

Jk

native yarrow
#

i still feel gemini 3 is better at coding so far but thats js me

dreamy sparrow
native yarrow
#

ima do a side-by-side

dreamy sparrow
#

the answer

keen beacon
#

Are there like any real developers here that like actually?

#

Know how to code and stuff

dreamy sparrow
keen beacon
#

We need professional opinions

native yarrow
dreamy sparrow
#

In Ai studio

vast fern
torn mantle
#

lmao google

jade egret
keen beacon
#

No, we just add to the speculation

#

We muddy the waters

vast fern
#

i use it for my everyday coding projects

dreamy sparrow
vast fern
#

sonnet 4.5

drifting crow
#

AI is superior at coding than any human

dreamy sparrow
#

lemme show u

jade egret
keen beacon
native yarrow
dreamy sparrow
jade egret
dreamy sparrow
#

completely wrong

drifting crow
dreamy sparrow
#

lmarena doesn't use tools?

keen beacon
#

I’ll take it

#

lol

vast fern
#

non thinking vs thinking

hushed gyro
dreamy sparrow
drifting crow
#

I’m a non thinking human

strong cipher
#

Opus 4.5 or Gemini 3 pro for coding? I dont use opus 4.5 for now

vast fern
keen beacon
#

High-resolution cameras, facial recognition software, state-of-the-art video surveillance centres: Data Sources reveals how Western companies are helping the authoritarian regime in Kazakhstan to create a mass surveillance system.

Facial Recognition: Tech Firms and Surveillance | ARTE.tv Documentary
📆 Available until 12/07/2029

ARTE.tv Do...

▶ Play video
#

That’s in some central Asian country pretty crazy technology

normal abyss
drifting crow
keen beacon
#

🤣🤣🤣🤣

keen beacon
#

Yes

#

And the program only started in 2018

#

I guess that’s a while now

obsidian shell
native yarrow
#

claude

keen beacon
#

What the hell is that supposed to be Google maps or something?

inland violet
#

opus 4.5 or gemini 3 pro?

keen beacon
#

Oh traffic

native yarrow
#

gemini

keen beacon
#

Nvm

vast fern
obsidian shell
keen beacon
vast fern
keen beacon
#

Tf is that

inland violet
drifting crow
#

I mainly just use gpt5 mini for coding stuff

keen beacon
#

Which one made that

obsidian shell
drifting crow
#

Sometimes it overengineers so I use gpt4o

inland violet
vast fern
#

opus

keen beacon
#

Damn that’s nice

vapid elbow
#

image generation is unusable today...

vast fern
keen beacon
#

There’s a little bench in a rock there in a little looks like a little Mario pipe or something

obsidian shell
#

Well...

inland violet
#

ok opus 4.5 or sonnet 4.5

keen beacon
#

All right, I’m gonna run my own test

obsidian shell
inland violet
drifting crow
#

I don’t let ai code everything tho normally I just want it to make simple functions Icba to make so I’m prob not the best benchmark as I want it to do the minimum

dreamy sparrow
#

GEMINI WITHOUT TOOLS SOLVED IT

#

now what

inland violet
#

gemini the best

#

just like always

fleet lintel
dreamy sparrow
obsidian shell
inland violet
#

its 67

drifting crow
#

I think google is in the best position, they have the researchers from Deepmind and they have their own hardware

native yarrow
dreamy sparrow
native yarrow
dreamy sparrow
#

really far from the answer

inland violet
#

waiting for opus 4.5 thinking

fleet lintel
dreamy sparrow
#

ofc u dont know dude nobody knows it

#

lel

fleet lintel
#

ohk.. not surprised that gemini 3 is better here

dreamy sparrow
#

gemini and grok are the ONLY

#

models

#

that got it

fiery gull
fiery gull
dreamy sparrow
fiery gull
dreamy sparrow
#

not the actual model yet

fiery gull
#

will jump 20% of inteligencie

inland violet
#

or just more censorship

fiery gull
#

bruh, I need to be optimistic

dreamy sparrow
inland violet
#

how

obsidian shell
#

You just have to ask the right questions.

dreamy sparrow
cloud zinc
inland violet
#

how to make

dreamy sparrow
inland violet
#

say to me

#

pls

dreamy sparrow
#

yeah uhm

#

a hint is

hushed gyro
inland violet
#

its phishing

dreamy sparrow
#

u need some kind of very very rare stone

inland violet
#

like emerald from minecraft

fiery gull
dreamy sparrow
#

or just buy it

native yarrow
#

claude won again i think

#

:P interesting

hushed gyro
native yarrow
#

it couldnt do a big bang sim but beats gemini two straight

dreamy sparrow
hushed gyro
inland violet
native yarrow
#

Model a disease outbreak in a population of 100,000 agents across neighborhoods with varying density, hospitals, and public spaces. Agents should have individual immunity levels, social compliance rates, daily routines, and decision-making about masking/isolation. Simulate how misinformation clusters affect regional spread differently.

hushed gyro
dreamy sparrow
#

yay

native yarrow
#

its better than whatever this is

#

the sliders dont even work brah 💔

hushed gyro
#

☠️

dreamy sparrow
cloud zinc
#

this is not experimental model

dreamy sparrow
#

remember gemini 2.5 pro previews

#

it was removed and replaced with a new no preview model

#

it was significantly smarter

#

and better

cloud zinc
#

nah, cuz they gonna release in december

dreamy sparrow
#

yeah

#

prob they release it early because

#

they were training it

cloud zinc
#

general release is just a name swap from preview to general release and nothing else.

cloud zinc
#

cuz gemini 2.5 pro was behind. google feels confident in gemini 3 pro preview

dreamy sparrow
#

wait lemme ask gemini

cloud zinc
#

lmao

keen beacon
inland violet
#

gta its grand theft auto lol

#

stupid llms

keen beacon
#

Both failed my safety test

inland violet
#

send me prompt

#

(for tests)

#

((really))

native yarrow
keen beacon
#

Hold on let me test it out real quick

dreamy sparrow
balmy mist
#

is opus 4.5 better than gemini 3?

dreamy sparrow
#

NO

#

NO

#

NO

waxen fern
#

Does lmarena still have the verification

vast fern
#

@echo aurora thinking models when

dreamy sparrow
#

🤑🤑🤑

hushed gyro
hushed gyro
queen veldt
#

Oh niceeeeeee

#

Thanks gemini

spare mango
#

Gemini now engages positively with the hard-R, when previously it only interacted with the soft-R, not the hard-R.

primal orbit
#

are we going to get opus 4.5 thinking in lmarena?

vapid elbow
spare mango
#

I had a feeling from a couple days worth of usage that the model was less censored now, but now I've confirmed it.

fiery gull
#

opus 4.5 never in ranking ? lmarena is paid

#

have 34 respost is good to put it in rank

spare mango
#

I mean, as long as I don't use the hard-R in a hateful way that is, in which case the model is totally justified in refusing to co-operate with me.

obsidian shell
fiery gull
spare mango
hushed gyro
obsidian shell
spare mango
# obsidian shell Ah...

It's far less censored, I can guarantee that, I spend most of my time discussing social issues with it.

#

So I've seen a stark shift.

obsidian shell
#

Although you don't really have to jailbreak models anymore.

fiery gull
#

🤧

obsidian shell
#

You 2 are quite rare 😂

fiery gull
#

nice to meet you hambuguer

spare mango
#

Basically it's a lot more willing to entertain right-wing perspectives.

obsidian shell
spare mango
spare mango
fiery gull
obsidian shell
#

Sonnet 4.5 was a great improvement on that.

obsidian shell
fiery gull
native yarrow
#

GPT was very left wing

fiery gull
obsidian shell
native yarrow
#

it just very obviously gave pretty biased information based on that kind of agenda, i dont care specificallyt that its left wing but that it gave biased info

#

i want it to be completely neutral

#

like, a model

obsidian shell
spare mango
native yarrow
fiery gull
#

the left AI will tell them to release everything for free, and companies will go bankrupt because in the future people will only listen to AI blindly (like me) 😭

spare mango
#

Gemini 2.5 was much more biased than GPT 5.0, but now it's a lot more balanced on Gemini's end.

native yarrow
#

i honestly still dont trust ai for news or info

#

i always look it up after

#

💔

#

hallucinations are a big issue

obsidian shell
fiery gull
obsidian shell
native yarrow
#

i typically tend to trust more in my own research

#

lolz

hushed gyro
#

Should I do more news snippets or nah

native yarrow
#

do a WW2 one

spare mango
#

I mean this is insane.

#

The model is calling the doctors that mindlessly told you to vaccinate yourself, despite seeing side effects from it, "Robot Doctors".

#

Unimaginable for me just a month ago with Gemini 2.5 Pro.

#

And called the practice "medical gaslighting".

#

Feels like I'm talking to Grok, not Gemini.

obsidian shell
queen veldt
#

😂

obsidian shell
queen veldt
queen veldt
native yarrow
#

okay so who do we think is better claude or gemini

golden ocean
spare mango
#

it's supporting the anti-vaxxer narrative, and engaging with the user even if they say the hard-r to the AI.

golden ocean
#

w

native yarrow
#

did they lower the rate limit for nb pro lolz

spare mango
#

sure, it's the fact that it's willing to say that, that I find surprising.

spare mango
#

It shows that it's not being censored top-down.

native yarrow
#

aww man

vapid elbow
#

i think u can only gen 5 pics with nano banana now lol

#

lame

native yarrow
#

that mustve been the fix for the lag

#

thats weak asf.,.,

vapid elbow
#

hopefully not permanent

native yarrow
#

i would ping the mod

#

but

#

meh

echo aurora
#

Ping me for questions/feedback/etc.

native yarrow
queen veldt
#

Someone ran custom bench for the model

vapid elbow
#

Idk, even with the new limit gens take like 120 seconds lol.

native yarrow
#

nah mine was fast

echo aurora
native yarrow
#

its just i think thats what they did to fix the server issues