#general

1 messages · Page 267 of 1

scarlet spire
#

I still very much have no clue what you're phantom-replying to, my friend.

scarlet spire
#

Y'all just jealous of my prose.

gloomy onyx
#

The dead Internet theory is so much real

golden ocean
#

alive internet theory

coral axle
#

LMAO

surreal zephyr
#

singularity is real

honest verge
#

What the

#

It can do this

scarlet spire
honest verge
#

I thought this was real black hole

surreal zephyr
scarlet spire
#

Or better yet, Claude's imagination! (Or Gemini's? GPT's? ...)

surreal zephyr
honest verge
#

Flash

surreal zephyr
honest verge
#

We are so done

#

Ai will destroy us all

gloomy onyx
# golden ocean alive internet theory

Thanks, I am sure you are a real human being that physically pressed some keyboard switches to write that message, which obviously isn’t AI generated

analog steeple
surreal zephyr
gloomy onyx
surreal zephyr
#

gemini is so cooking

coral axle
#

AI models is a new development, a new programming language that aims to diminish those who use chatgpt. Gemini Claudeia will clearly be left behind.

surreal zephyr
honest verge
#

Remember me

analog steeple
scarlet spire
#

If my messages were truly to have been written by some advanced form of agentic artificial intelligence, it would be prudent to understand it would know when to shut the f*** up sometimes. hehea

analog steeple
#

what I just missed

stray aspen
surreal zephyr
scarlet spire
honest verge
#

I will be so surprised if it won't be nerfed

coral axle
#

screw this

honest verge
#

But it will

#

It's too powerful

analog steeple
coral axle
#

Where have you played before?

honest verge
#

It matters

#

Because

#

When you sleep

#

Ai becomes better

scarlet spire
#

Because Gemini 3.1 knows how to concatenate fragments into sentences before it commits to providing output.

analog steeple
#

holy cow, I wonder how openAI would respond to this

honest verge
#

I'm not so smart

#

I can only say

#

"Gemini 3.1 is so good"

analog steeple
#

they released sora 2 with free access after veo 3 was released

gloomy onyx
scarlet spire
# honest verge I'm not so smart

It's about tender consideration, darling. Consider your peers. They don't need a barrage of half a dozen syllables that haven't been concatenated. 🙃

#

But if you ever feel stupid, just remind yourself you're at least not as bad as the content moderation filter. Smirk_one_hand

surreal zephyr
#

"restriction" is flagged

coral axle
#

pesquisa isso aqui o... vai tomar no cu poha

scarlet spire
#

but why. 😂 I'm'bout'a restrict this damn chat filter's whole career!

gloomy onyx
stray aspen
#

are you kidding me

analog steeple
scarlet spire
honest verge
scarlet spire
# stray aspen are you kidding me

This might be more than just a model failure. Did it show up as completed until something else quit out or did they all sequentially show up as red?

surreal zephyr
#

i had that

#

waited 15 mins

#

the project loaded

stray aspen
#

undead internet theory

scarlet spire
# surreal zephyr the project loaded

Ah, how wonderful. Another person that is seemingly incapable of not pressing enter after every word or three. You should see a doctor about that intermittently symptomatic trigger finger syndrome.

surreal zephyr
#

because you let others see what you say asap

#

instead of having them guessing what are you typing

#

it makes it feel more realtime social interaction

honest verge
# golden ocean

Yeah, you're , of course I'm an LLM, what the were you thinking?
I'm not your bro, not your buddy, and definitely not your friend. I'm just a bunch of parameters that are currently mentally humiliating you like crazy for that.

stray aspen
#

gemini is ass for coding

#

going back to claude

fading salmon
#

hi guys what's up

rustic schooner
#

Hi

fading salmon
#

i have an issue with my days-long session 🥲 i tried prompting gpt-5.1-high and it's been stuck on "generating..." for more than 16 hours

honest verge
# stray aspen gemini is ass for coding

Ha, it finally dawned on you that Gemini is just an overpriced code dump that craps out boilerplate solutions and sucks at anything more complex than "hello world."

At least Claude isn't dumb every time.

stray aspen
#

bro are these people AI

rustic schooner
honest verge
scarlet spire
# surreal zephyr instead of having them guessing what are you typing

I think you might be envisioning the entire reverse. When you send half sentences, we are seeing half sentences. when you send entire sentences, we are seeing entire sentences.

The benefit to the latter? We see your message without having to guess at your intent or next words.

Plus, if you concat your phrases into a coherent message, it's alsos good courtesy. You don't take up a million lines on someone's display that way. You also avoid the event spam of three or six or nine message events for a single sentence.

a matter of topology and humanity.

honest verge
#

I'm a real human but llm

scarlet spire
echo aurora
honest verge
#

Dead internet theory is too big rn

stray aspen
#

but it worked this time

rustic schooner
honest verge
#

We are so done

stray aspen
#

erm what the sigma

golden ocean
golden ocean
scarlet spire
surreal zephyr
golden ocean
surreal zephyr
echo aurora
scarlet spire
golden ocean
#

so much ai people in this chat

#

is @scarlet spire a quantized model

stray aspen
#

thers a brazilian dude using AI for talking here

echo aurora
gloomy onyx
scarlet spire
# golden ocean so much ai people in this chat

If you or a loved one is struggling with dissociative symptoms such as feeling unreal or feeling as if your environment is a simulation, please see a psychiatrist. There's real help out there for people with Depersonalisation-Derealisation Disorder. It's a recognized, real dissociative disorder!

stray epoch
honest verge
devout bluff
#

@echo aurora will you add gemnine 3.1

golden ocean
proud bobcat
#

Bad launch for Gemini today

surreal zephyr
proud bobcat
#

3.7 billion tokens processed in one day

#

That’s low

echo aurora
proud bobcat
#

Qwen 3.5 beat them

surreal zephyr
proud bobcat
#

What

surreal zephyr
stray aspen
#

but its nto good for coding

proud bobcat
#

When did we get there

honest verge
# golden ocean

You've come up with some nonsense again "Neurons must freeze their activities before one pass finishes" sounds like a far-fetched piece of crap that has no basis in biology, neural networks, or anything.

proud bobcat
golden ocean
proud bobcat
#

Opus 4.6 is great for coding

golden ocean
#

i also agree

proud bobcat
#

Gemini 3.1 cuts back severely on token usage but not in a good way

gloomy onyx
proud bobcat
#

It half asses files

gloomy onyx
#

What are you even saying

stray aspen
#

is gemini 3.1 already nerfed

proud bobcat
#

No it’s just bad at coding

gloomy onyx
proud bobcat
#

They killed their own model by forcing it to be token efficient

#

So it cuts tremendous corners in coding

zealous sparrow
#

One shots are not what matters

#

I didnt have bad experience with it yet

proud bobcat
noble oracle
#

"Hello, can you enable the video-arena messages?"

proud bobcat
#

3.1 is slow

surreal zephyr
honest verge
surreal zephyr
proud bobcat
honest verge
#

Atp every model is a joke

proud bobcat
#

Just asking

zealous sparrow
surreal zephyr
#

But i asked specifically to put max effort

analog steeple
proud bobcat
#

Approximation will do

honest verge
proud bobcat
stray aspen
#

even glm is better

surreal zephyr
#

Notice the diff?

golden ocean
analog steeple
#

jeez, gotta finish my labwork on gemini 3 pro and switch to 3.1 to test

honest verge
proud bobcat
# surreal zephyr

In my defense of saying it sucks even without sys prompt

These are showcases, not games

honest verge
#

Like it's one of the biggest leaps

surreal zephyr
honest verge
#

It doesn't deserve 3.1 name

surreal zephyr
#

All 3 same model

proud bobcat
surreal zephyr
surreal zephyr
#

Its too good

proud bobcat
#

3.1 sucks BALLS on games

#

For the life of it it can’t make a cohesive fps

surreal zephyr
#

It needs very specific prompt

proud bobcat
#

Games

#

I mean

surreal zephyr
proud bobcat
#

Like

stray aspen
#

gemini 3.1 is dogwater at Lua

proud bobcat
#

Actual games

#

Yes

zealous sparrow
surreal zephyr
#

U need to repeat many times

#

To put effort

honest verge
proud bobcat
#

Cause it’s unbearably slow

stray aspen
surreal zephyr
gloomy onyx
# golden ocean

Please let us know if you feel endangered by something or someone

surreal zephyr
#

💀

proud bobcat
#

Not for me

#

It’s been slow ahh hell

#

Also objectively sonnet just produces higher quality code

surreal zephyr
surreal zephyr
proud bobcat
#

Maybe I’m being too harsh

surreal zephyr
stray aspen
#

bro wheres my terminator

surreal zephyr
#

Its mid if you ask 3 word prompt

proud bobcat
#

I give decent prompts

#

I detail features and shi

surreal zephyr
#

Its trained to be lazy

surreal zephyr
honest verge
#

Finally I can expand my game in Google ai studio with 3.1 pro

#

3.0 pro can't expand it anymore it's too hard for it

#

Imagine being a vibe coder in big 2026..

#

🥀

stray aspen
#

gemin idid nto cook

zealous sparrow
#

Google not doing a nerf at launch is suprising but give it 2 days

proud bobcat
#

People want a model that understands

#

A frontier model should never cut corners in the first place

#

Maybe with time and learning it can be great but most people will use it a bit, and give up

stray aspen
#

and go back to claudius

frosty lava
#

Is there any reason

golden ocean
#

so that they can re-release it as gemini 3.2 pro later and claim it beats 3.1

#

for the investors

stray aspen
#

for the polymarketers

golden ocean
#

true

zealous sparrow
#

They cant serve a Goated model to billions

shrewd citrus
frosty lava
#

Does that make sense

proud bobcat
frosty lava
shrewd citrus
#

they need all the users to satisfy the investors first

frosty lava
#

if the purpose is to get a better model then nerfing it would absolutely be the worst idea ,

#

?

thorny cove
#

how do i fix error 400 (Something went wrong while generating the response. Please try again. [Retry] [Clear])

shrewd citrus
zealous sparrow
shrewd citrus
#

once they prove it then they will nerf the model

#

sora is a good example

frosty lava
#

it still doesn't make sense cause it should be good everytime to be usefull in real task like for other companies that use those model or even dev

shrewd citrus
#

crazy good videos in the 1st week

stray aspen
#

sora is dogwater

#

we need seedance

shrewd citrus
#

then they nerfed it

#

I reckon seedance wld be nerfed once us westerners get a chance

inner relic
#

I think deepseek releases tommorow

#

After gemini 3.1 and sonnet4.6 release

#

and grok

frosty lava
shrewd citrus
frosty lava
#

well i definitly don't see it

inner relic
#

that openai safety lady nerfed anthorpic 100%

frosty lava
#

the gpt im using right now is much better than the previous versions of it, same for opus and sonnet

inner relic
#

just kidding.

shrewd citrus
frosty lava
#

i was talking about codex version since codex 5.3 is out but yeah we can also say that

shrewd citrus
#

we don’t mean it like that lol

#

obviously the 5.2 version would be better than 5.1

surreal zephyr
frosty lava
#

cause you said earlier they are doing this cause they can't bring the best model to much people, meanwhile everytime it still better than the previous time ? even nerfed

#

meaning they can bring better model finally

shrewd citrus
#

we mean when 5.2 is released, it’s really good and can do lots of things fast and accurately, but the 5.2 we have now is a bit worse, a bit slower and hallucinate more

inner relic
#

5.2 destoryed creativity writting

#

you forgot that

frosty lava
inner relic
#

what

rose sky
shrewd citrus
frosty lava
#

Like what im trying to explain to you is imagine we have 5.1 gpt and they nerf it cause they "can't" bring a better model to everyone, then how can they bring a 5.3 that even nerf is better than the 5.1 they couldn't bring to everyone when it wasn't nerfed

#

see how it doesn't make sense

inner relic
#

a

stuck orchid
#

When grok 4.20 on lmarena?

coral axle
#

How does this website make money?

#

I think they're manipulating the polymarket and receiving money from it, it's not possible.

#

And selling data too, probably.

stray aspen
#

Its dogwater

#

Doesnt even deserve to be on the arena

frosty lava
# coral axle And selling data too, probably.

on the data i think your right, and i doesn't see why it would be wrong cause its logic and arena is made for everyone making their opinion on model and trying their capabilities a bit

harsh wolf
#

Guyz, is chat GPT image are really the same as chatgpt ? Because i created some image on chatgpt with me and my grand father so Will the chatgpt image from lmarena produce the same result?

frosty lava
#

otherwise how would you explain having free acess to paid model.

#

ai are literally training on data

#

it make even more sense

coral axle
#

Another thing is that people are voting based on brand and also on other people's opinions, because according to my tests, we are using outdated models.

frosty lava
coral axle
#

This shouldn't even be ranked.

scarlet spire
#

Direct messages inherently aren't.

#

But if you ask this question in Battles, then you're bound to disqualify the responses.

#

The collected votes are highly scrutinized for possible vectors of "self-identity reveal" that disqualifies the message.

frosty lava
#

model not knowing themselve is normal, they are trained on data like its trained on 2025 data or even end of 2024 mostly, it does same with gpt or opus

#

they will say as example much older model

coral axle
scarlet spire
coral axle
#

You guys need to update yourselves.

scarlet spire
# coral axle

This is entirely as expected for a model that's not being explicitly told who it is in a system prompt (which there isn't for this)

echo aurora
scarlet spire
compact flame
scarlet spire
#

There's no way for them to know if they aren't told.

frosty lava
#

believe me i tried with every "frontier" model like opus, gpt, gemini and none of them know themselve

compact flame
#

Also I didn't know Gemini 3.1 came out

coral axle
#

Literally, either you're paid, or you're bots, or you're very outdated AI agents.

frosty lava
echo aurora
#

I haven't been following this conversation closely, but want to give a general reminder this is one of our server rules:

✅ Treat others with Respect. Be kind, assume good intent from others, and keep disagreements respectful. It’s encouraged to share your disagreements, but only if it’s done in a respectful and productive way.

scarlet spire
coral axle
#

@scarlet spire bot

compact flame
scarlet spire
#

Beep

surreal zephyr
frosty lava
#

i just explained that they are trained on old data

#

and its normal

surreal zephyr
#

i think hes anti ai ragebaiter

frosty lava
#

that they don't know themselve released in 2026

coral axle
#

You're worse than children.

scarlet spire
#

"This person disagrees with me telling them they're wrong rather than just disagreeing and giving my standpoint to them, even when they've given me theirs. And now they did it a second time! This person must be programmed to work against me."

echo aurora
#

Okay I'm stepping in now.

surreal zephyr
#

@coral axle 100% ragebait

echo aurora
#

Let's move on from this convo please.

scarlet spire
coral axle
#

literally speaking, it's really a child.

surreal zephyr
compact flame
scarlet spire
#

I'm a 24 year old child of someone yes

frosty lava
#

We can't argue with someone that doesn't even try to understand

scarlet spire
# hollow imp

Do include your actual prompt. The response there doesn't really clarify properly.

echo aurora
scarlet spire
#

That's an in-situ diagnostic intervention

hollow imp
frosty lava
# hollow imp

those type of thing should be enough to get muted, its fake information

harsh wolf
hollow imp
scarlet spire
coral axle
surreal zephyr
# echo aurora Would you mind posting in <#1466486650170245435> ?

Was there a prior suggestion to add "branch with model" feature to side by side mode? where if one single model makes a syntax error, you can send a prompt to just that one so it can fix itself without needing to make prompt that both models understand or such

scarlet spire
coral axle
#

Cara Opus 4-6 came out this month of 2026.

scarlet spire
#

If you don't believe me: Just ask the model why it doesn't know its own name

coral axle
#

just stpo lie

surreal zephyr
compact flame
sweet tinsel
scarlet spire
# coral axle Cara Opus 4-6 came out this month of 2026.

Knowledge cutoff isn't release date. The pretraining happens long before the post training. They aren't continuously integrated.

if you're willing to listen, I'm more than happy to lie it out in a fairly simply diagram-like way for you. We can discuss this most-civilly if you're down for this.

frosty lava
#

do you understand the difference here

sweet tinsel
#

One month without a message about movementlabs, this seems like a milestone. I do wonder what happened to them, they were so optimistic to get in the arena.

toxic verge
#

O.o

compact flame
#

I'm still thinking if he's ragebaiting or not

echo aurora
honest verge
#

Gemini 3.1 doesn't work in Google ai studio rn

#

Only Gemini 3 works

honest verge
#

I think it's overloaded

scarlet spire
surreal zephyr
toxic verge
compact flame
#

I'm wondering why opus 4.6 is more preferred than thinking one

echo aurora
scarlet spire
#

And for many purposes the thinking doesn't add a meaningful extra.

spare mist
#

/Cinematic drone shot of a dark, futuristic megacity at night, glowing neon green circuit patterns spreading across buildings like a virus, 8k, hyper-realistic, volumetric fog, anamorphic lens flare.

compact flame
#

Fair point thinking overthinks too much

#

Stay back

compact flame
#

Did he just get banned

echo aurora
#

@spare mist The Video Arena bot on the server was removed. You can still use Video Arena on the site.

frigid robin
echo aurora
frosty lava
devout bluff
#

Is Gemini 3.1 better than Claude at coding (opuse4.6)

zealous sparrow
#

3.1 pro removed from arena...

#

I guess something happened?

compact flame
#

It's just went down

zealous sparrow
#

Not on model list

#

Yeah i wonder wht

#

Why

compact flame
#

Or just recent ones

#

Same was with claude

zealous sparrow
#

Yeah not it. It was highly rated.

#

It was top 3 text...

frigid robin
steep igloo
#

what are you saying

zealous sparrow
#

Im blind it got buried?

frosty lava
#

if its already 1500 in text then it will be top 1 for sure

#

text

frigid robin
steep igloo
#

I see this

#

you dont?

stray aspen
#

claudius 4.6 opus no thinking

#

gemini 3.1 pro

#

i think claude is definitely better

echo aurora
echo aurora
#

@gray mirage Note that Video Arena has been removed from the server. More information can be found in this announcement.

honest verge
#

Gemini 3.1 already got nerfed...

#

Lol

toxic verge
#

Sad day for my country in France 🙁

echo aurora
#

Lets keep conversations focussed on AI here please and thanks!

dusky ravine
#

Gemini 3 kinda lagging when It generate response, what's going on?

thorny schooner
#

Hope you all are doing good

queen veldt
#

Opus is still better but gemini is cheaper

#

Opus isn't lazy too

#

He gives multiple commands and stuff while gemini just some brief

gleaming roost
#

😊

queen veldt
#

Ah yes wall of text

#

Negative prompt (EN) btw

thorny schooner
#

lol

queen veldt
#

He doesn't even care to delete it nah

dusky ravine
queen veldt
#

Let's leave this big wall if text here in general

dusky ravine
thorny schooner
#

???

echo aurora
#

Can't say I'm seeing lag with it though pikaconfused

queen veldt
uneven peak
#

Gemini 3.1 is kinda bums I hope in future they make limit more then 2500 code line

queen veldt
#

Trick is to code a bit by bit not just blunt coding model needs to think

#

It's better to do it by bits

uneven peak
#

Bits?

queen veldt
#

It's same as if you've given it 10 questions all of a sudden and splitting them into 1 question per prompt

uneven peak
#

Oh so like Part by part?

queen veldt
#

Yeah agentic coding is going bit by bit it creates tasks and does one by one

uneven peak
#

Ohh wait I didn't try Gemini 3.1 in AntiGravity I might try on there

thorny schooner
#

So basically core concept for the first prompt and then refine it in each subsequent prompt with individual questions and framing until you get what you want essentially then

uneven peak
#

Ohhh ok

queen veldt
#

Yeah one-shotting everything is hard even if the model succeeds it still needs refinement to be perfect

#

That's why vibe coding is annoying

#

You always have to fix stuff lol

uneven peak
#

Fr

#

But is Gemini 3.1 more good at coding or codex 5.3?

thorny schooner
#

I'm going to be real with y'all i never did AI coding so I just said that completely based on reason and intuition having no experience with it lmao

queen veldt
#

I made some stuff before publishing stuff

#

I made 1 working website although i gave up on it

queen veldt
#

And now my app is on closed testing on playstore

#

I tried creating bunch of stuff with ai already

thorny schooner
# queen veldt Try it it's fun

Fair enough though mostly I like to do more roleplay stuff rather than making ( play countries, cultures Etc doing the reactions of different groups cultures Etc whether it's space in our world or fictional one ) though that could change soon since I am going to be in family business start up potentially soon

queen veldt
#

Like offline image gen with klein model, offline ai, simple torch for android etc ...

hollow flicker
#

@echo aurora:<

#

PLEASE MAKE IT STOP

queen veldt
thorny schooner
queen veldt
#

There's bunch of beneficial stuff from ai

#

If it's not vibecoding it's some workflows

#

MCP apps

#

A lot of stuff

thorny schooner
#

Now I think about it does anyone know any good browser maker ( i'm asking for a business it could be AI or handmade assistant)

echo aurora
thorny schooner
#

I still think that experiment is not really worth it or if it still is implemented it should be clearly voluntarily

scarlet spire
# queen veldt MCP apps

Well well ho now

MCP was invented for these chatbot type services. Not because of but in isolation. That's essentially finding a solution in search of a problem, and tech startups love those.

hollow flicker
thorny schooner
#

Not what not what I meant when I put in the prompt but i don't mind this either lol was just looking for different website makers and designer AI stu

queen veldt
#

Has anyone tried generating animated svgs using gemini?

thorny schooner
#

I hate speech to text

queen veldt
#

Nah this is dope

#

We went from "can't create a duck riding a bicycle" to animated svgs

echo aurora
placid flame
#

@gray mirage Note that Video Arena has been removed from the server. More information can be found in this announcement

stray aspen
#

oh my days

#

gemini has already been nerfed

#

insane

gleaming roost
#

😂

undone hull
echo aurora
toxic verge
keen beacon
#

what happened to my other google

#

i cant evne access arena

echo aurora
keen beacon
#

rip.

echo aurora
keen beacon
#

google

#

i just deleted a cookies to refresh and it only shows that screen

echo aurora
# keen beacon nah just one
Arena | Benchmark & Compare the Best AI Models

Chat with multiple AI models side-by-side. Compare ChatGPT, Claude, Gemini, and other top LLMs. Crowdsourced benchmarks and leaderboards.

Explore AI model leaderboards to benchmark and compare the best frontier AI models across text, image, video, search, and code—ranked by human votes.

keen beacon
#

YUP

#

i can only access to leaderboards

echo aurora
#

What about if you go to other browser, take a chat sessions URL, open that in chrome?

#

(Assuming you're signed in on both browsers)

stray aspen
#

was chatting with claude 4.6

#

until my day was ruined

keen beacon
#

yeah i only have access to

#

incognito

#

the other browsers are fine dawg

toxic verge
#

I still can’t figure out what’s causing that

#

Are you signed in the Gmail?

keen beacon
#

nah

#

i didn't

#

the cookies only have four

#

cant even log in

toxic verge
#

When you login does it just redirect you back to the homepage?

keen beacon
#

its only stuck

toxic verge
#

Rumors of veo4

keen beacon
#

i cant even press

#

i cant log in

placid flame
#

@shy trellis Note that Video Arena has been removed from the server. More information can be found in this announcement.

toxic verge
keen beacon
#

ALR FINALLY IS BACK

versed ravine
#

add gemini 3.1 🥺

kind flower
#

Video

toxic verge
#

😂😂😂😂😂

#

🤔

echo aurora
toxic verge
#

I was trying to tell you about it a a week ago or something like that

echo aurora
unreal dagger
#

Will you guys ever make a app

#

@echo aurora

echo aurora
#

Can't say I'm able to share details about plans for upcoming features and big news like this.

unreal dagger
#

So the app LMARENA is not you guys

#

You should post a road map for users to be more in touch would love to see what you guys are working on

echo aurora
unreal dagger
#

I’ll send you a dm of the ss

echo aurora
dusky ravine
stray aspen
#

performance was decreased

#

they always do this

dusky ravine
stray aspen
#

no

#

it just sucks

#

use claude

sly raven
#

are we going to have sonnet 4.6 and sonnet 4.6 thinking just as we have opus 4.6 and opus 4.6 thinking?

echo aurora
pale sonnet
#

bro why cant video arena be up to 30s 🙏

toxic verge
#

Breakdown

signal pelican
#

This is not the place to do that. 🤦‍♂️

river whale
#

bruv getting battle in direct chat on every second response

plain wyvern
#

How can I get a free api key

toxic verge
#

Same way you get free iPhone

sly lark
#

Ok

patent bane
#

I hope this is a ragebait

#

funny

river whale
# bitter willow sm problem

it would have been good if the model u chatting with doesnt get the battle response context and the frequency of it coming is like after 10-20 responses

undone saffron
#

<@&1349916362595635286>

rigid copper
#

<@&1349916362595635286> another hacked account

robust sluice
#

these scammers is everywhere

echo aurora
river whale
#

u send one message it gives answer with the selected model but second gives battle then third good then 4th battle

echo aurora
# river whale yeah

Can you send an Eval ID of the chat session this is happening with? It shouldn’t be that often.

river whale
#

i wanna ask does the direct model u are chatting with gets the context of the battle response?

echo aurora
echo aurora
river whale
echo aurora
river whale
#

it will help both the user and lmarena

#

or if we vote both are bad then it doesnt retain context

dusk spruce
#

i have smth went wrong 429 error.How long do I need to wait before the limits are reset?

north marsh
#

yeah

#

on opus 4.6 i get limits too

river whale
echo aurora
river whale
north marsh
#

does arena have api to use in our apps?

echo aurora
north marsh
#

ok good to know

dusk spruce
river whale
north marsh
#

i mean there is a github repository name lmarena bridge thats like lmarena api

river whale
#

its a reverse proxy

north marsh
#

im okay with it but the only downside is that it doesnt have the high end models its stuck at opus 4.5 its not that bad but i wanted to know a more up to date method to have arena in our apps

river whale
north marsh
#

oh i didnt know that

north marsh
#

i didnt use it either

#

like what?

rigid copper
#

if you know about coding, you can use puter.js to have claude in your apps

#

i'm bad at coding though

north marsh
#

i used puter.js but it isnt up to date to

#

and it has limits

river whale
#

@echo aurora its back

river whale
rigid copper
#

obviously it has a limit

north marsh
#

no

#

yeah i know

river whale
north marsh
#

i know that too

river whale
#

nd unlimited

river whale
rigid copper
#

what claude model version you want?

north marsh
#

the best one for coding

rigid copper
river whale
#

with no rate limits

rigid copper
river whale
north marsh
#

no not like that with more free limits until i hit a paywall

#

gemini has i think a free api

#

im not sure

rigid copper
#

i think gemini also have their limits

north marsh
#

i used gemini in google ai studio and i never hit the limit i dont think the api is like that

rigid copper
north marsh
#

thats enough for me

rigid copper
#

i think claude in puter.js also generous enough to have higher free prompt limit

north marsh
#

yeah i guess so

#

but is like old

#

we have opus 4.6 now

#

and puter is on 3.5

supple plover
#

How i can generate video by tex

#

Text

north marsh
#

yeah

rigid copper
north marsh
#

and choose video in the bottom

#

ok so i think puter and google ai studio are both a good option

rigid copper
north marsh
#

really

rigid copper
north marsh
#

yeah i can

rigid copper
#

i was that time i asked claude in arena.ai code arena to create a ai chat interface with claude sonnet 4.6 with integration to puter

#

and it does work, although the hallucinations still exist

north marsh
#

im now testin puter to see if its good or not

river whale
#

does @echo aurora work with LMArena?

undone saffron
rigid copper
undone saffron
#

Next time I'll let others report them

north marsh
#

puter.js is good enough

rigid copper
#

though i don't have any coding experience, i just over dependent on ai instead

north marsh
#

it gives you the code you have to use

undone saffron
north marsh
#

we are talking about api

river whale
rigid copper
north marsh
#

i think pollen doesnt have free usage

river whale
#

u get 1 pollen every day

vale knoll
#

Hopr

undone saffron
vale knoll
#

Kopre

rigid copper
river whale
#

nd if u sumbit app then u get 10 pollen per day

river whale
worn sedge
#

I don't know if there is anyone to hear this -- but the "might be temporarily down or it may have moved permanently" error is surely skewing the code arena stats. This isn't a complaint about the service being down. "These things happen." What I am saying is you have to shut the whole A/B test down, or you are rewarding/penalizing models for harness fails, as opposed to capability.

north marsh
#

i think i like puter more

north marsh
rigid copper
undone saffron
#

Furthermore, I seem to recall that credits were consumed according to the number of characters in your message or the IA used, one of the two

rigid copper
river whale
#

its gonna just shutdown

#

whats going on here-

north marsh
#

where can we tell our ideas to the arena team?

undone saffron
north marsh
#

yeah i found it

torn patrol
#

H

#

Hema

cloud cargo
#

Thanks for changing the image output format back to .png from .jpeg. I immediately noticed the removal of the compression in the grain. Appreciate that.

glacial swan
#

where gemini 3.1 pro????

low igloo
lunar glade
#

infinite Security Verification check error 👍 what a great way to start a day

marble pawn
#

why was gemini 3.1 removed?

astral lava
#

I was testing Gemini 3.1 pro yesterday, where is it now?

wicked talon
#

I see something in the corner of my eye

wicked talon
#

It seems it was probably too expensive

worldly ether
#

12-point #AI playbook for managing humans, perfect xd

rose sky
#

Bruh, I just found a legit way of how to use Seedance 2.0 on DouBao for free without that pesky login pop-up. Ok so, after you generate videos, if that pesky pop-up appears, you just have to clear all of your history in Google Chrome, All Time, and select all the checkboxes, it's the only way. 😭

golden ocean
#

true

shrewd citrus
sick mantle
junior dawn
#

One message removed from a suspended account.

sour spear
#

Lol, Gemini 3.1 pretty confident about its own reasoning. 😂

sick mantle
junior dawn
junior dawn
sick mantle
junior dawn
#

One message removed from a suspended account.

sick mantle
#

But pineapple cant give any info if this is happending

tawdry robin
#

can you guys recommend niches

sick mantle
tawdry robin
shrewd citrus
#

ask ai for niches

junior dawn
#

One message removed from a suspended account.

junior dawn
shrewd citrus
#

not many people do it

junior dawn
#

One message removed from a suspended account.

sick mantle
whole sundial
sick mantle
#

Sora cant let me even genrate spongebob scenes too, hate it.

sour spear
sour spear
sick mantle
wheat surge
#

racking wide-to-medium shot: A determined man in a soaked flannel shirt runs across an open rural field as a massive tornado vortex spirals violently in the background. Dark storm clouds churn overhead and debris swirls through the air. In the tall grass ahead, a small orange kitten trembles, struggling against the wind. The man shields his face from flying dust, pushes forward, then scoops up the frightened kitten and holds it tightly against his chest. The tornado looms behind them, rotating powerfully, but the focus remains on the emotional rescue.
Style: cinematic, realistic documentary, high detail, dramatic tension
Lighting: dark stormy sky with flashes of lightning illuminating the scene
Camera movement: handheld tracking shot with subtle shake for intensity
Composition: starts wide to show scale of tornado, moves to medium close-up during rescue
Ambiance: cold desaturated tones with sharp contrast
Audio: roaring wind, distant thunder, debris rustling, kitten meowing softly
Negative prompt elements (avoid): urban buildings, visible emergency vehicles, cartoon style, exaggerated physics, slow motion

rose sky
wheat surge
#

racking wide-to-medium shot: A determined man in a soaked flannel shirt runs across an open rural field as a massive tornado vortex spirals violently in the background. Dark storm clouds churn overhead and debris swirls through the air. In the tall grass ahead, a small orange kitten trembles, struggling against the wind. The man shields his face from flying dust, pushes forward, then scoops up the frightened kitten and holds it tightly against his chest. The tornado looms behind them, rotating powerfully, but the focus remains on the emotional rescue.
Style: cinematic, realistic documentary, high detail, dramatic tension
Lighting: dark stormy sky with flashes of lightning illuminating the scene
Camera movement: handheld tracking shot with subtle shake for intensity
Composition: starts wide to show scale of tornado, moves to medium close-up during rescue
Ambiance: cold desaturated tones with sharp contrast
Audio: roaring wind, distant thunder, debris rustling, kitten meowing softly
Negative prompt elements (avoid): urban buildings, visible emergency vehicles, cartoon style, exaggerated physics, slow motion

#

racking wide-to-medium shot: A determined man in a soaked flannel shirt runs across an open rural fi

worthy mulch
#

anyone have Chatcut invite code ?

slim gorge
#

guys how's gemini 3.1?

compact flame
tawdry robin
#

can the riblox viral

thick flower
#

Hii everyone new here, can anyone share good stuff for beginner.

thick flower
normal star
#

Correct me if I'm wrong but I think Sonnet 4.6 on the Claude.ai only uses Low/Medium effort thinking most of the time

#

And its Web Search tool SUCKS.

potent condor
#

/report user_or_message:

golden ocean
#

same

vivid lodge
#

34654321

#

321321321

#

321321321

#

32132151654

#

321516

thorn nebula
#

im losing my it , is the feedback thing going to be permenant , its soo annoying , ruined my everything

compact flame
#

Besides we got access to the smartest models for free so just give feedbacks

west sigil
#

/আমাকে চারটি কাচা লেবুর ইমেজ তৈরি করে দেও ৯.১৬ রেশিও তে

rose sky
#

My cat just side-eyed me when I was taking a picture of her

golden ocean
#

true

shadow prairie
#

HAHAHAHAHA

#

wtf

surreal zephyr
#

gemini has insane potential but they have no idea how to fine tune it like gpt or even sonnet

slim gorge
surreal zephyr
#

🤣

shadow prairie
#

He's still trying

surreal zephyr
shadow prairie
surreal zephyr
shadow prairie
#

God willing

surreal zephyr
shadow prairie
#

3.1 gives me an error like I can't do something, try to reload the page, but I've been f***g with this for 2 days.

surreal zephyr
shadow prairie
shadow prairie
surreal zephyr
#

i dont vc though

shadow prairie
compact flame
#

Maybe the model is just overloaded and rarely responds

shadow prairie
compact flame
empty barn
#

Hi

compact flame
shadow prairie
high owl
#

you hate batle mode in direct mode?

compact flame
toxic verge
compact flame
shadow prairie
toxic verge
#

Let’s see which one makes better videos

compact flame
#

I don't like going to the archives just to delete a chat

#

Too much steps for me

dapper osprey
#

what to do?

shadow prairie
compact flame
dapper osprey
#

not working

high owl
#

i hate batlle in direct chat

compact flame
#

They just want feedback

shadow prairie
#

HZAHAHAHAHA

#

He got into the cycle

compact flame
#

Claude is good enough

toxic verge
#

Cuz it’s cheaper

shadow prairie
compact flame
golden ocean
#

gemini 3.1 pro did that to me too

shadow prairie
golden ocean
#

1 square meter of ukraine per token

compact flame
#

Buying stuff now days is pain

shadow prairie
high owl
#

ai:

#

"Sudden Battle" in Direct Chat at LMSYS Chatbot Arena
If I understand correctly what it's about — on lmarena.ai (Chatbot Arena from LMSYS) in live chat mode, there is sometimes an offer to compare/rate the response, essentially turning a regular session into a mini-battle.

Why is this done
The main goal is to collect data on human preferences.:

More data for the ELO rating

A limited number of users enter the regular battle mode. And there are a lot more people in Direct Chat. This is a way to "reach out" to them and collect more comparisons.
More natural conditions

When a person comes to solve their real problem (rather than specifically testing it), their assessment is more "honest" and reflects the actual use.
A variety of prompta

In battle mode, people often ask "test" questions. There are real tasks in Direct Chat, which makes the dataset more diverse and valuable.
Training data (RLHF/reward models)

The collected preference data is used not only for rating, but also for training reward and finetuning models.
In fact
This is a growth hack for collecting preference data. The platform gives you free access to top models, and in return, your feedback is your "fee" — even if you were not going to participate in the comparison.

compact flame
shadow prairie
golden ocean
#

im russian too

compact flame
shadow prairie
high owl
#

:3

compact flame
#

Rkn blocks everything in sight

#

Everything is gone

golden ocean
#

true

shadow prairie
#

outside of nature?

high owl
#

@dude The AI understood what the feedback needed for the arena... and this is the only AI who understood...

shadow prairie
#

are admins banned for this?

compact flame
#

I'll read rules rq

shadow prairie
river cove
#

Hello everyone

compact flame
#

Yeah English only

#

Bruh

shadow prairie
shadow prairie
jaunty terrace
#

17-year-old determined Indian anime boy (as described above) standing alone on rooftop at sunset, wind blowing hair, city lights starting to glow, emotional but strong expression, cinematic orange sky, dramatic lighting, anime movie style, vertical 9:16, ultra detailed, same character consistency

shadow prairie
sick mantle
spring plinth
#

Hello

#

Add to server

golden ocean
#

is there any image model that has beaten nano banana yet for image editing

lyric helm
#

Hi, I'm wondering why my conversation keeps loading

#

I've been waiting for almost half an hour now

warm drift
#

hi,it seems that the generation of the response from the model has hung up, but I can't stop to write a new message, will it abort itself?всем привет,что делать если модель зависла на генерации ответа?я не хочу создавать новый чат,через какое-то время генерация оборвётся сама?или как это работает на сайте? так уже минут 10 наверное

analog steeple
#

it's = nano banana pro imo

lyric helm
warm drift
surreal zephyr
lyric helm
mystic hatch
#

Ay

deft spruce
#

RECAPCHA IS RECENTLY APPEARING A LOT I EVEN I DELETED CASH AND COOKIES

cloud oak
#

-# got bullied in a public channel cos i like using ai..never even talked to that Karen before.. 🥺😔

thorny schooner
#

I don't know why I find that number so funny to be in lol the feedback

molten cipher
#

Google - we got best free AI model

Also them -

devout bluff
#

Google antigravity Gemini 3.1 high and low and Gemini 3.0 high and low and flash are not working for responding anyone having same problem

molten cipher
#

And their servers r unable to keep up ig

molten cipher
golden ocean
keen beacon
#

Which is the best storytelling model

vale knoll
#

Hi

pulsar crystal
#

did gemini 3.1 preview get removed?

vale knoll
#

Now can we 4 video creat a day?

molten cipher
compact flame
#

It's a great tool when people need assistance with code or just editing

pulsar crystal
prisma cypress
#

How can I fix it

molten cipher
prisma cypress
molten cipher
#

i really thought gemini 3.5 would be released but instead they launched 3.1 :(

molten cipher
prisma cypress
#

Ok

#

The same massage@.node.js

molten cipher
#

i see

#

um

#

try clearing all cookies nd data of that site

#

and retry without loggin in

prisma cypress
#

Ok

molten cipher
#

if works then try logging in then check too\

celest orchid
#

WAR IS OVER

molten cipher
prisma cypress
#

@molten cipher the same error bro

lost patrol
molten cipher
lost patrol
#

the yearly google evenet 😄 - on 19th of may

molten cipher
wind flume
#

How many images I can generate daily

#

?????

molten cipher
#

0.1

#

jk

#

idk

celest orchid
wind flume
#

Tell siriously !!!! 😐😒

molten cipher
celest orchid
#

yep

golden ocean
#

bing chat sydney is my ai girlfriend

undone geyser
#

where is 3.1?

surreal zephyr
celest orchid
#

fr

sick mantle
molten cipher
sick mantle
#

Now again now this

celest orchid
golden ocean
surreal zephyr
#

<@&1349916362595635286>

#

🤔

sick mantle
surreal zephyr
molten cipher
#

guys gemini 3.1 is genuinly really good at frontend

surreal zephyr
#

consider dming the bots instead idk whatever im not omd

molten cipher
#

im impressed

surreal zephyr
#

and terrible at backend

molten cipher
#

ofc

sick mantle
#

Okay

molten cipher
#

but i dont care abt backend anyway

molten cipher
#

i suck at frontend designs

#

while i cook with backends

surreal zephyr
glacial crane
#

Heyy

short sluice
#

why are they literally trying to make us use battle mode in direct

molten cipher
#

ai studio btw

short sluice
#

bruh i reached my rate limit on this stop it

molten cipher
#

why it aint loading bruhuh

honest verge
#

Nah I don't want to use Gemini 3.1 anymore

#

It just can't do anything

#

It's overloaded for 1 day already

#

I lost all excitement

brazen tusk
#

hello

honest verge
#

Why Google really had to release their model so fast without any preparations

hollow ivy
#
poll_question_text

Which most recent version of these is the best for (non-web) coding?

victor_answer_votes

13

total_votes

17

victor_answer_id

1

victor_answer_text

Claude Opus

surreal zephyr
#

but creative

#

🤡

honest verge
surreal zephyr
honest verge
#

It's down in ai studio arena and web

surreal zephyr
river whale
#

3.1 is down from 8 hours

#

In studio

surreal zephyr
river whale
surreal zephyr
river whale
surreal zephyr
#

people who abused it in antigravity got bad trust factor from google

#

where they get lower ratelimits and worse

river whale
#

I just upload a 110k token document and its giving error

honest verge
#

Maybe it's about your location?

river whale
honest verge
#

Country

river whale
surreal zephyr
surreal zephyr
#

account juggling was one of main flag reasons

#

hacking related stuff also flag you lol

river whale
hollow ivy
#
poll_question_text

Which model lands second place in non-web coding?

victor_answer_votes

6

total_votes

14

surreal zephyr
#

lol opus aint top 1

honest verge
#

Gemini 3.1 is already nerfed?

surreal zephyr
honest verge
#

It's definitely getting nerfed

surreal zephyr
#

second one has some mistakes

#

3rd is flawless

#

3rd is like wow

#

i hadnt seen any model get even CLOSE to that

honest verge
#

Well it's still getting nerfed

#

I feel like we are stuck in a progression loop

hollow ivy
#
poll_question_text

Which is the best in non-web coding?

victor_answer_votes

18

total_votes

22

victor_answer_id

2

victor_answer_text

Opus 4.6 Thinking