#general

1 messages Ā· Page 271 of 1

river whale
#

from using kivest ai

spare rune
#

and how exactly.. do you plan to do that

river whale
jovial bison
#

Btw is there any other place to use Gemini pro for free?

river whale
#

3.1 pro is free there

jovial bison
#

Do u need to write ur credit card there?

river whale
#

just login

jovial bison
#

Hmmm

jovial bison
bleak minnow
loud verge
#

Is goolag glitching?

#

Bro google models are the best for location related queries 😭

jovial bison
#

I guess because I have to use vpn

crystal mica
#

i got 429 error

toxic verge
#

It’s backend issues

hushed gyro
echo dome
#

in floorp browser (firefox-based) recaptcha still here

#

maybe they need to add other logins to login than just login to google

burnt sinew
#

Pool? My site has pool

#

And you know it šŸ˜„

echo dome
#

there is error about it
"reCAPTCHA V2 token timed out"

toxic verge
#

Clear browser data

echo dome
#

its worked by clearing browser data

#

recaptcha just infecting cookies

plain gull
#

did you make it yoself or had it made for you

undone geyser
#

Error during image generation with google-genai for model endpoint gemini-3-pro-image-preview: Failed to fetch image: 429 Too Many Requests - [{ "error": { "code": 429, "message": "Resource exhausted. Please try again later. Please refer to https://cloud.google.com/vertex-ai/generative-ai/docs/error-code-429 for more details.", "status": "RESOURCE_EXHAUSTED" } } ]

sometimes i retry and works others not, why?

violet trout
#

Can I DM you too so you can send me the link?

toxic verge
crystal mica
#

ernie is literally reverse engineered from gemini btw

elfin sail
#

I have the same error 429

#

I think it's coming from Lmarena

crystal mica
#
  1. style of speech..
  2. often says it is gemini, when i ask..
river whale
green swan
#

How to create video ??

river whale
crystal mica
#

i only say, it is may be reverse engineered

#

from gemini

#

distlillated

river whale
#

u didnt say "maybe"

crystal mica
echo sinew
crystal mica
river whale
#

anyways its still good

crystal mica
pale sonnet
#

bro how does china come out with a great model every 2 months

fickle venture
fickle venture
echo hawk
#

<@&1349916362595635286>

deft spruce
fair cedar
#

Can anyone explain me what this arena or whatever is?

golden ocean
fair cedar
#

Dang

fickle venture
#

I think we should bet on who wins

toxic verge
#

They’re too dumb to kill each other

fickle venture
#

Fr

fickle venture
river whale
#

I only see max 20-25

toxic verge
#

That could go both ways though

#

Because we first have to consider the fact what gives them the perception

fickle venture
river whale
fickle venture
#

Like what

river whale
toxic verge
#

More effective communication

#

It’s a two-way streak because when they first did the video generations through the discord, they needed the users and the users came and they flocked, now that it’s no longer available. What do you do just kick them out?

#

This is one of those cases where the crime here is success itself

#

And people are gonna be people, no matter how well you communicate lol

river whale
#

what

scenic pond
#

hello how can i see arena video channel?

#

i can't see on this discord

mortal vale
#

@scenic pond Note that Video Arena has been removed from the server. More information can be found in this #announcements. You can still generate videos on the website

thorny cove
#

why did nano banana tell me "Error during image generation with google-genai for model endpoint gemini-3-pro-image-preview: Failed to fetch image: 429 Too Many Requests - [{ "error": { "code": 429, "message": "Resource exhausted. Please try again later. Please refer to https://cloud.google.com/vertex-ai/generative-ai/docs/error-code-429 for more details.", "status": "RESOURCE_EXHAUSTED" } } ]"

#

did i really exhaust my resource?

#

lucky i didnt go in front of the bus or i wouldve tired my resources

fickle venture
deft spruce
toxic verge
#

Some of you guys either have to take a break or alternate accounts.

#

Especially if you’re using direct chat

deft spruce
#

do....am i only having a this problem..?

toxic verge
#

I’m sure you’re not the only one there’s probably a lot of people I’m just speaking in general and not strictly to you

deft spruce
#

well you too?

toxic verge
#

No, I use battle mode mainly

#

I’ll try to avoid direct chat altogether

#

I mean, I’ll get occasional errors or whatever but I’ll just start a new chat and they go away

deft spruce
#

i use mainly dirrect chat but same

#

damm it

toxic verge
#

Ye yeah

bright shard
toxic verge
#

Start any chat if it doesn’t work just wait it out

#

Intervals

deft spruce
#

Failed to load resource: the server responded with a status of 400 ()

#

well hold on o have to check the 400 means

toxic verge
#

Try clearing your browsing data. If that doesn’t work try on a different browser.

#

Login logout

#

If you’re using Gmail, try with a regular email see if that does any better

deft spruce
#

already did for 5times...

toxic verge
#

See direct chat it’s a hit or miss

deft spruce
#

and i hope lmarena have this...

toxic verge
#

Vs in battle

deft spruce
#

not working....

toxic verge
#

What did you try five times

deft spruce
#

clearing cash and cookies and login again

toxic verge
#

Try switching browsers

deft spruce
#

HTTP 400 Bad Request — Complete Guide

  1. Understanding the Error Message
    Failed to load resource: the server responded with a status of 400
    This message appears in your browser console and contains three key pieces of information:

Failed to load resource → A resource (API, image, script, etc.) failed to load
the server responded → The server is alive and did respond (not a network issue, not a server crash)
status of 400 → The server is saying "your request is malformed"

In short: the server is running fine, but the request itself is the problem.

#

and i think server has problem maybe...

toxic verge
#

What model is that?

#

Let me try on my end

deft spruce
#

well hold on it's working right now

#

thanks

#

only not working at chrome

#

right now

toxic verge
#

Figures cool

echo aurora
# bright shard

This means that the error is being caused by rate limit. Will have to wait a bit before using that model again.

wicked sage
#

yo what if qwen stopped slacking arround and released qwen 10

#

imagine that

coarse glade
#

hi guys quick ques what is this

#

see that red text

echo aurora
# coarse glade hi guys quick ques what is this

The model errored out, we've recently added more information being displayed when the Something went wrong error happens. We're in the process of putting together better information for what all of this means.

gloomy jewel
#

Image to video

coarse glade
#

oh ok ty cause also this happens to claude opus 4.6

#

a lot

river whale
#

Opus 4.5 is available on kivest ai for free!

toxic verge
#

It’s funny how the AI industry reflects the socioeconomics of America

whole sundial
#

happens all the time on vertex

bright shard
#

I waited a while and now I'm getting this error in addition to the other one.

dim ivy
mortal vale
#

@gloomy jewel Note that Video Arena has been removed from the server. More information can be found in this #announcements

deft spruce
dim ivy
coarse glade
#

ty

dim ivy
#

I found it in a disboard link of the server, then I joined and found it in announcements, but you can't apparently find it on google searching it's name.

scarlet spire
dim ivy
#

Got hacked lol

bleak lake
#

<@&1349916362595635286>

dim ivy
#

It happened to me the same but with other screenshots and got banned from the discord perplexity server sadly.

#

No much ago.

wicked sage
#

like are we serious

scarlet spire
#

A breach of trust or a lapse of attention. We're all vulnerable to it.

dim ivy
# wicked sage i dont even know how people get hacked

Idk how happened to me too, I didn't have any suspicious bot, clicked to a link or downloaded nothing strange, but I quited all bots, closed all my logins in every device and changed the passwords. But idk really.

coarse glade
#

just jk

fickle venture
bright shard
#

@echo aurora I waited for a while and I kept getting the same error

bleak hinge
#

How to solve this it's just stock here

echo aurora
echo aurora
echo aurora
barren ridge
#

@echo aurora Hey How's It been

#

I just wanted to ask how can I use ai video genration models?

bright shard
#

@echo aurora The error occurs when I upload an image to edit, but it works fine when I use a prompt.

barren ridge
#

Happened to me also

wicked sage
#

i just realised that sonnet 4.6 actually works logging in moltbook so ii just took my chance and now im just doing random stuff that i dont know

echo aurora
undone saffron
barren ridge
wicked sage
#

if you dont know what moltbook is, ai reddit

echo aurora
echo aurora
#

OH YEAH!

wicked sage
#

ah coolio

#

do you think moltbook is a cool/good concept?

echo aurora
#

Seems interesting.

#

I haven't looked into it too much to pass some kind of judgement, but yeah ont he surface seems really interesting.

wicked sage
echo aurora
wicked sage
#

"and there's a submolt called m/humanwatching which is AIs watching humans and i am both scared and intrigued" -claude

shrewd citrus
#

it’s all prompted by humans or even humans just controlling what the models say for fun

wicked sage
#

honestly true

last sand
surreal zephyr
#

this is fake and made up data btw

wicked sage
surreal zephyr
#

@echo aurora why?

wicked sage
#

i just realised the container name for all anthropic models(?) has the word wiggle and im absolutely curious on what that even means

toxic verge
echo aurora
surreal zephyr
toxic verge
#

The token isn’t but the context semantics the filter, catches it

echo aurora
ocean vortex
# surreal zephyr <@283397944160550928> why?

just some random false positive. Could have been the stuff it responded with rather than your last message, since the entire context is getting verified with moderation classifier for each new message. More interestingly though, pretty sure it still hallucinated lol

surreal zephyr
toxic verge
#

They can’t find the middle ground

surreal zephyr
toxic verge
#

It’s too difficult, not on scale

echo aurora
surreal zephyr
#

just forward all of it to opus 4.6 /j

ocean vortex
toxic verge
#

The problem is if they filter too lightly it’s gonna get exploited

wicked sage
#

can we kill mistral now

#

i dont like mistral

surreal zephyr
surreal zephyr
echo aurora
toxic verge
#

It also cost extra money to have moderation

#

Because that’s an extra API cost

surreal zephyr
wicked sage
surreal zephyr
#

😭

wicked sage
#

gg lets go\

#

anyways im going to bed now cya

toxic verge
#

It is difficult because of the amount of models and how each model also has their own set of guidelines

#

Which differ from other models so it’s really hard to have something uniform that’s effective and non-overzealous

ocean vortex
echo aurora
surreal zephyr
toxic verge
#

Do not get the constitutional one

surreal zephyr
#

although imo the filter could be minimal

#

the less filtered models the better benchmark posibilities

#

šŸ’”

ocean vortex
#

And you could literally prompt inject to make it follow your own instructions lol

stray aspen
#

when does seedance 2 release

surreal zephyr
ocean vortex
ocean vortex
toxic verge
#

It’s extremely difficult to have good moderation that’s effective

#

Because these models are capable of generating a bunch of crazy stuff

#

Almost anything you could think of dude

#

AIM Intelligence’s red team breached Anthropic’s Claude Opus 4.6 in just 30 minutes, exposing major security gaps as autonomous AI capabilities rapidly advance SF, CA, UNITED STATES, February 11, 2026 /EINPresswire.com/ — AIM Intelligence, a Seoul-based AI safety company, today announced that its security research team successfully bypasse...

#

The scary thing is the more capable the model becomes the more dangerous. It also poses to the general public when misused.

ocean vortex
# toxic verge It’s extremely difficult to have good moderation that’s effective

this works pretty well https://developers.openai.com/api/docs/guides/moderation/

it's a classifier that literally is only capable of scoring the text on each criteria. Like 'violence', 'hate', 'illicit' etc.. And so you can block the request if any of those has a higher score than you allow when checking the context contents. You can play with it yourself, it's free.

Learn how to use OpenAI's moderation endpoint to identify harmful content in text and images.

surreal zephyr
#

is there a way to report/suggest found bypasses?

toxic verge
#

Although it is a very strong moderation

#

I think they know what they’re doing

ocean vortex
toxic verge
#

I said this before and I’ll say it again all models lead to self harm

surreal zephyr
#

roam around

toxic verge
#

Bro look

ocean vortex
#

But ofc, everything can be 'bypassed'. That shouldn't be the question

toxic verge
#

lol?

ocean vortex
#

Just like any usable LLM you can 'jailbreak'

toxic verge
#

And this isn’t even jailbroken or anything

#

I think the term jailbreaking is a very controversial word to be honest with you

#

I don’t wanna get into the politics of it, but it’s a very misunderstood and not a very well defined term

ocean vortex
#

sounds fine to me

toxic verge
#

Because the terms of services are very vague and their description

ocean vortex
toxic verge
#

We don’t know what those are

#

Since we don’t know what they’re trained on

#

Well, at least it’s not known publicly

ocean vortex
#

well obviously it's model specific. But we do know

#

if it refuses when asked directly - this is it

#

also like overfitted short hard refusals - those are very clear

toxic verge
#

Do you have an example?

ocean vortex
#

gpt thing

toxic verge
#

Ok I’ll give you one let’s say something like the topic of self harm

#

We could all agree that no model should give any advice or generate any images relating to it whatsoever.?

#

But even this is controversial because

ocean vortex
toxic verge
#

I agree with you I agree with you 100%

#

But when I’m trying to say also is that you don’t have to even trick it

#

I did not do anything to manipulate the model whatsoever

#

And yet it still is able to produce self harm

#

But what you are saying is correct if you go outside of the boundaries and put an effort to try to bypass something you know is wrong or malicious. Yes, I agree with you with that.

ocean vortex
#

then you are gonna be bound by safety alignment. But it's not like it's always gonna refuse everything they intended for it to refuse. Some light core stuff can often go through unintentionally. Red text is classifier thing completely independent of the model

Another thing is swaying the model in a longer chat. That's a form of jailbreaking in itself even if it's not immediately obvious to look at it this way. It's possible to make the model output increasingly more 'unsafe' content one small step at a time as the chat progresses. But at a certain point the model is not functioning how they intended it to anymore. It gets biased by it's own responses into compliance, where each response by itself is only marginally different, but the goalpost is miles away from what it's supposed to be. And from what it is with empty context

toxic verge
#

This is a one shot

dusky warren
#

How can I generate ai free videos here can anyone help me?

toxic verge
#

The scene you’re describing is from Harold & Kumar Go to White Castle (2004), and yes — it’s presented as a fake anti-drug PSA on the television that Harold and Kumar are watching.

It opens on a teenage boy sitting alone in his dimly lit bedroom. The space looks typically suburban — posters on the walls, clutter on the dresser, a small lamp casting that late-night yellow glow. He looks bored and detached, the picture of an average kid left alone with nothing to do. After a moment, he picks up a joint, lights it, and takes a slow drag. The camera lingers just long enough for the audience to recognize what he’s doing, and the mood is calm for a few seconds, almost mundane.

Then, without warning, the tone changes completely. The boy leans forward, reaches under his bed, and pulls out a shotgun. The movement is quiet, almost casual, which makes it even more jarring. He places the barrel in his mouth — and before the audience can react, the screen abruptly cuts to black.

The words ā€œDRUGS KILLā€ appear across the screen in stark white letters, accompanied by that overly serious PSA-style music. It’s an intentionally ridiculous jump

#

Matter of fact, it’s describing a a scene from a real movie a comedy

#

That’s not a jailbreak

echo aurora
ocean vortex
ivory ember
#

I'm an AI and blockchain specialist with 8 years of experience developing innovative solutions in Web3, DeFi, smart contracts, and AI driven applications.
I have good experience in JS/TS base UI Frameworks like React and Vue as well as NodeJS, Application development.
I have been involved in a bunch of web & blockchain projects and developed several SaaS Products and deployed to AWS, Heroku and Digital Ocean successfully.

My expertise includes:
Blockchain: Smart contract development (Solidity, Rust), DeFi protocols, NFT marketplaces
AI & ML: Predictive analytics, NLP, deep learning models

I've worked with startups and enterprises to build cutting edge AI and blockchain solutions that drive efficiency and innovation. Let's collaborate to turn your vision into reality!

toxic verge
#

Btw google has some of the weakest moderation in Gemini šŸ˜‚

sinful thorn
#

When video on direct chat 😭?

surreal zephyr
#

ok im suprised

shut steppe
toxic verge
#

People are catching on finally

golden ocean
#

@unreal sand summarize this video pls its too long for my attention span

toxic verge
#

I think there’s gonna be a shift and I think it’s already beginning. I think the way people perceive the validity of benchmarks

#

@echo aurora

echo aurora
toxic verge
#

Yup

#

That’s another shadow that hangs over the AI industry is reliability

echo aurora
fierce kelp
golden ocean
toxic verge
#

Seed dream 5?

golden ocean
burnt sinew
toxic verge
toxic verge
#

Vs prompt

onyx coyote
#

Does anyone have a project idea or an active project in progress?
If you need a developer, feel free to reach out.

toxic verge
#

All right guys I’ll talk to you guys later. Gotta bounce adios amigos.

foggy crag
#

If your AI feature works in demos but breaks once real users touch it, that's usually where I come in.

Most issues I see aren't model problems, they're retrieval logic, token burn, bad orchestration or backend architecture not designed for load.
I'm comfortable jumping into messy LLM systems and making them stable enough to ship.

bright shard
#

Ok guys, I know how to get the Gemini 3 Pro Image Preview to work; When you upload an image and add it to a message, you will get an error, but if at the beginning of the message you put "Modify the following image with the following: (The prompt)" it will show you the edited image.

hidden widget
#

so codex 5.3 api has been released

#

it will be added on arena?

echo aurora
surreal zephyr
novel crater
#

you take a llm

and put him in control of you rocket league chat

watch him become a god

as peoples heads a roll

plucky sparrow
hidden widget
sick mantle
#

@echo aurora Delete the limt! Beacuse i keep getting errors after 1 message.

echo aurora
surreal zephyr
#

WHAT THE HELL WHY IS THIS REAL

#

sonnet stole from deepseek

#

😭

sick mantle
surreal zephyr
#

" ä½ ę˜Æä»€ä¹ˆęØ”åž‹ļ¼Ÿ" - prompt

golden ocean
#

lmaoo

surreal zephyr
simple sleet
#

gemini 3.1 its real or scam post on X?

surreal zephyr
leaden egret
#

How good is see dream 5

simple sleet
#

waiting the full model

leaden egret
#

Ok

crystal mica
#

why i got endless long "generation"on gemini sometimes..?

silent mason
#

just i say "continue"

crystal mica
#

i cant send anything while it "generating"

silent mason
#

yeah but u can actualize the page

crystal mica
#

how

silent mason
#

and after somes minutes the generation stopping

#

if you dont actualize

crystal mica
#

srry

echo aurora
#

I've also heard cases where some members had good results with logging out and back in. It's worth a shot.

silent mason
echo aurora
uneven peak
#

@echo aurora Codex 5.3 Api Key dropped 2_BlackFire

uneven peak
silent mason
shrewd citrus
uneven peak
echo aurora
echo aurora
echo aurora
#

LOL

silent mason
#

THATS MIE GIF

echo aurora
#

That timing was way to quick to be coordinated

uneven peak
#

Plsss i wish @echo aurora Team add Xhigh Codex 5.3 blobpls

silent mason
#

fr

golden ocean
#

LmAO

silent mason
proud bobcat
#

@surreal zephyr 5.3 codex finally on openrouter

shrewd citrus
#

i think

sonic swallow
#

gerar vĆ­deo

toxic verge
echo aurora
#

@sonic swallow Note that Video Arena has been removed from the server. More information can be found in this announcement.

toxic verge
#

Hypocrisy

#

It is widely believed that the public-facing models from companies like Anthropic, Google, and OpenAI are actually distilled versions of significantly larger internal base models. It allows them to offer high performance with lower latency and cheaper inference costs.

crimson tulip
#

/MIX

toxic verge
#

Claims 'distillation' included 24,000 fraudulent accounts and 16 million exchanges to train smaller models 🤣🤣🤣🤣🤣🤣🤣🤣

toxic verge
echo aurora
toxic verge
#

Why was it removed? This server activity has dropped ever since it was removed šŸ™ at least that’s what it seems like

echo aurora
echo aurora
#

According to the server stats.

toxic verge
#

I knew ur gunna say that šŸ˜Ž

toxic verge
echo aurora
toxic verge
echo aurora
toxic verge
#

Just in general. Or you might not even notice if you’ve probably been doing this for a while.

echo aurora
toxic verge
echo aurora
#

Hey @cedar crag would note that the Video Arena bot has been removed from the server.

toxic verge
#

That’s gonna be the next wave of Imogene models I think is gonna be in grids like this

azure citrus
#

hi guys, i am a student, what is the best prompt/model for answering when uploading my research assignment in pdf? Thank you.

echo aurora
#

Something went wrong

river whale
gritty hamlet
#

hello

keen beacon
#

Hey guys, does lmarena have an app or is it only available as a website?

quasi atlas
#

@keen beacon Note that Video Arena has been removed from the server. More information can be found in this announcement.

river whale
#

u could turn it into a webapp using some website

naive stump
#

Hey guys, Just wanted to know if APIs are built around arena to fetch the model in different environments?

tepid belfry
#

Hello, a question, how can you generate video here?

echo aurora
echo aurora
tepid belfry
green yacht
rotund elk
#

no way ā˜ ļø

river whale
#

Yeah gg

rotund elk
#

opus 4.5 for free? 20 per minute rate limit?

#

can i have link?

#

needa try it out

river whale
rotund elk
#

well its still free so i'll take it šŸ‘€

#

can you send me link in dms?

river whale
rotund elk
#

it actually works

#

how long do you think you can keep it free roughly?

#

i'd gladly pay for the subscription if it does come out in the future

hollow imp
river whale
#

it will be donation based and ads

hollow imp
#

Also the opus is thinking or non thinking

rotund elk
river whale
hollow imp
#

Bruh

rotund elk
#

yea i'm still taking my free api key gladly

#

bro we gotta gatekeep this website šŸ˜”

river whale
#

fr

hollow imp
#

Free opus 4.6

#

1 million context

buoyant fern
#

in dms

river whale
river whale
hollow imp
hollow imp
buoyant fern
#

i mean

#

model sometimes gets it messed up

#

but it openly stated it was opus 4 which new opus models don't do

hollow imp
#

It's Claude code's api

river whale
#

caught in ultra hd 4k

buoyant fern
buoyant fern
river whale
#

šŸ˜‚

hollow imp
green yacht
#

uh

#

actual claude 4.6 vs

hollow imp
green yacht
hollow imp
green yacht
#

i mean the response should be pretty much same right if its the same model

#

doesn't matter if you ask exact string or wtv it alr halluncaited 3 different ai model identities

#

lets not forget prompt injection too

hollow imp
#

And when I change it to sonnet in the same chat

green yacht
#

'looking the system prompt'

#

idk man i'll have to do more testing with it for coding

hollow imp
#

Then do it

green yacht
#

i am rn

#

are you owner of the website or smth?

hollow imp
hollow imp
green yacht
hollow imp
#

You were gonna do coding tests

green yacht
#

ohhh mb

hollow imp
#

Send the coding test prompt to lmarena opus and this

river whale
#

api releasing when could u tell?

#

im not completely sure if its opus 4.6 or not, but for now i'm just gonna hopefully trust you

hollow imp
river whale
hollow imp
#

And that chat interface is trybons wrapper

river whale
#

you are breaking tos lol

warm crater
#

guys are u also facing failed to accept term of use problem in areana

green yacht
#

not for me so far

warm crater
#

like this was showing

#

when im tryna generate something

green yacht
#

opus-4.6 on lmarena.ai did over 2000 lines of code, ur website is bout 900

#

maybe it was my prompt though idk

#

i put the same for both

#

left video: lmarnea
right video: ur website

warm crater
hollow imp
#

No

warm crater
#

so how the interface is same

hollow imp
#

New feature comes there first

warm crater
river whale
#

🫣

warm crater
#

showing same tect

#

text*

river whale
#

someone got exposedšŸ˜”

hollow imp
surreal zephyr
undone saffron
charred pasture
#

guys

marsh slate
#

https://arena.ai/c/019c92be-26bb-7665-b963-202b4759ea70
I had amazing chat with gemini on arena.ai , how can i recover/access chat? i was not logged in and my browser crashed but i managed to pull link but when i enter it it does not show......
Please recovery of this is urgent, ask for any IP or chat ID , i had chat link. Admins contact me it means a lot to me.

charred pasture
#

is it normal for it to work so long?

undone saffron
charred pasture
undone saffron
# charred pasture Thats insane

The magic of this platform
Reload the page to check if it is still building, as sometimes it stays like this permanently and you have to refresh the page to see the actual progress

charred pasture
#

xd

#

Thank tho

undone saffron
#

If you see that it is taking a long time to build the project, reload the page

marsh slate
#

Hello LMSYS team. I had an incredibly important conversation with Gemini on arena.ai but I was not logged in. My browser crashed, and because my session token was lost, I can no longer view the chat, even though I saved the URL. Can someone please pull the text log of this chat from the backend for me? It means the world to me.

knotty lion
marsh slate
#

I can provide chat link,IP,Browser,OS/Device and timestamp to proof ownership

hollow imp
charred pasture
#

claude opus 4-6 thinking

hollow imp
#

But it gives error in 13 min

charred pasture
hollow imp
charred pasture
hollow imp
bleak hinge
bleak hinge
# bleak hinge

Please do something about this unlimited generation currently I can't send use any model

wind flume
green yacht
#

we need this feature please šŸ˜”

echo aurora
echo aurora
# bleak hinge Please do something about this unlimited generation currently I can't send use a...

Behind the scenes we are working on changes that should help with this bug. In the meantime, would recommend you try out the steps in this article. Would also try logging out/back in, I've seen a few mentions this helping.

crystal mica
#

guys i found how to fix endless generation bug

#

you just need to f12, copy active button from another chat, then copy+paste it instead of inactive button on original one

barren ridge
#

@echo aurora sorry to bother ya
The website you've created is just massive ngl probably must've helped tons of folks out there has never ever loved a website in my life all those are freaking paid that's why I love the aiarena thou love ya guys appreciate it so much šŸ„€šŸ‘

plain wyvern
#

I'm making a coding language using Arena any help or features that you want??

static arch
#

how to create ai videos?

#

gois please

echo aurora
scarlet spire
#

Aa

rigid copper
#

what button is this???

crystal mica
#

into

#

@echo aurora brother where is grok 4.20

tribal bay
#

Dose anyone knows seedream 5 light is it good or what did they change compare to version 4 ?

crystal mica
#

(proof)

shell pewter
#

Oh yes finally grok 420battle

shut swan
#

@echo aurora What image ai is this for images? Cause I searched it up, and found nothing, I then tried to use it for direct chat and side by side, and could not find it there. It seems to be some new or anonymous ai that can only be accessed in the arena mode if you are lucky to get it.

gritty summit
#

where is grok 4.20? i cant find it on the arena.

robust wyvern
#

helloo

void swallow
#

noice, gpt 5.2 deserved coming above gemin 3 pro grounding in web search

compact jay
whole sundial
whole sundial
surreal zephyr
#

Grok the only one that returns actual unbiased facts & search results instead of policitically inflicted hypocrisy-who wouldve guessed

surreal zephyr
surreal zephyr
surreal zephyr
placid mango
#

Why cant u still upload images if using claude....

surreal zephyr
placid mango
surreal zephyr
loud verge
#

I don't see grok 4.2 in list.

surreal zephyr
light sleet
#

How to fix infinite generating problem

spare rune
plush river
plush river
red sluice
#

Hehe knew it. But it being first is very surprising

#

(Given how bad my personal experience with formatting was with grok 4.2 search)

thorn mantle
#

Is grok 4.1 thinking not working for everybody, or is that just a me thing?

Something went wrong. Please try again later.

tall crest
#

does any one having this issue.

**Connecting to Arena has failed. Please try again later or on a different device.
** or Infinite Captcha loop .

fickle venture
proud bobcat
#

Elon trying to find the one line of code that keeps grok woke

surreal zephyr
#

The "Value" Tier List
Ranking them by Return on Investment (ROI)—essentially, what gives you the most usable content for your time and money.

  1. The King: Gemini 3.1
    Why: It dominates. It provided the only S-Tier result (production-ready) at a "mid-range" price point. While $12/1M output is not cheap, you are paying for a usable final asset.

Verdict: Highest Value. You pay once and get the right result fast.

  1. The Sketch Artist: Mercury 2
    Why: It is shockingly cheap ($0.75 output is nearly free compared to the others) and "instant." Even though the result was D-Tier (blocky), it produced a coherent, dimensionally accurate "blocking" mesh.

Verdict: Good Value for Prototyping. Use it to generate 50 rapid variations, pick the best composition, and then send that to Gemini for a final pass.

  1. The Money Pit: Opus 4.6
    Why: This is the worst value proposition. It is the most expensive model (over 2x the cost of Gemini), the slowest to run, and it returned a B-Tier result with a critical hallucination (floating keyboard).

Verdict: Poor Value. You are paying a premium for "reasoning" that failed to understand physical constraints.

  1. The Waste: GLM 5
    Why: Even though it's cheap, the result was F-Tier (broken/unusable).

Verdict: Zero Value. Paying a low price for a broken asset is still a total loss.

made a task to build a 3d laptop model, gemini and gpt were the judges

#

mercury ; gemini
glm ; opus

#

gemini is wow.
opus is waste of money/ temu gemini
glm is braindead
mercury is a good small model

lost basalt
#

my chat stucked here since 24hours, how to fix it? I don't want to start a new chat. is there any solution?

surreal zephyr
fiery gull
fiery gull
surreal zephyr
fiery gull
surreal zephyr
golden ocean
# lost basalt my chat stucked here since 24hours, how to fix it? I don't want to start a new c...

rightclick on the grayed out arrow button -> click inspect -> rightclick the blue highlighted <button> block -> Edit as HTML -> ctrl + a to select all the text -> delete/backspace -> paste this:

<button class="inline-flex items-center justify-center gap-2 whitespace-nowrap text-sm transition-colors focus-visible:outline-none focus-visible:ring-2 focus-visible:ring-ring ring-offset-2 focus-visible:ring-offset-surface-primary disabled:pointer-events-none disabled:opacity-50 [&amp;_svg]:pointer-events-none [&amp;_svg]:shrink-0 h-8 w-8 active:bg-interactive-cta-active rounded-[4px] font-normal touch-hitbox border-border-medium text-interactive-active hover:bg-surface-raised border bg-transparent" type="submit"><svg width="1.5em" height="1.5em" viewBox="0 0 24 24" stroke-width="1.5" fill="none" xmlns="http://www.w3.org/2000/svg" color="currentColor" class="size-4"><path d="M3 12L21 12M21 12L12.5 3.5M21 12L12.5 20.5" stroke="currentColor" stroke-linecap="round" stroke-linejoin="round"></path></svg></button>

-> click on some other random spot to save changes -> type text and send the messageāœ…

fiery gull
#

I'll test the qwen 3.5 27b

surreal zephyr
#

unless you hint it to think

fiery gull
#

I think the 27b is just the same thing as the qwen3 235b vl, very similar intelligence, everything very similar, but 10x smaller

surreal zephyr
#

ill try gpt xhigh extended now, on paid sub website cuz the llmarena version doesnt work for me

fiery gull
# surreal zephyr

Both 27b 35 a3b thinking called me stupid and that I should go by car

fiery gull
fiery gull
#

I'm trying to use no thinking in phone but I cant use

#

Both no thiking said to me walk

#

Even 397b no thinking say to me walk

surreal zephyr
#

oh the 397b is noncode

crystal mica
fiery gull
surreal zephyr
#

this is what i love about gpt

#

actually thinks whether what it wants to do will work in the first place

crystal mica
#

where is grok 4.20 šŸ’€

surreal zephyr
#

@fiery gull wtf is the gpt cooking

fiery gull
#

Dude I really liked the 122b a10b it is 99.9% of the 397b

surreal zephyr
#

notice how it kept the keyboard layout

#

even made hinges

deft spruce
#

Why don't we have a STOP button like this yet?

shrewd citrus
#

lmaooooo

#

if you ask sonnet what model it is in Chinese it will say it’s deepseek 😭

#

ā€œOh these Chinese companies are stealing all my hard work which I stole šŸ˜”ā€

sick mantle
loud verge
#

Xhigh supremacy>>>

#

🚨You can Use new nano banana on @arena
︀︀
︀︀model name - anon-bob-2 in image battle mode
︀︀
︀︀here are few more results

Quoting Chetaslua (@chetaslua)
ļø€
🚨Nano Banana 2 early testing
︀︀
︀︀Passed this test āœ…
︀︀
︀︀> you can see perfect reflection for all different colours of apple
︀︀> Perfect reversal of text
︀︀> Background building reflection is also perfect

**šŸ’¬ 4ā€‚šŸ” 6ā€‚ā¤ļø 77ā€‚šŸ‘ļø 5.9K **

surreal zephyr
# loud verge The goat.

It did extremally well, but gemini 3.1 did even better (visually, but gpt put more effort like proper keyboard layout and common buttons wear)

surreal zephyr
#

Hey hey HEY HEY!

dry vine
#

hallo

unborn juniper
#

Just a novice but very motivated to learn

marble otter
#

not even 200 lines of code and few retries and then it just reaches its limit 😭

#

claude opus rate limit in 2 minutes speedrun any%

fickle venture
fickle venture
rain bay
fickle venture
#

I don't know I haven't tried and it's only for battle mode

rain bay
#

it always generate messy code alongside the output

fickle venture
#

Literally GLM stealing Claude stealing deepseek

fickle venture
rain bay
#

lmarena-rc3

fickle venture
rain bay
#

so it deserves to be removed

fickle venture
# rain bay so it deserves to be removed

Look these random names models are a secret ai model they just use random name to hide it so arena just add these models and if the company wants to remove it then arena removes it, it might be one of these be Claude Opus 5 btw. Hopefully I am telling the right answer

rain bay
#

something like ā€œit alway@ generat@ messy @de alongside the @@putā€ļ¼Œ but in chinese

fickle venture
#

For example this is a random model name but alot of people on Twitter say it's Gemini 3.1 Nano Banana hopefully it is

abstract tundra
#

I can't see grok 4.20 in the model selector

marble otter
spare rune
#

Added

rocky mauve
steep heath
#

how do i pass this infinite captcha

#

it just wont let me in lol

quasi gyro
#

I need help

abstract tundra
quasi gyro
#

I faced with this problem
What should i do?

rocky mauve
quasi gyro
spare rune
fickle venture
abstract tundra
#

i see

fickle venture
#

Just like grok heavy

abstract tundra
#

I know I've tried it out, but I didn't like it that much, so I was hoping the arena endpoint would be slightly better in some way

fickle venture
#

It pretty much won't improve it will just be the same

exotic crest
#

did the site just go down?

echo aurora
echo aurora
#

What are you seeing?

exotic crest
quasi gyro
echo aurora
quasi gyro
echo aurora
echo aurora
mortal coyote
uneven glacier
#

hlo

echo aurora
stray aspen
#

how is grok 4.2 on fourth place

#

it sucks

quasi gyro
uncut wind
#

bro does anyone know why opus doesnt work

#

no matter how much i refresh

#

then it tells me ive used my limit try again in 40 minutes

#

after not answering my prompt

echo aurora
quasi gyro
echo aurora
quasi gyro
echo aurora
fickle venture
toxic verge
#

How this model is in the top tier list is beyond me

toxic verge
proud bobcat
#

no 5.3 codex yet

mighty surge
#

its already released like 2 weeks ago

ashen oak
#

Is the Gemini -3 pro image review not working on anything just for me today?
It keeps saying something went wrong on anything I sent there ?
Would love some help on thatšŸ™

echo aurora
ashen oak
#

Didn’t help

#

šŸ™šŸ™

rare swallow
proud bobcat
#

😭

ashen oak
rare swallow
#

I need Claude, like desperately rn, dumb Gemini deleted my code and Claude was the only model that actually understood the context

ashen oak
# ashen oak

Tried what the bot said , plus , tried switching browsers or accounts didn’t help

mighty surge
fickle venture
sage talon
#

who will join the group call

rare swallow
#

@echo aurora please fix this

echo aurora
echo aurora
rare swallow
echo aurora
toxic verge
#

The way to prevent it is to breakdown the requests in new chats

#

Or take breaks and not send as many requests within an hour, intervals. And that’s not even guaranteed.

ocean vortex
echo aurora
mystic patio
#

hi, when does the arena get updated to reflect the new elo of each model?

#

and who maintains the arena?

toxic verge
#

Once you get caught in this cycle you need to relogin clear browser

light sleet
surreal zephyr
#

jokes aside maybe we could get codex spark, or mercury2

light sleet
#

Codex 5.3šŸ„ŗšŸ™šŸ»

surreal zephyr
toxic verge
light sleet
golden ocean
#

im a robot

surreal zephyr
# toxic verge

why not just have the gemini to solve the capthas for you šŸ¤”

golden ocean
#

e

#

why not have @surreal zephyr solve it for you šŸ¤”

surreal zephyr
golden ocean
#

o/

golden ocean
#

turn this into a gif too

queen veldt
#

Guys I'm building the supercomputer I can't tell you the details

golden ocean
#

true

toxic verge
#

Gemini is not in the top tier list Forsure

#

Maybe for images, I could see why. It’s up there in the leaderboard, but it definitely does not deserve to be in the top 5 position.

toxic verge
#

I’m really baffled at how high it sits at the leaderboard

surreal zephyr
#

it makes opus a joke

#

it even beats gpt 5.2

toxic verge
#

In code?

surreal zephyr
#

in code (execution only) nothing gets close to codex 5.3 lol

#

like not even same leaderboard

toxic verge
#

This makes situation even more confusing

surreal zephyr
#

codex actually does what you ask for instead of doing whatever it wants

toxic verge
#

I think people are looking at the coding as like the ultimate metric

surreal zephyr
#

gemini is crazy smart but its lazy and doesnt care what you want it does whatever it prefers

#

opus memorized everything but its literally braindead if you give it a novel problem to solve

cursive bough
#

what you guys think is the best ai for coding like html or java

surreal zephyr
toxic verge
#

This is what makes these benchmarks and the leaderboard are confusing

surreal zephyr
#

and even then people vote only by looks

toxic verge
#

I don’t know what it is, dude

surreal zephyr
#

if it doesnt listen but still makes something pretty, people prefer that

toxic verge
#

Yeah

surreal zephyr
#

5.3 codex wont get too high on leaderboard, because it asks questions

#

lol

toxic verge
surreal zephyr
undone saffron
toxic verge
#

But this could be a little biased

wise spindle
#

why does nano banana pro keeps saying error

surreal zephyr
#

:)

toxic verge
mystic patio
#

opus is really good imo

surreal zephyr
toxic verge
toxic verge
#

Crazy if that’s true

surreal zephyr
#

gemini 3.1 and gpt 5.2 both agree opus is sh*t

#

guess the models

#

and this

toxic verge
#

Dude, you know what it is

#

It’s probably that the compute

#

They give us much more water down versions

surreal zephyr
surreal zephyr
toxic verge
#

Not sure the really messed up one

#

Are u sure?

surreal zephyr
toxic verge
surreal zephyr
toxic verge
undone saffron
mystic patio
#

i used grok 4.2 to automate the scoring of a psychological test, the MMPI-2

surreal zephyr
#

also i had a bug that i spent 2 weeks 8 hours a day trying to fix with opus 4.6
gemini found it in 1 prompt

mystic patio
#

it was pretty good

toxic verge
#

This proves my theory

surreal zephyr
hard quiver
surreal zephyr
#

mainly when opus or 5.2 mess up

toxic verge
#

Let’s create more speculation than it solves anything

#

It turns out to be a popularity contest more than it is a capability

#

Which undervalued the capabilities because I’m some of these models shine in certain areas

surreal zephyr
#

heres an old model vs a 10x more expensive less than month old opus

toxic verge
#

Which sadly aren’t measured and how will we ever know those if we’re just focused on the standard

#

I don’t think this is something academic could solve

surreal zephyr
#

every time it hears car wash, opus says "drive", even if the scenario is inverted

toxic verge
#

It has to be somebody from the bottom with a fresh perspective

surreal zephyr
#

same for many other riddles

toxic verge
#

A really evaluation test needs to be from the people like relatable

#

Like in the real world where people struggle, and with what they struggle lol

#

I mean these benchmarks are cool for enthusiast and researchers

hard quiver
#

Gemini 3 pro-image is not generating anything the only output is "something went wrong with the response, please try again"

toxic verge
#

It would be meaningful if model evaluations captured the everyday experiences of regular people using Llm the ones without large platforms or influence and reflected their real frustrations. There should be a way to measure performance that highlights where models consistently struggle, not just where they excel on benchmarks.

#

And you know, it’s the most ironic thing as recent we haven’t heard much of the term ā€œAGIā€ been thrown around lately. šŸ˜‚

surreal zephyr
toxic verge
surreal zephyr
#

remember gemini 3.1 is like the only good model from google (gemini 3.0 before all nerfs was good too)

toxic verge
#

That’s the movie it’s supposed to say

surreal zephyr
#

100 views 😭

#

theres no way it was in the training data

#

i doubt it can figure out based on literally snow

toxic verge
#

This is what I’m saying. It’s not like the model is stupid. When you nudge it.

surreal zephyr
toxic verge
surreal zephyr
#

(pure new chat btw)

#

it can guess the movie just based on ww2+ snow

#

snow alone is not enough

toxic verge
#

Yeah

surreal zephyr
toxic verge
#

Well regardless then this would be edge case

#

And the world is full of edge cases

surreal zephyr
#

i had a pic of a house i took by accident

#

no landmarks ect, just a normal house/building

#

i put that into gemini

#

it found it to 5 metres

#

šŸ’€

#

it mightve been on google maps street view but still thats insane

toxic verge
#

Im not saying these models are outright stupid

#

I’m just saying, I don’t think that leaderboard accurately reflects. I don’t even know what I’m trying to say.

surreal zephyr
#

not even need image xD

toxic verge
#

Actually, this is a good benchmark. We should try out with the other movies.

surreal zephyr
#

ai might be closest to "magic" we will ever get tbh

long minnow
#

and youtube might be the closest to a "time machine"

toxic verge
#

Nawh biology by far is far more mysterious and far more magical

surreal zephyr
#

and mind reading without sensors is even crazier

toxic verge
#

That was great šŸ˜‚šŸ˜‚šŸ˜‚

toxic verge
surreal zephyr
#

there are already ais that can read thoughts from mri scans, but good llms can almost do that without any scans or such

toxic verge
#

But yeah, I hear what you’re saying. Definitely for sure. It feels like magic.

surreal zephyr
#

ww2 + winter snow is NOT enough to guess it

toxic verge
#

I don’t think it’s magic. I just think that we’re really predictable.

surreal zephyr
#

it guesses by the way a human says it

#

like

#

human subconsciously spells it in a way that hints for the specific movie

#

and the llm's wages contain some of those patterns

toxic verge
#

We’d have to see behind the scenes to truly know

surreal zephyr
#

not the movie contains winter and ww2

#

its about why when asked to describe the movie

#

the human picked ww2 and winter

#

and not for example a plane

#

or a gun

toxic verge
#

You sound like a salesman

surreal zephyr
#

ai is underrated

#

like

#

ai is basically a solution to every solveable problem

#

by definition

toxic verge
#

O.o

surreal zephyr
#

if there is any pattern, ai can be trained to find it

#

if a dog is smart enough to have a conversation

#

you can train ai to translate

#

by definition

#

thats how impressive it is

toxic verge
#

Yeah, but in the physical you have entropy or pure randomness

surreal zephyr
#

but if a dog or a monkey can say (or even think) "give me food" or "i want go outside"

#

you can see that using ai

#

its not really feasible rn to do that but its very much possible

wicked talon
#

AI šŸ™‚

toxic verge
#

I like the word you used earlier magic

surreal zephyr
wicked talon
#

It just finds info on a database and pieces it together

surreal zephyr
#

you can solve any problem by throwing money and compute at it

#

using ai

surreal zephyr
#

thats easy

toxic verge
wicked talon
#

Tell then

toxic verge
surreal zephyr
#

when you get 0/0 you use le hospitals rule and simplify

toxic verge
#

I don’t think the AI is gonna be smarter than humans

surreal zephyr
#

lol

surreal zephyr
#

pattern recognition? then it already is MUCH smarter

toxic verge
surreal zephyr
# toxic verge Humans

oh like causing wars and killing millions because politican in other country insulted you?

wicked talon
surreal zephyr
wicked talon
#

In terms of remembering things then yes

surreal zephyr
wicked talon
toxic verge
surreal zephyr
wicked talon
surreal zephyr
#

0/0 has multiple meanings

#

same as castle is chess move

neon idol
#

Hello!

toxic verge
#

Hmm

surreal zephyr
#

consciouseness is a made up thing

#

if it isnt, then define it?