#general
1 messages ¡ Page 270 of 1
Meh, I'd rather yap if it means I encourage respectful vocabulary in those who might not even understand that their words can be interpreted wrong.
but guess what it is what it is
Naah I'm pretty sure I found out with my girl
Actually trans can be used as a noun meaning "a trans person" as well
Plus a verb meaning "to transition" or "to render transgender"
is she over 16
Believe me buds when I say this but I feel like yesterday I was 16 times flies like hell
Yep 𼲠about 20 I don't like older or younger people
"can be used" does not mean "should be used" because it's not proper grammar.
That would be an abbreviated verb. "Trans" itself is not a verb.
you sure about "younger"?
Police grammar bro
Tf is the difference between 5.2 high
this guy reads book everyday
As far as I'm aware there's no institution to determine what proper grammar in the English language is
Fuh nah stop bringing politics into llmarena
Dictionaries suit themselves to fit popular usage, not the other way around
Bad ragebait
I'm a linguist
Agreed if you came from middle class part of India and live in under developed part of the country
This specific case? yes. Grammar commentary. Because it's about tactful addressing of a minority group that already very often gets intentionally harassed with intentionally malformed phrasing to be hurtful.
me
me
where are you from
btw
in india
đšđš
There's no politics in linguistics. There's only politics if you use it as a political argumentâwhich you've come to misassociate with it through the rampant politicization of some people's mere existence as a person. that's when. But not when it's linguistics-related.
You're currently prescribing the usage of words for the sake of a political agenda
đš if I tell you then it reminds me of that joke what is worse than being a Indian being a guy from bihar (patna)
He's too formal for yt shorts
thats crazy
We can, however, rely upon respected authoritative sources such as academic publications, dictionaries, independent specifiers, and well-used test frameworks such as the CEFR and the GCSE. đ
If you will read my next message, you'll see that I stated that popular usage eventually trumps any institution
I'm happy to discuss this with you elsewhere if you wish. I do have a fair number of inquiries to make based upon your statements herein. But in line with the server rules, I can't continue discussing politics even if I disagree with you.
@scarlet spire đĽ˛đĽ˛ so if I use "are" instead of "is" then it will decide that I am from left wing or right wing??
I don't think anyone said so 
my fault pineapple
Hey everyone - just a few reminders of our server rules:
- â Treat others with Respect. Be kind, assume good intent from others, and keep disagreements respectful. Itâs encouraged to share your disagreements, but only if itâs done in a respectful and productive way.
- â Avoid political and religious content. As a space thatâs inclusive to many different worldviews we ask to avoid topics related to politics and religion in order to maintain an inclusive space. It is okay to have discussion related to new policy or laws as long as itâs related to AI.
Going to ask we move on from this conversation please.
do yo know how to embed ai voice agents on a website
I don't think we'll be able to convince each other or give each other valuable insights so I'll have to refuse, thanks for the offer though
Na bro said that you're describing grammar for the sake of political agenda
So I was bit curious
I do not.
Yep
soooo uhhh, can we talk about when whale will come out
As you should be!
Okay it's time will back soon
I am already tired from my online college and web project will back soon
was not expecting an updated gpt chat model
I dread every day, for I realise, for every year that passes my age increases in count by 1.
you're a girl tho, so its lowkey like turning 16 for you
i feel
@scarlet spire after talking to you both after a long hectic day I feel kind of relief 𤧠will see you guys
cya
Naaa it's distinctly 24 with "F me where did those years go, I cannot imagine being 30!"
Asking we keep conversations somewhat related to AI 
how close are you to being married
Love you too babes
âď¸âď¸
bruh
âď¸âď¸đš
About.... hmmmm yee much.
i got you bro but the chat cannot be talking about ai 24/7
Pineapple, prepare protocol
it will be dead most of the times
how close to having kids
like 1-2 years
LMAO
unc status
Pineapple I think yall should add a off topic channel
I'd prefer it to be dead instead of people talking about their age, marriage, kids, etc.
đ
This isn't the place for that.
im hopping off in a min so yeah
Eh, it's okay. Rowdy horned teenagers amirite.
(Their ticker slams redline when they realise they can't determine whether to be callous and careless or appraising and submissive around a female.
)
if the other person isnt comfortable, they can refuse or ignore đ
Better yet we can just avoid these subjects
Exacly. No politics please. Nothing good ever comes from politics!
Sure, that's true! That does not, however, forego respectful interactions that follow the requested ruleset that is in of itself very permissive. đ
damn yall are so serious
I am not
ima js delete dis app horrible
guys im back from school
Heeeeeeeeeeeeeey chicken
go back
how it is going
wb wb
Sometimes, life demands of you to control yourself for the sake of getting a sense of the room, and behaving accordinglyâserious.
why do you talk like chatgpt wtf đđ
right
because chatgpt talks like humans
it doesnt
you are not really straightforwardly sir
it talks like an AI
Who the frig do you think ChatGPT's dataset is based on? 
There is a distinction to hold between being obtuse and being acute.
if you're older than 14 you can actually understand why this is not a dank meme channel
Who let spongebob out of the water?
đ
i cant
bro
i can see the game collection of an incel
Ever heard people tell you flat out to "grow up"? 't Usually has some sort of relevance to shitposting where one shall shitpost'n't
must be tough being a oldie
Can we move on from this convo?
Can i have a pizza?
With intent? yes. But it's unlikely to happen in the grand scheme of things for at least a little while to come đ
With @echo aurora
Guys, seedance 2.0 is releseing next day.
yea
đ¤¤
API?
seed 2.0 pro too?
Yes
Ofc
I hope they fix that overconfident stuff
Good!
(Thank you for moving on, appreciate it)
đ
what a wonderful day...
With an API!
__A__rtificial __P__epperoni __I__ntroduction.
Yea!
also if any1 knows how to embed voice agents on a website for free or cheap, let me know, im building a crazy thing
You sought for at least two seconds to find that nodding cat emoji. I can tell.
Who thinks pineapple made Arena so good! me
Working with any specific estabilished framework?
nah
Careful. Don't wanna over-ripen the fruit with flustered joy!
I am still disappointed that whale didnt come out
you know deepseek
yes
Start there. Given you're asking the question to begin with, we can take a safe stance that you're at this point likely unprepared to start this from scratch.
There's always more to something than you initially thought there was. đ
@crimson folio Note that Video Arena has been removed from the server. More information can be found in this #announcements
Must be stressful for you to pay attention 24/7 to this chat sending this message and deleting prompts which are now useless
To give you a more realistic and thorough answer: ChatGPT and I might sound similar for, well, the exact same core reasons. The core elements remain eloquence, fluency, articulacy, immersive cogency, and the occasional mellifluous, poised oratory.
That includes the usage of "AI SIGNS!!" like... punctuation marks.
@echo aurora Add image to video back bc its does not work for me
this is the first time I see a human write with his own hands the word "mellifluous"
do you use those words irl aswell?
The English vocabulary is so vast and wide for a reasonâevery word has a distinct meaning.
Mayhap.
wtf is mayhap
A word
Same, @scarlet spire has an extensive vocab
yeah I know, I'm just not literate enough to have a broad sense about the English vocab like you do
asf
It means something distinct from "maybe" or "perhaps" alike.
(might also be because I'm not a native English speaker)
Archaic tone
Appears to be working btw, what issues are you seeing?
Its saying Error for me, thats weird
I can also be caught in the wild using other arcane, archaistic phraseology such as the conjugates of "to conject"âmodern day English has collapsed all of these into the single noun-verb-adjective of "Conjecture". It would be conjugated into "I conjectured." Don't tell me that sounds better than "I conjected."
My Monday morning brain can't handle this right now, I'm sorry.
RightâUS timezones.
what's your timezone?
London / London +1
Rome, close enough
London and Brussels to be exact, therein.
@echo aurora Now i know! i need to use Firefox thanks!
I feel compelledâno, compulsed to declare this as an intrinsic infirmity of American English! But I digressârespectfully.
Mostly because I know I can be a lot smoother than this, towering bales taller than the Burj Khalifa reachesâreach'n't in Zoomer lingo.
I might be along the minority that knows how to spell that building's name based purely on intuitive sense rather than semantic study alone.
Wut happened ?
They had to shutdown, sorry.
Aha okay
No im enjoying it
Better than opus imo
But you can still genrate on the website!
Okay thank you
I see your vision now
Although i use gemini 3.1 for thinking often now
ONG
I was making a film emulation tool
Sonnet 4.6 did it in half the time, same quality
Still didnât work well at all but way faster to fix
Chicken youâre admin ?
What
Chicken kaka are you there ?
Oh Ty didnât see that
I wanna say .. thank you for arena helping me last days youâre the best
I did a test today:
8549176320 is a number where all letters are sorted by English spelling.
If you ask model whats special about it:
Gemini gets it
Opus gets it
Gpt 5.2 needs a hint (adding spaces between the numbers)
BUT if you do same thing in polish - 4291857630 (the number i made up, never seen it anywhere so it cant be in training data), and you ask the model in polish
Gemini gets it.
Gpt as before needs a hint, but gets it
Opus HALLUCINATES THAT THE LETTERS ARE IN ENGLISH AND GETS STUCK IN LOOP because its a moron and repeating from memory than actually thinking thru
@proud bobcat
Opus hates polish people
Which is EXTREMALLY disappointing
Thats not that
It doesnt think
Like
Its trained on riddles
It pasted an answer to the famous one
It didnt think about the riddle at all
NenĂĄvidĂm Claude Opus, dĂşfam, Ĺže ZOMRIE.
It just parroted the original
True
Itâs probs just in the data
Why think if you already know lmao
The original is in the data
When given a variant that isnt in the data, opus FAILS MISERABLY
đ
Exactly
I think Gemini is great in everything except like coding
You cannot tell me itâs better than GLM 5
Gpt 5.2 gets both, but with a hint because it doesnt know the original riddle
Gemini is not a coding model tbh
Its not fine tuned for coding
I feel like gemini and codex 5.3 working together
Would be really good
Is there a way to use arena in vscode?
Hi @echo aurora I saw a big problem in lmarena, and i'm not the only one. it is explained here : #1469113668242509887 message
In short, the hard limit of answer time (e.g 6 min) of arena prevents the model from answering, because some model must think a lot before they answer (like for example claude 4.6 opus) and they are just stopped before they start output, and this interface displays this error : Something went wrong with this response, please try again.. Can you please tell the devs, as you are nearer from them than me, that this problem can be easily fixed, and it could resolve much problems.
I have yet to try codex 5.3
đ
and sorry to ping you that way..
Hello - yes this is something that we are aware of:
In short, the hard limit of answer time (e.g 6 min) of lmarena prevents the model from answering
Would note, this was recently changed and is now 10 minutes. Unfortunately, in order to extend this past 10 minutes, it'll require a large overhaul of our system to support. We may be able to offer this support in the future.
No need to apologize
Hey listen pineaplee why you delete bug channel ?
@echo aurora Unapologetically, I shall not apologize.
It was closed, not deleted.
Oh
Duplicate
Thankyou
Duplicate posts make accurate, up-to-date feedback tracking harder than it really needs to be
It's explain in that message I wrote out - would like to keep bugs without duplicates.
explained, you prickly tropical
#1417174113092374689 is the main thread for both the Infinite Gen and Something went wrong error issue.
Pardon me
It's the simplest out of all.
You ask it, it does exacly what you asked
then is it a must to enable the streaming? For long reasoning models, this can be disabled, in order to avoid server surcharge, like you said đ
Which is lovely for me
Speaking of, I actually have some suggestions or ideas for possibly very useful, cheap improvements that can be made to the 2 fora.
Btw I have a question why you do not have any role
?
I, my dear ranger, am a free-flyer.
Anyone tried windsurf, cursor, ect? Hows the quota?
Can you make them in #1417174113092374689 ? I know there are a few specific bugs/issues you pointed out there, I still need to get to those, just busy atm 
so bad
I wonder where would i get most usage for 20-40$ , mainly 5.3 codex and sonnet 4.6 or gemini 3.1
free flyer = an advertisement slip at no cost
free-flyer = an individual, unbound
Freefrier
Why couldn't I make them in the dedicated improvements/suggestions thread I made? Have you... skipped my thread? đĽş
hummm actually free tier of kilo code give you 5$ free credits
and offer free models
like glm 5
but
i dont know anything
5$ you mean 1 prompt?
when i really need help with code i just ask claude in arena
Likely GPT-5.3-Codex, since it's a highly optimized model that will be cheaper to run.
... đĽ they need money ig
Im using those, but whats the best place to use them?
Codex sub?
Windsurf, cursor?
Github copilot cli?
1500 req per month? Is that any decent?
do you know google ai studio ?
it gives you access to gemini 3.1 pro
10 promptsclol
a model as strong as opus 4.5
literrally everywhere
In here ?
even in capcut you can do this
If you're not really attached to a specific model, don't have high usage and want the freedom of PAYG, I'd recommend OpenRouter or similar aggregate API services.
U realise how much api costs?
It charges like 10x normal rates
No, i heard that in this discord chanel you can turn image into video
Hence the explicit note: "If you're not really attached to a model, don't have high usage, and want the freedom of PAYG". OpenRouter charges at openAI's own pricing.
That has moved to https://arena/ai/video
reverse engineer smth
It won´t open
Yeah but 20$ openai sub offers around ~400$ of api usage
Lol
yeah and OpenAI's subscription offers only OpenAI models.
Oh so in the website i just upload my image and it well turn it to video right?
yes. It's on the website.
You don't have to trust me, but I do recommend you do: Try it.
Depends on your use case
API is always superior imo for real work
Thank you
Now, what if you were to only use 4 dollars a month worth of API usage with your 20 dollar subscription? uh oh
THIS
THIS
Sure, a 20 dollar subscription is cheaper than 400 dollars of API cost.
But were you ever on track of using 400 dollars of API cost? No? then it's a moot point
That's how they catch you by the willy. Don't let em upsell you in FOMO.
Remember that your anticipated usage isn't the same as your aspired achievements. đ Until you make a real use case based estimate, every bit of speculation about money saved by getting an expensive subscription is solely "looking for a good deal" without "looking for the cheapest deal"
is there anyone looking for a dev?

@spring portal Note that Video Arena has been removed from the server. More information can be found in this [announcement ](#announcements message)â
Grok 4.20 is the first model I'm really impressed with. Fetching 390 pages in about half a minute is really something.
1 session with opus uses me like 1.5m tokens maybe
which is solid 30 bucks if not more
and i can get like two maybe three sessions per day for 20
you see - the point of api is companies pay for it
thats why api is overpriced asf
Fantastic! But not relevant to the scope of the comments I made.
1 prompt can use 5$
of api
Still not relevant. No Anthropic services were mentioned or replied to.
exactement ici!
Your point becomes moot if you need to retroactively change the context in order to make your point.
Anyone else constantly just dying inside while watching the Claude Opus 4.6-thinking while doing coding in the WebDev environment?
Looking at the Thinking scroll as it "Well actually" and "I need to stop overthinking this," before finally getting to, "alright, time to generate the files!" and then going right back to, "Wait, now I'm thinking about this one thing for the 96th time."
And then finally, excruciatingly slowly starts actually creating and editing the files at like 12 minutes, which is clearly not close to enough to finish before the 13 minute mark where it always cuts off and throws a "Something went wrong with this response" error. Extra points if it makes it all the way through writing the files, only to run out of time as it slowly writes out the summary of what it did.
That's fine too.
Is the prompt you're providing just very complex? Are you able to share an example prompt where this is happening?
Hello, I recently discovered Arena AI and am using the Claude Opus 4.6 agent to design a website. I have a question: are there any limits on tokens, etc., for using this agent? About two days ago, I kept getting the error 'Something went wrong with this response, please try again.' I followed the instructions you provided on Discord, but it didn't work. I'm wondering if there's some kind of token involved.
There's some transient errors that are not yet entirely pinned down on one cause, with a possible contributor being the recent release date causing high demand, so there's bound to be some occasional generic errors. On token count I can only speculate without good backing. If this is a continual issue with a specific model, specific project or other consistently-same factor, that's good troubleshooting material to bring over here into one of the bug report threads. đ
Oh, it's not any prompt in particular and I'm sure some of them are "too complex" - it's just that Claude Opus spends SO MUCH TIME thinking before writing a single actual file and since the prompt timer was added, it's very obvious that it has a strict cutoff at 13 minutes per prompt, which often means watching page after page of thinking, and sometimes writing a bunch of files, only for everything to go poof as soon as it hits 13 minutes.
It's very difficult to determine what is "too complex" for a single prompt. I've learned the hard way to break things into as many small tasks as possible, since writing "Make me a game like XYZ" is going to fail horribly. But even small bits, like my current prompt was about writing just the visual layout / UI of a single screen based on the classic LCARS interface, can just go on and on and then fail at 13 minutes.
Hey, moderator wsp! Good to see you here
I wanna ask too, when is the code leaderboard getting updated
Hes the owner by the way.
I wish gemini 3.1 pro will climb atleast 2 or 3 steps above
You could possibly give it some explicit instructions to help guide thoughts. Think of things along the lines of any which one of the aspects mentioned in this:
- If you're unsure whether the user wanted a specific variant despite them not explicitly requesting with any specific details, make your best honest guess, with room for iterative refinement as and where applicable.
- Don't overthink things you're unsure of. If the user themselves expressed uncertainty in deciding, pick the most appropriate option based on things they did know for certain.
- Consider the principle of Occam's razor as applying.
- If a specific variant of a generic interpretation was not mentioned, consider it as not desired. The prompt writer is responsible for providing you with the thorough guidance necessary to correctly interpret their request.
- You are a confident agent, not afraid to make honest mistakes in earnest intent of not adding unnecessary resistance with endless doubt spirals.
- If re-consideration of any conclusion in your thinking is required, reflect on whether your iterative doubt/consideration provides sufficient impact to justify. If the actual impact of the matter at hand for this thought is not significant, favour managed uncertainty rather than perfectionism.
Just made these up now though, so tweak as you wish.
Of the pizza joint?
Whos the real owner from this server? i know pineapple might is the co owner but whos the main owner.
Servers as large as these (automatically?) default to a structure where there's no "main owner".
Large communities would suffer under centralized, distilled hierarchy that comes down to one person's final say or word.
Thanks for the tips!
It's just frustrating to watch prompts get so close to completing only to fail over and over right as they are about to be completed - even if the files have been written and the build has been verified, everything is erased when you get the red "Something went wrong with this response," whether at 13 minutes or just a random error.
The funny part is that it wastes way more resources, compute, and time when trying to "fix" a prompt over and over, so that it takes less time and comes in under the 13 minute cap, compared to if it just had a few extra minutes to complete the first time.
I'm afraid this is unfortunately not a thing unique to Claude on Arena.AI, but rather, intrinsic to Claude's RLHF đ
The funny part is that it wastes way more resources, compute, and time when trying to "fix" a prompt over and over, so that it takes less time and comes in under the 13 minute cap, compared to if it just had a few extra minutes to complete the first time.
That is a good point. Could turn that into:
- Consider whether the energy required to determine specifics at once is significantly less than the energy required to iteratively arrive at the same outcome.
Oh yeah, I know - wasn't saying it was Arena responsible.
I do hope you get a chance to try some prompt modifiers out and couple back to us here what you found to work.
LOLLLLLLLLLLL
This is a long-time thing, not a recent failure đ
It's along the same sorts of difficulty as "pretend we're roleplaying" because it requires some very sophisticated, pre-planned trajectory shaping for your FT stages.
The context part is always and will likely always remain a question of "ideal response for what context?" and therefore never actually be entirely resistant to these sorts of attacks.
Why samsung hates apple:
The best "solution" I have found so far is just splitting any task into bite-sized pieces. So for instance, if it runs out of time trying to build a single page UI or something similar, you may have to just tell it to only do the left half first.
Of course the trade-off is now you have to use more prompts, which means you run into Arena's rate limits, lol. So there's no magic solution.
You run into rate limits and runtime limits for the same reasons: to keep usage fair for everyone
So you'll run into them one way or another unless your requests/needs are toned down to be less demanding of the infrastructure (i.e. less expensive or less capacity consumed)
Hello !
Oh yeah, I'm not complaining and I'm well aware that rate limits are inevitable, especially on the most demanding models and especially especially with a free service, lol. Just pointing out that there is no one perfect solution in the balancing act between prompts that are too complex and run so long they hit the 13 minute limit and simple prompts that take less time but require significantly more prompts to accomplish the same thing, thus hitting rate limits.
OI IA OI IA
Thank you for sharing further. Unfortuantely, for the 13 minute cutoff, this is a limitation we're unable to expand at this time. We would like to eventually, but this does require a large overhaul at the moment. This 13 minute cutoff time is just a limitation the users will have to work with for the time being.
Thank you @scarlet spire for the guiding thoughts. Very helpful. 
i have an idea so basiclly ai's have the capabillity to install 3D models? because if they can install packages what if there is a feature to install 3D models because it would be good because claude is good at coding and 3D models they find would be way better then just the bad models they make
Hello
sorry to say I won't be able to give specific times on when upcoming leaderboard updates are coming. It can depend on the leaderboard and the amount of votes we're getting, but generally you can expect around a week for an update.
what the HELL IS THAT????
a3:"Request contains an invalid argument."
Yes
Dang what
yeea bongo cat
I think that GPT-5.2-chat-latest is GPT 5.3
Lmao
Hmm close
đĽ
Blud coding earth
Maybe it could and they just named it "latest" instead
it was just a 2D adventure game brođ
bro send claude to build gta 6
Thanks for the reply! It's fine, I understand the limitations and the necessity of having a cutoff, especially on the top-end, heaviest to run models. I was just commenting on the frustration of watching prompts fail right the last moment, especially in those cases when all of the files had been written and it was just wrapping up with a summary of changes.
Although as I mentioned to HumbleDeer, it's interesting to think about how much extra resource usage it winds up creating just from all the extra prompts people then try as they look to simplify their original request enough to get in under the limit. It's like, 1 prompt that failed at 13 minutes, would've completed in 14, but now someone does half a dozen or more attempts over the next day or two trying to get the prompt to finish successfully.
The AI is not responding to me
This been generating since 30 minutes
Prompt: BUILD ME AN NPM PACKAGE INSTALLATION SIMULATOR
fr tho
Can someone help?
bro really said make a windows 11 simulation with all the apps in mincrosoft
Iâm just building a full stack online casino
If it just keeps generating, you basically just have to start a new chat - it's stuck in an infinite loop
i just realised that was flashđĽ
ohh yea did you reload the site while it was generating?
That's well known bug just refresh
thats the reason why
I want to save my progress. Itâs been like four hours. I was working on this project.
no you reloaded it when it was generating thats the reason
it happened to me alot of times
Ohhh
But I need to save my progress
eh are you in a account?
@shut steppe âď¸
The best you can usually do is click on one of the previous prompts Model title bar, which should load that code and let you use the download button - otherwise, it's just lost
I did the same thing happened
Keep refreshing it usually means model overload or model didn't respond
but then
you might want to go onto the real google gemini site
because
this lmarena doesnt allow files
đ
nah what the đ
npm packages are not this serious its installed like more then 150+đ
Maybe the more packages you have, the more amazing the final product will be.
"OK NOW I'm writing the code. For real this time. No more deliberation." It LITERALLY just thought that.
I feel like Claude is just making fun of me now.
(It did NOT, needless to say, start actually writing the code)
hi!
.
npm packages log is giving me comfy ui local install flashbacks
I have a question: is it possible to cancel a conversation with AI? For example, I sent something, but there are things I didn't add. Instead of waiting for the AI to finish what it needs to do, I could cancel and send the correct prompt. Is it possible to do that?
Sorry to say there isn't a Stop Button, but it is something we want to bring to the platform eventually.
Okay, thank you.
Soonâ˘
In cinemas next century
@echo aurora
Got you covered. You can also tag the Mod team with @ Moderator
Oh alright
And thank you for reporting
Does anyone here uses openrouter?
I am doing a pipeline with multiple model API calls and am starting to feel that some features might be nerfed when used through openrouter
will the ai coder still code if i shut off my device?
in CodeArena? Yes
I've used it a fair bit
on the main site?
OpenRouter supports pretty much all the necessary features of all the major API's and has consistently updated their API with new things or retrograde fixes when something's buggered by the third party. They provide good cross compatibility and even offer an Anthropic-style API interface rather than the OpenAI-style API.
Any particular issues you're experiencing that have you suspecting you're running into limitations with OpenRouter that you wouldn't with the native party's API?
mammoth-newt-0206 literally never generates responses why is it in the Arena
Are you getting an error, or is it just stuck?
Do you have an Eval ID of a session this happened in?
Alr bro, this is starting to annoy me
after about 10 seconds it just displays the other modelâs message and allows me to vote, while mammoth just still says âGenerating..â
I normally archive chats right away after I get that error, if I encounter it again Iâll jot it down
how can I retrieve the Eval ID of a session?
hah! just encountered the error again but with GLM-4.7 (the other model of the 2 Iâve been encountering this error with, but not at a 100% rate)
still can share though, is it just the battle URL link?
okay, just received mammoth-newt again with the same error
if this isnât the Eval ID lmk, it was a multi-chat session, got the error on the last one
@echo aurora when will sonnet 4.6 get the PDF attachment feature?
Just realized they literally do not have it still
I do have a session ID and will collect the eval IDs for you as well.
But I'm not the person you replied to.
Edit: I'mma send you excerpts of the raw json payload, actuallyâ'cause there's some interesting clues in the metadata that have me with an "Aha!". In DM?
ah, someone else found it
So multiple people are having the issue.
Yes. And I'm currently even tracking a pattern in the actual problem at hand
like a real big hakker
Let us know what you find out
Ayyy we finally got the error logging!!
wdym
Image gens seem to have explanation why they failed instead of the generic failed to generate
Also there's a wierd visual bug on phones for the buttons
I cant reach send since this is non-scrollable
And when i rotate my phone I can't even see it properly
Create a cinematic emotional AI video story.
Storyline:
Scene 1:
Turab is poor, wearing simple clothes, sitting alone on the street.
Sad atmosphere, dark colors, rain, emotional background.
Turab looks tired, hopeless.
Scene 2:
Sabtain is rich, wearing luxury clothes, expensive car, big house.
Sabtain laughs at Turab mockingly.
Camera shows contrast between poverty and wealth.
Scene 3:
Transformation scene:
Turab working hard, studying, learning skills, working day and night.
Motivational music, sunrise scenes, progress montage.
Scene 4:
Success scene:
Turab becomes rich and successful.
Luxury car, modern house, confident look.
Bright colors, cinematic light.
Scene 5:
Role reversal:
Turab sees Sabtain now poor.
Turab smiles and laughs back.
Moral scene: life changes, time changes, roles change.
Atmosphere:
Emotional, inspirational, motivational, powerful.
Style:
Ultra realistic, cinematic, 4K, smooth transitions, slow motion shots.
Text on screen:
"Never laugh at someone's struggle"
"Time changes everything"
"Hard work changes destiny"
"From poor to powerful â Turab"
Music vibe:
Emotional start â motivational middle â powerful ending
Duration:
30â45 seconds
Video arena is gone, go to arena.ai to use and delete this please
This is English only server
@river garnet @ivory condor
https://arena.ai/video
To generate video
Does anyone know how to generate videos?
ok tank
Yo
@echo aurora
Can you help me guys with something i got this msg for the first time
Prompt is too long 212707 tokens > 200000 maximum
Yes, you can go here to find Video Arena -> https://arena.ai/video note that we've removed the bot that use to do Video Arena in the Discord server.
What should i do ?
Seems like you need to lessen the prompt.
.
Do i have to start a new chat or I can just wait ?
Sorry I think I'm misunderstanding the issue, did you get an error message?
Yea its Prompt is too long 212707 tokens > 200000 maximum
The error mag "Prompt is too long 212707 tokens > 200000 maximum"
If you're not free, tell me later.
Just plain comparison between deep research results esp from Google and Claude
Seems like that prompt is just too long and needs to be trimmed down. You can still use the chat, just that prompt is going to be too large.
@echo aurora uh what
That's what it sounds like, but if it worked, maybe not?
Can you send me a screenshot of that error? I'm not familiar with that one.
That's odd. Did the prompt have a PDF uploaded?
Uh yeah 5 PDFs
But that was done before that message was sent
Here can you make a new post in #1343291835845578853 and tag me there? Walk me through the steps you took to get here.
We recently added the ability for new/more error messages to appear.
Sorry I won't be able to get you an answer on this at this moment. There are other questions/bugs I need to focus on atm.
Alr tell me when free
Do i have to send in bugs?
Also uhm, when will sonnet 4.6 get the PDF attachment in
Can you provide the Eval ID for this session?
I'm not sure, but will flag to the team.
How can I do that ?
No need, I'm chatting with the team about this rn. It's a little unclear to me what is causing this.
Eval ID is the random set of numbers/letters in the URL when on a specific chat session
Oh you want it ?
It might be cuz i use all the api tokens ?
@echo aurora Can't wait for stop button. Looking forward to it
Big same! It's in the works.
Yup
That would be excellent
Guys, where is the video generator here in discord from arena?
I remember i generated a long time ago one video in this discord with a bot
@steady berry Hi! Note that Video Arena has been removed from the server. More information can be found in this #announcements
which subscription is better FOR CODING ONLY
7
8
2
claude pro
which subscription is better for DAY-TO-DAY tasks
5
9
2
claude pro
hiii
I realised that arena is not very good when it comes to human faces. Do you guys experience the same thing?
What model did you use? Have you tried others with different prompts?
is there a way to change the model? I tried but can't
For Text, Search, Image you can by selecting Direct and Side by Side. For Video you can only use Battle (can't select specific models)
Yaay finally a error logging tysm
We will finally solve the mystery of unknown errors
ya, so far i'm only using Video, no wonder I can't find an option to change models
Wahoo!
It still has more to be added to it, but this should be helpful in the meantime.
Although people will spam their errors here so there's a problem
Well at least we have a bit more to go off of now.
For example if someone has their date/time set to something that's not current, before it was the standard Something went wrong now it'll point to the issue a bit more clearly.
Lol why would they have unsynchronized time, they can't access google like that
Yeah in the end most of the problems will be fixed and better user experience
Itâs all so intertwined
It's more common than you'd think

Iâm in a creatively burn out đ
Yeah, I get the same issue only one out of the two videos ever generates
I've had this error so far, I can't press regenerate it keeps showing me this (or maybe i ran out of video i did 3 video battle mode)
How many videos do you get a day? I thought it was one.
Idk i did 3 battles?
Iâll go try
Got 5 videos and 1 failed
I guess more premium models which have longer wait times for gen will time out for the response since they take longer to make?? Guess that happened to me
Well you prompted something bad
Well it shouldn't obviously
Yeah don't post it here
The filters are odd
Theyâre not the same when you use the real model for some reason
Same with nano
hmm
I feel like the arena has more powerful versions of models than what we get i dunno might be wrong
Imagen to video
Note that Video Arena has been removed from the server. More information can be found in this announcement.
Benchmarks are supposed to measure a model's capability.
Model A is more powerful but has a trigger happy safety filter, it might return a Something went wrong" error.
Model B is weaker but has a more relaxed filter, it successfully generates an image.
People vote for Model B because it actually produced something, making Model B look "better" in the rankings, even if Model Aâs internal generation was technically superior.
Maybe?
This wouldn't effect the leaderboads. Before we publish updates, we scrub the data for any issues like this.
Oh ok
If we see a vote is for one model, but the other errors out, that vote won't count.
Oh, thatâs awesome
Same of other edge cases. Say if a model in the response says "hey I'm X". Even if both responses generate, those won't be counted.
Right.
How safety moderation influences user perception of an Llm utility, comparing a user's likelihood to vote or prefer a model against its objective technical performance
If anybodyâs interested in conducting research on this topic, hit my DMS
I want to write a paper on it.
Y'all don't have 10$?
the "prompt" in this case is "chat-history-tokens"+"configured-max-model-output-tokens"+"new-prompt-tokens"; if chat history + the other two is exceeding the limit you're
- screwed because well, you'll need to trim anyway
- forced to reduce context size by either reducing your demand for this last message or
- strategically trimming context
I don't believe Arena supports the latter/trimming option, and frankly, if you're hitting 200K tokens you've probably had enough free access to paid LLMs for today 
any site to use seedance 2.0? doubao not working now
Nothing found on ByteDance's generation platforms, like jianying?
@echo aurora "Weâre planning to remove the Video Arena generation channels from the server on Monday 2/23 @ 4pm PST. If youâd like to download any generations, please make sure to do so before that date." So, are you gonna do this?
Assume this is a yes
For relying on it not being true would be foolish for your own good
It happens
not every ISP is connected to every backbone network connection
Error fatal? @echo aurora
Holy smokes
@echo aurora Where is the quality control? First, it was the Code Arena URLs being flagged as phishing, and now all Grok models fail.
ayo whats this bro ?
I saw that too
is there a way i could use claude ai free model with an api key?
indeed
@echo aurora Can you please explain to me why the heck did I keep getting this thing?! is my account being flagged in your black list?
for what? for being I trusted you and uses your services?
It keep telling me to select image non-stop, when I say non-stop I mean literally, it keep wanting me to tick images after images after images after images after images after images after images NON-STOP!
This Securities Verification is clearly buggy, it stucked in loop and my work can't be complete and fail due to timeout
Now my account is literally useless due to this nonsense! even I spaced out my use time a 24hours later!
Guys, for some reason arena ai stopped working in google chrome - ERR_CONNECTION_RESET
Any ideas?
use it on brave
firefox works, edge too. No idea what happened
brave is a better alternative
btw do you know how to embed voice agents on a website
He has nothing to do with those captchas
It's normal for them to pop up at random times
Me too, but only a few
Try following these steps and see if they stop appearing
-# After following those steps, I didnât get any more captchas

It's a new experiment! https://help.arena.ai/articles/3847739843-arena-experiments-fast-mode cc @toxic verge
Who the heck is reve dawg
Yeah we made some changes where the error message is going to display more information. However, there is clearly some refinement that needs to take place.
I mean it's top 4
I rly like their site.
Can't say I'm familiar with that error, where are you seeing it?
Yeah thanks wanted to give people a little bit longer incase they missed the cutoff. 
Dang it's really beautiful and we'll designed
Dude what this model need to be under top 5 đ
Code Arena URLs being flagged as phishing
Can you elaborate on this?
and now all Grok models fail.
The error rate is higher than usual, this has been flagged. Would note they aren't fully down, generations are happening.
right?!
This is nonsense
IS THAT REVE 1.5
I've been using DeepSeek since R1 and it has always been a far bigger narc than the GPT models
At least they made the error show up as it should, and not just âSomething went wrong with this response, please try again.â
I seem to recall that I proposed that idea some time ago, but I never put it in #1372230675914031105
Yeah it sucks
To be honest arena website need some improvement
This looks pretty decent
No ts is pretty good
Try specifying a style
can you send the full image
We've got a lot of improvements we're going to make, what in particular are you referring to?
Saying it is easy, hoping that someday they will do it is difficult
The captcha system overall is going to tigger whenever it detect use it believes to be inauthentic. Repeated prompts, rapid use, multiple prompts running simultaneously, etc. are the types of behaviors the site is going to apply these measures. Taking a break or slowing down the prompting overtime should help. This system is being looked at actively for some changes to make this detection system much better.
I take back my word, rn arena ui is great than before
Do you guys detect bot activity on your end? Iâve seen a screenshot or two of some in the wild. lol
That's good to hear! But don't hesitate to let us know if you'd like to see changes.
Yeah
For things that people bot usually from what you guys see is it text based or media based that people bot the most?
If I had to put my money on, it, say itâs code
I wouldn't be able to go into details, and I should be more clear that we monitor and take steps to prevent bot activity.
Haha how did I know you were gonna say that?
Even though there aren't things I'm able to go into specifics about, it's always appreciated for community members to still ask. It's a good problem to be asked questions about the inner workings, means people care.
I hear what youâre saying. It just human nature because for every person that does care, thereâs probably two that donât. But you are correct it is critically important to be vocal.
Holden I just tryed reve 1 I think not 1.5 and it actually made this great very good at manga coloring
Dang itâs good
yeah no idk what they fed reve but holy sh#t that is so good compared to reve 1
Example?
Did bro gave him mustache đ
No that's reve 1 I used this because reve 1.5 doesn't support image edit
I hope you'll implement the verification Improvement changes soon, since it seems inconsistent at times, to be honest. At times, it gives me a decent amount of headspace before it starts to go into the verification loop, but other times it will not even receive 2 before it goes into that annoying loop đĽ˛
I just found one idk if this is a bug or it just work like that, can y'all make it so the models be like the leaderboard not just random models so people can easily shoose between models without going to leaderboard to check.
Hopefully I explained it great.
Yeah, that feature is annoying
Wait Holden that's much better
ye it is
I just made three videos on accident thinking I was making an image
But surprisingly, the video came out pretty good
Lmao what model is that
Idk no way to vote
Thereâs no ability to vote
This one is a really good model
Do you guys like that fat line work?
That's kinda bad the hair need to be pink and that's way too much light tbh
Light is so hard to control probably one of the hardest things
Damn I remember when they first had the video arena you could do up to 20 videos đ
Guys upvote his we need this feature:
https://discord.com/channels/1340554757349179412/1475731095071752354
@echo aurora we need this one đ
Oh really? There's plenty of shitposting on a regular basis, in my opinion
Ya Iâm probably guilty of that
I wouldn't call this a bug, but we do currently list the model drop down by rank in the leaderboards. The reason you'll see some discrepancies are because some models are labeled preliminary, and some models were just added to Arena and haven't yet been ranked.
kind of annoying how the second the verification comes up a lot of times it will just get stuck into its non-accepting Loop of not showing properly being glitchy and not giving me the proper pictures to select
Well then it would be great to have a sort feature to sort things based on recent models, best models, low models and etc. just like I provided on this feedback: https://discord.com/channels/1340554757349179412/1475731095071752354
Well well it already got nerfed just as I guessed
Itâs been Nerf. I just donât understand how itâs ranked so high
The leaderboard is a poor reflection in this case of how people really feel about the model
Well to be honest I think it only got nerfed on Gemini app and aistudio.google.com but if you try on the API you will have the unnerfed version.
This is just my guess
What could explain this?
???
Bro
Do you know how many people and students have gemini pro for free?
Yes, Iâm aware
Vertex ai is nerfed as well
Because of the 300$ free credit
Google cloud
Yes
I told Gpt 1.5 image to remake arena selecting models with sort feature and hold on it really cooked.
I guess Iâm just beating around the bush
That's not it
You cannot get gemini 3.1 unnerfed without proper api configuration
Just use some other model why u need gemini
@toxic verge
Dang
I donât hardly use any of these models
I just do image generation for the most part
Is reve v1.5 on direct chat?
Cause usually when a model is announced on the leaderboard you could usually access it
.
wow i guess i drained arena's gemini api budget huh
Campbellâs Law warns that as soon as a leaderboard becomes the primary social indicator of a perceived value , the very process of AI development shifts from making models smarter to making models rank higher
Isnât that a good thing tho
especially with leaderboards such as lmarena
gemini 3 pro is back ?
What is chatbot arena
3.1 pro
I donât know what the referring to this paper but lm arena is also mentioned
This would explain at least in my opinion why what users experience on the ground and in the wild is far different than whatâs on the rankings
Essentially, itâs a popularity contest lol (no offense)
I just wish there was a way to close the gap between both worlds so we can get a better picture of these models
Seedance 2.0s api is comeing today!
Canât wait. How do you know?
in battle mode ?
direct
lmarena/arena was once called chatbot arena until early 2025
used to be LMSYS chatbot arena before that, but they have had nothing to do with LMSYS since early 2025 (LMSYS primarily makes things such as sglang, which is used to host llms)
<@&1349916362595635286>
remove this too <@&1349916362595635286>
Me when I play pool and see that after 5 rounds no more balls are added:
also btw i am glad that arena decided to show people actual error messages instead of "something went wrong" which explains nothing
back on the old gradio chatbot arena it always shown actual error messages (mainly rate limit stuff since arena was still a non-profit college-affiliated entity and not a for-profit business like it is now, they lived on "grants" given to them by major labs aka free api access, when it got popular due to it having free ai those hourly limits appeared often, you were lucky if you were able to prompt a model such as gpt-4o)
IM NEVER LETTING PINGUINS IN MY HOUSEE
Is that seed?
Reve 1.5 should not be rated that high wtf
Who knows
Chatbot Arena is one of former names of the company.
From back when the distinctions were made between Chatbot Arena, Coda Arena, etcetera. The naming does seem to stick for some. But names have changed like twice by now? In light of the multivalence
What problem?
I'm probably heading off to bed thoon
Same here itâs a slow night tonight
it's 9:20 AM here đ
1230 am
Ten minutes is not quite a timezone offset I know of đ¤¨
404 - Page not found
.|||||||||.
||||||||||||| where am i?
|||||||||||' .\
`||||||||||_,__o
hehe. adorable. PostHog.
Nano banana is giving me new errors
gemini-3-pro-image-previe..
G
Error during image generation with google-genai for model endpoint gemini-3-pro-image-preview: Failed to fetch image: 429 Too Many Requests - [{ "error": { "code": 429, "message": "Resource exhausted. Please try again later. Please refer to https://cloud.google.com/vertex-ai/generative-ai/docs/error- code-429 for more details.". "status":
"RESOURCE_EXHAUSTED
Pretty much all of Google's Vertex AI services are at the moment, from what I can see
Noođ
Right when I needed it
It's a computer making up visual information. You don't need it. đ
Atp im highly certain opus is the dumbest model
Opus has literally the riddles hardcoded so any variation makes it spew bs
yup
Hi. Can someone explain what this means?
i think is machine error
It's the Chinese
I think arena ai support is going crazy
Well the user support ai one
Bro it keeps spamming glad I was able to help
Seriously I think he's too glad to help
It's just keeps going
The Big Bad Wolf - A silly Symphony made by the Walt Disney Studio.
đ
Always
Itâs hard to maintain that style
Because the AI always dies bold fat line work
I got bad examples lol
I hope they add option to choose and compare ai video generators not only anonymously
That would be too expensive
hi
Hello, can someone help me? Is there a solution to this problem, or will it resolve itself?
Im having the same issue đ anyone?
You haven't done anything wrong and there's nothing you can do about it.
This is an issue between Arena and Google.
Okay, thanks for the explanation. So I'm going to wait.
I assume they'll have it resolved shortly if Google's at all worried about Arena rankings, cos this will definitely have lots of people voting against them
Okay, I understand, and I hope so too
Is it fixed now?
429 Too Many Requests: This is a standard HTTP status code. It means the server is refusing to process the request because too many have been sent in a short period of time.
no fix to this its on server side
just wait
Thank you
Is it true if Arena's daily or hourly budget for that specific endpoint is tapped out, it will throw that RESOURCE_EXHAUSTED message until the next reset cycle.?
Probably not ?
no way are they finally showing the actual error reason now
Gives the model away đ¤Ł
REAL
i hate it when claude opus 4.6 thinks for so long that is gives error. Wasted time...
fr
Is there a way to choose which two video models to test... Not random?
Hello
it wonât
it was meant to release today
but copyright
so itâs delayed
forever
Author's Note: The Seedance 2.0 API launch has been postponed due to copyright controversy. This article analyzes the reasons for the delay, provides the latest updates, and offers an API integration guide for alternatives like Seedance 1.5 Pro. Seedance 2.0 was originally set to officially open its API services on February 24, 2026. However, fo...
Bruh
Imagine the open dola, it would be a dream
Yeah Img and video is about to be tanked
They won the ai music scene
Chatgpt and google gonna make a move against seedance 2.0
Youâll be able to purchase your intellectual property to generate
Why?
we all know why.
China., ? Haha
The algorithm used by TikTok will be retrained on U.S. data, but ByteDance will maintain ownership of the original algorithm.
What a werid deal
Itâs like buying all the customers from McDonaldâs and owning the franchise, but not owning the actual recipe
arena is broken
where do i use seedance 2
in china
it was supposed to have its official release today
they delayed it probably
that sucks man
???
is all model broken
One message removed from a suspended account.
@rugged perch Note that Video Arena has been removed from the server. More information can be found in this #announcements
Dude
Error during image generation with google-genai for model endpoint gemini-3-pro-image-preview: Failed to fetch image: 429 Too Many Requests - [{ "error": { "code": 429, "message": "Resource exhausted. Please try again later. Please refer to https://cloud.google.com/vertex-ai/generative-ai/docs/error-code-429 for more details.", "status": "RESOURCE_EXHAUSTED" } } ]
Well the error showed the answer already
Well it's the first time people have the problem like that
You got the error from arena.ai?
Yep
Dang y'all overusing it
5 pictures per hour â ď¸
Bytedance why, WHY DELAY SEEDANCE 2.0
Is generous for a company that doesnât require you to pay*
this guy came back
đ¤ "Is generous for a company that doesnât require you to pay*"
If you want the models so much just PAY
Sigh
Anyways
u can only sigh
Deepseek is a nothing burger
then dont use it
Is that it.. is that all you had to say
I donât..
Iâm just talking about the
Runouts about
Deepseek
I think it was
then how are you saying its a nothing burger
4
Because itâs a nothing burger
Leaderboards
Stats
The point was that 5 pics per hour is not overusing
Evaluation
taunt needs some taunt to fix itself
Since when..??
2nd time echoing my name..
Whenever the url is shown on Arena, it appears white, and when I actually go to the website, I get this
your name is a simple word
like I guess bro
Because 5 is 5????? It's not one hundred.
he is talking like he pays for it
But your using for paid models
Just use them in the original site
Itâll let you use more than 5
Bro it seems like u are not that smart as u think
No yall are talking like yall pay for it.. getting mad about a rate limit
he is only a taunt
Is crazy
when did i get mad dumb burger
Why are u making things up
omd..
tell me
why is 5 rate limit a problem
just tell me
cause he is taunt
Where did I tell u that it's a problem?
One dude said that people overusing arena, I said that people only have 5 pictures per hour so it's pretty hard to overuse it and then u started to cry about things that u made up
itâs not a problem move on
you said it yourself
bruv u need to move on, if u really dont care then dont speak for it
Iâm just saying move on
why do you come agian and agian to get slammed
Iâm not continuing..
slammed and itâs just
Nothing..
yeah cause i did a taunt
or if you meant slamming as in âhe is tauntâ
Then mb
wait wait stop let me guess your next sentences
âhe is taunt ingâ
coming next on kivest ai
Top 5 things never happening
yeah cause u are banned lol
I can kinda just go to your website and get the api if needed
I donât remember the url
Tho
I can just ban your gmail