#general
1 messages · Page 268 of 1
yup
Last progress we made was in 2025
Early 2026 is pure nothing
Wth is this bro doing 💀
3 flash is like 800b 20b parameter model let it be!
Brotato chips
Most accurate one
3.1 pro is literally 3.0 pro pre nerf
damn
I'm trying to stop him but he still does this
the point of arena is to compare models
a lot of people are misusing the site
this is their current experimental solution to tackle that problem
I'm begging you to stop doing this task
Bro dude Gemini 3 flash
I cancelled it 6 times already
gemini welding a cube, 2026, colorized
create the minecraft for ai?
why?
i think google released gemini 2.5 pro rebranded as 3.1 pro lol
Yeah, it’s kinda buggy right now
I HATE THISS 🤬
If I wanted two models, I'd be in battle/side-by-side mode, damn it
same
its same model everytime
its just some system prompt that gets increased everytime
OpenAI is close to finalizing the first phase of a new funding round that is likely to bring in more than $100 billion, according to people familiar with the matter, a record-breaking financing deal that would give the startup additional capital to build out its artificial intelligence tools. Bloomberg's Annabelle Droulers reports.
More...
I feel like I'm getting constantly bombarded with captchas lately. Anyone else?
LOL happened to me also, i just gave up
"Please give me more money, I promise I'll make better AI models with strong ethical boundaries"
true
openai is gonna ghost out
I would
true and the captchas are wild too
The ai community will gobble up anything
Probably the easiest group of people to profit off of
oh you too right?
yeah
is pineapple is here?
yeah, honestly I think we need stronger whistleblower protections
the last OpenAI whistleblower didn't end up well
battle mode in direct with context getting given to the direct model is so bad
I feel like I'm getting constantly bombarded with captchas lately. Anyone else?
@echo aurora Opus 4.6 thinking is broken pls help
its been broken for so long
but why?
We’ll get really nice AI for like a month and then they will nerf
i feel lmarena giving errors purposely
yeah maybe or the model its just broken ?
i didnt had any error using claude somewhere else
I give Claude like 2 more weeks
nope, it's just LMArena's website that's not handling the model correctly
And that’s gonna fall off
dangg
yeahh ngl i wish Opus was free on web
https://discord.com/channels/1340554757349179412/1451574502369656842
well im not only having a that problem
they should just remove context of battle given to the direct chat ai model then im fine with battle coming in direct chat
@echo aurora
theyre busy nerfing it rn
And again stuck to the 3.0 lvl
last time it worked-
We can't make any progress
what website is this
aistudio
does anyone actually know why 3.1 was outputting the way it was?
is it like arena where all the stuff is free
I don't really feel they would take it down the next day and then nerf it, usually that is gradual and weeks ahead
I'm nerfing it so hard
it must be a backend issue
what
do u know what epoch is?
no
@tough stratus @opaque patio It seems that you want to create video or images with the Bot. Please note that the Video Arena has been removed from the server. More information can be found in this announcement. Do not spam text channels with prompts as it won't work. Thank you.
is it code arena limit?
i have a big chat and big project on it 🙁
need to continue this
its over
Yeah the best suggestion you can do to save projects is copy one by one into a new chat
Hi
real
oh
Same here
hello
Yes there is a limit on the code arena once you hit that it will say limit was reached please wait -amount of time
I’ve found Kimi K2.5 to be the most human and realistic
GLM 5 very good at prose in general
DeepSeek 3.2 only a bit behind the two but strong and very cheap.
i have free glm-5 api
Absolutely fire
I don’t mind paying for it honestly
For it’s insane performance it’s incredibly cheap
how
ai studio doesn't work for me :(
lies
mhm
even tho im using teir 1 paid api key
want proof?
i can give kimi 2.5 free to cool projects
gemini 2.5 flash xhigh
mistral large
100%?
the dumber ones are more creative
if i could upload js , css, .txt file it would be wonderful
mhm
I just
Gemini is the best
I just answered that
gemini 2.5?
alr bet dawg.
It’s free on lmarena?
OH
yeah gemini is free
I thought
only in lmarena
Ohhh I thought you meant api
Still I think Kimi is very strong
I don’t like Gemini’s prose
is this usable somewhere rn for free?
Just the most open ended model so people like it
is there a way to use nano banana pro for free or do i have to switch accounts
imo use gemini 2.5 for writing the story plot, then gem 3.1 for fixing stupidities
i have api for it too
its in limited access
gemini 2.5 flash?
or pro
@surreal zephyr
fr
or normal, whichever u prefer
basic gemini 3.0 pro would do fine too i guess
flash faster but even dumber
@ocean ferry does not approve of u asking ampro for ai guidance
if dumber = more creative, then clawd ai is the most creative one out of all /j
so you saying pro is better
you need to find balance
I'm just hoping actually wishing they will actually listen about battle in direct because almost every other debate about any of the other updates can debate on some level but this has real no benefits at least in the way they are implementing it 😭
between creativity, stupidity, prompt adherence, ect
the big issue for me is that it often corrupts the actual project
try, if you dont like try 3.0 pro. ect
fine damn good choice ( i meant that in the good way since honestly that's my default)
ok thanks bros
https://gemini.google.com/share/84327f5018a7 @golden ocean whatchu think?
ip grab
It pops up like every two prompt for me and I don't think I need to tell you all how disruptive that is and sometimes / a lot of times it's every other prompt make sure it's a whole hell of trying to do chat AI stuff
cube welder
I can't turn water into wine but I can turn you into mine
Kind of the same for me the more deception of the memory and context disrupting the whole thing ( i don't do projects but I know what you mean0 to the point that I will trust often and not copy and paste the same prompt after building so it won't give me the normal answer even though will give me a disruptive answer a lot of times I notice it will centralize/simplify the questionWhen I do it that way which just does not give me a accurate answer to be honest kind of hard to even correct that since every individual prompt risk just being disruptive again if it's a battle mode
wtf
are they welding @ocean ferry
they be welding a box
you can put him there
☠️
Yeah wow Gemini 3.1 pro is awful
It’s so slow
It took 10 minutes to decipher a basic cone surface area problem
it took 25 seconds to reply to "yo"
And then
It lost connection
Literally real
They did not cook with this
Nah even 3.0 pro never choked like this
At launch too
gemini 3.1 and lmarena battle in direct mode what a awful combo
3.1 was nerfed PRE launch
but its still really good
bigger nerfs incoming
Basic cone surface area problem
3.1 lost connection
ah
WHY ISN"T AI STUDIO WORKING
Hello
When is @echo aurora Going to add genie made by google
"would you like gemini 1.0 flash or gpt 1.0 corrupt your opus 4.6 project?"
lmarena is so doomed
Same, i hate battle mode.
its just over at this point, they giving battle mode at every 3rd prompt
and the models that they give are slow af
its funny when arena just pushes an unfamiliar model in front of you during battle mode
vidu, rcps-fast (or rpcs?), wan 2.6 pretending to deliver veo quality etc. 😅
its not even funny at this point
its so doomed at this point
I hope they get it sorted out
Yeah that is a issue that needs to be fixed @echo aurora
like i really dont want "seed-1.8" to deliver opus 4.6 quality
for the video generation part I think they just wanna prevent overloading the popular models
Starving for votes so badly your stomach’s growling? Guess getting them in BATTLE MODE wasn’t enough to fill you up, so now you have to come looking for more in direct.
some people will now come and defend lmarena
They won't
Damnn
Hello
Is it possible to export all chats?
true
aren't these supposed to branch off, not affecting initial dialogue?
Not able to made it
Yeah it shouldn't be triggering this often
they dont :)
the real issue is they break the original convo
Are you able to repro this every 3 messages?
the moment it appears, the orignal model cannot continue
Yess
U guys should fix it breaking the conversation
Don't send battle context to the ai
What do you mean by this?
It breaks the quality
has to restart from 0
Can I have the Eval ID of the chat session this is happening with?
exacly
Eval ID is the random set of numbers/letters in the URL
@echo aurora sry for ping, do you know anything about this? is this invisible limit or model problem?
I'm in mobile now, I'll try to give
Even though direct mode is selected, battle mode gets randomly activated within a conversation if ”Max” is selected. Not so good.
It could be, try the steps here to verify - https://help.arena.ai/articles/8931786544-arena-how-to-rate-limit?lang=en
any news about deepseek v4
oh, so i can continue work after a while?
Battle models shouldn't appear in direct chat; this is wrong and in bad taste towards users. I've reported this before; if someone wants two models, they should use the Battle and Side-by-Side models.
If it's rate limit, yes
and how long is it?
Exactly. But it seeems to only appear if ”Max” mode is selected. Or have you encountered it even if you’ve chosen a specific model?
all
Yes, it appears with other models
They want the investors happy
This is lame. This is a new function that they’ve implemented. Hasn’t happened before for me at least. I hope they revert.
Claude 3.7 Sonnet supposed to be dead but it isnt
It's been a while, they removed it, but apparently they added it back
if all the models are free on arena, how are they making money?
Bruv they don't make money
so how is this still up
@echo aurora so i'm checked dev tools network and it's not a rate limit
cuz no 429 errors
claude sponsors
it says on their article
You'll then want to try these steps - https://help.arena.ai/articles/1645798556-lmarena-how-to-something-went-wrong-with-this-response-error-message
also from what I get all dialogues become public in a matter of months
No we have a product, can learn more here: https://arena.ai/blog/ai-evaluations/ cc @quaint trail @river whale
@echo aurora
It's over till battle mode is fixed or removed
Pineapple want some chicken?
it still drops down when finished
so how much money does arena use for the API keys for these models
Dog
@echo aurora Just me or is the gemini 3.1 pro model down on arena?
Issue im having with: Codearena, Model: Gemini 3.1 pro
yeah it throws the cube away then starts again yuh
that was the prompt
Is gemini nerfed
i dont think so
Sonnet smokes 3.1 pro
antigravity is really bad ngl
Absolute cinema
I can’t find any
the testing is based on pre release
😂😂😂
nope
Bottom one missing tho
It’s amazing how clueless the models are on safety
Hhahah
Got 2 weeks before they nerf
Shii is fire tho
they unironically fell off then became top 5
1
no top 3 actually
is gemini 3.1 pro erroring for anyone else on codearena
by monday nerf
হাত
No, they know we’re onto them so they’re gonna extend it for a month
https://019c7799-1908-7f24-b3e9-f0fb5dd8f92c.arena.site/
https://019c775e-143e-7242-92b4-0d936da78f8c.arena.site/ and those two are wow
They wanna make sure they get as much of those 200 monthly subscriptions as they can
When i zoom out it disappears bruh
Before they start nerfing
BUT FIRE THOO
ehm defo not they just being generous
low renderdistance i guess
its usually day one nerf
Damnn
the infinite one is trippy
its a bot
@whole swallow see those
in my experience sonnet still fails at oneshotting code, thought it would be lower
If you want to try out the next version of Grok Search, use the search tab, and wait to find "arastradero"
It's 100% the new Grok-Search it has the exact same ultra specific problems with API that I usually get with Grok
for me sonnet is best one shotter lol
It's 100% Grok-4-2-search
GO MY MICHAEL SPHERES
I had opus oneshot way more reliably for sure
funny considering sonnet scores best at hallucination bench out of all
NO
Btw I don't get why admins made it "private" since Grok 4.2 is on public beta for two days, makes no sense to hide the models' name for this one honestly and it's so easily recognisable
maybe there is less free tokens for direct chats, than for claude?
It's already in the arena, and already has a ranking, in fact it was available there as an anonymous model even before the release date.
its long already added
don't worry it's fine!
my bad
Pretty good, but underwhelming compared to the bomb that we always get from Gemini. It being behind Opus & Sonnet is quite disappointing.
OH I HAVE TO USE IT RIGHT NOW
"behind opus" xD
Well it's one of the best models out there, there must be some specific usage where 3.1 is the best model on earth, but overall it seems like Sonnet gets the gold
⌇⍜⍀⍀⊬ ⟟ ⎅⍜⋏'⏁ ⎍⋏⎅⟒⍀⌇⏁⏃⋏⎅
whaT?
sorry to ask that ARE YOU F'N A ALIEN?
discord is reporting to ice cannot disclose sorry
💔
-# [Reply redacted for privacy reasons]
--. --- --- --. .-.. . / -... . - - . .-. / ..-. .. -..- / ...-- .-.-.- .---- / -... .-. ..- ....
-# [Reply hidden for Discord Gold non-subscribers. Click here to see less]
Try glm 5
Fails miserably
Wow, in my tests and in my use glm 5 is always better than gemini 3, I'll test this 3.1 like is very good then
This one with the VR would go crazy
Yea
woe
https://docs.google.com/document/d/1r-QPxJZ5pAAiY4sVahT2AVgm5Y4Tofyh8CLow8eAHXI/edit?usp=drivesdk
All 4 most impressive ones
Is it a common bug that the model freezes while generating a response? For me, it probably happens every few messages.
How do you do this
It only gives me crap
Good workflow ;)
captchas are getting harder and harder those days
Any idea on the model that gets you that error?
It's a known bug where models can be stuck in an infinite generation state, but it shouldn't be happening every few messages. Can you try the steps here - https://help.arena.ai/articles/8691588590-troubleshooting-infinite-generation
Also worth trying signing out and signing back in when this happens. I haven't personally see this work, but have heard from a few users this helps, so is worth a try.
Captchas are getting harder OR humans are getting DUMBER??
Captchas are more difficult
Hey pineapple is gemini 3.1 pro codearena broken?
Use capsolver
It doesn't look like it's broken. You'll want to try the steps in this article - https://help.arena.ai/articles/1645798556-lmarena-how-to-something-went-wrong-with-this-response-error-message
ill check out the error code actually
I'll forward to #1447983134426660894
"error": "Bad next turn request in Battles in Direct mode. DIRECT was requested, but expected a BATTLE turn | sessionId: 019c7aee-6558-7127-a3af-82575eae7e2b | userMessageId: 019c7c16-cf19-7a5c-87bc-1fda4f09f853"
...SON OF A
hey pineapple i got a question
will you guys include gpt 5.3 anytime soon?
Maybe the error got fixed? It's going through more generations now and not erroring.
theres still no API
OpenAI is stalling
ok im sorry i didnt know
also i just realised
if gpt 5.2 high is top 5, imagine where gpt 5.3 high is gonna be placed at
@echo aurora Odd issue, It either didnt record or i dont know. Randomly errored and network posted nothing
prob top 2-3
Want to note #ask-here
Imagine it’s literally worse again
5.1 high beats 5.2 high
Are you able to copy/paste the prompt in #1447983134426660894 and explain further?
is that true
@toxic verge going to ask we use #1447983134426660894
Ok
I checked network, and theres nothing for Gemini 3.1 pro on codearena, Ratelimits should post to network right?
hello
Lmao yeah
According to the leaderboard at least
They FUMBLED with 5.2 high
guys can the devs fix the timing out and erroring out 😭
Hello, I’m back. I tested Gemini 3.1 pro and I just gotta say I barely notice a difference.
Between 3.1 and 3.0?
If you're running into these problems I'd recommend checking out this message - #1417174113092374689 message
@echo aurora I think u already typed that in anocemets
I used to use 3.0, 3.1 update is barely noticeable
Typed what?
That Video Arena is Going to be gone
thanks, what i mean in particular is that in some longer requests it says "something went wrong", and i'm not sure if this is my fault, but most of the time, opus 4.6 thinking says it edited the files when it didn't and just "deployed the project" without changing anything internally
This is different. Prior announcements were that the Video Arena in Discord bot was going away. But the channels are currently still accessible in the archived section ( #video-arena-1 #video-arena-2 #video-arena-3). Today's announcement is explaining that these channels won't be accessible on Monday.
I'd say either the chat requests you've made are thinking/generation for more than 10 minutes, in which case this will error out. Or if it's not, then the same steps in the linked message apply.
TLDR -> try the troubleshooting steps, check if it's rate limit, worse case report to the team the best information so we can take a look.
But the steps in the linked message will go into way more detail.
thanks a lot
clayton: hello my name is clayton and im made of clay
Clayton is an absolute rockstar
Truly a legend at Arena
I'm really trying to get away from answering questions in #general
you guys should make a clay statue of clayton
It’s a good idea since I guess you see a lot of people still getting confused on why they can’t generate a video even though they see the channel
btw what server do you guys use for the emojis?
Yeah that's the intent
Hoping this leads to less confusion
there should maybe be a bot too
like I’ve seen that on other servers, when a bot detects a keyword or phrase
the bot would automatically respond with the answer
We have one in place, but sometimes it doesn't always catch everything
I haven’t seen it in action yet 😅
Well because you don't see it when it's working
You only don't see it when it doesn't work
ohh right it’s one of those “you can only see this message” type thing
@echo aurora #ask-here
Going to ask to not do this
I'll get to questions/pings when I can
what to not do?
shall I not ping you?
If you've pinged me in other channel with a question, please don't go to another channel and ping me there pointing to your other message
Oh okay


DO 4 Grok Agents outperform (GROK 4.20 BETA) a single Gemini 3.1 PRO?
I test my causal reasoning suite (nonlinear logic, scientific reasoning) on 4 agents on GROK (on their platform) against a single LLL on arena.ai.
Can a LLM (Gemini 3.1 Pro) on arena.ai win against 4 GROK agents in their natural habitat?
Google published the new AI model Ge...
grok 4.2??
sonnet 4.6?
i just went into arenas coding
every model
is not
what they say it is
claude opus 4.6 is sonnet 4.1
🙏
Why are so many people saying Gemini 3.1 sucks? For me it has worked wonders
cause it gives you outdated info
saying its 2024
like holy
real gemini vs fake gemini
good one @echo aurora
Nah its not 2024
you managed to fool everyone with your fake latest models
btw kling 2.6 is 2.5
🤡
Models offered via API vs what's offered on the model providers site could have different endpoints
Right. The models on our site are provided via an API
ok so this ver of gemini doesnt have search mode on
so it basically uses its cut off date or whatever the fuh it was called and uses what it remembers from that time period
every model is more stupid then the real site
then the real Gemini would be cut off
Ok, the Gemini in Arena isn’t search grounded, so it doesn’t the current year unless you go to the grounding mode, and select Gemini, so it’s not fake, it’s just not grounded
ok yeah basically what this guy said
but why does the 4.6 models not beat 4.5?
again
if it doesnt have search it just looks thro info that happened in january 2025 and tells you that
if youre talking about text arena ehh idk
SCORING OVER SORA PRO?
But I don’t know why it says 2024 when the knowledge cut off is January 2025, but I am assuming it’s because it’s right when 2025 started, so Gemini still thinks it’s 2024
honestly i dont even know
i dont use gemini anymore
Knowledge cutoff is quite a simple topic
telling you it is not gemini 3.1
also im pretty sure some models like anthropic models say that they cant search
brb let me fact check
All good, no worries
yeah
wtf is ppl
Sora did good on this one ngl
bytedance is cooking
Yea with Seedance 2.0 its kinda good
wtf is bytedance also
The creators of Seedance 2.0.
no
why
Note that Video Arena has been removed from the server. More information can be found in this announcement.
ok
Guys, does anyone know any sites with Open Claw AI Agents without restrictions?
Hola
I have a subjective feeling that Sonnet is stronger in claude.ai itself. Is it really subjective, and if no, is it same for Opus?
Why does this keep happening, there is no way to do anything about it, retrying does nothing. All I can do is leave it and move on with life
Have you tried starting a new chat? This is often an indicator of chat being too long (i.e. too costly on tokens)
You could try and ask Gemini to recap for a new chat for example
I'd recommend the steps in this article: https://help.arena.ai/articles/1645798556-lmarena-how-to-something-went-wrong-with-this-response-error-message
@echo aurora how's lmarena work-life now compared to before the release of opus 4?5
I'm doing a direct chat but keep getting a thing to choose response A or B like every three responses. Is there a way to disable that? It's really irritating. If I wanted that I'd use battle mode
@echo aurora will this benefit the lmarena team?
Introducing Claude Code Security, now in limited research preview.
It scans codebases for vulnerabilities and suggests targeted software patches for human review, allowing teams to find and fix issues that traditional tools often miss.
Learn more: https://t.co/n4SZ9EIklG
Claude Opus 4.6 or Gemini 3.1 pro?
10
18
1
Claude Opus 4.6
Is gemini 3.1 good in coding compared to opus 4.6
This is cool
Ofcourse not
Its worth an ask
Makes sense
guys i think gemini 3.1 pro is cooking. basically i sent in a prompt. it froze. and i forgot about it for a couple hours so yea
Pineapple is shipping announcements like theres no tomorrow haha
Gonbe 😡
what did i do to bro 😭
Not using premium ai
Sorry what do you mean by this?
for some time my website is not working, every time i give it a prompt the command break, is it happening to me alone?? 👀
Allowing teams to fix issues that traditional tools often miss
💀
I'm not sure tbh 
It's best for everyone if I stay away from the codebase lol
bruh why is it sometimes making me choose between two models in direct chat bruh
it better be correct cause it went against the other grok search and they had two entirely different answers
We are currently experimenting with the occasional Battles in Direct - https://help.arena.ai/articles/8949646387-lmarena-experiments-battles-in-direct
We are hearing a lot of feedback from the community about this and are exploring changes.
Guys, does anyone know any sites like Open Claw AI Agents without restrictions?
I'm already tired of looking
@echo aurora Bro when are we getting the Change Video Models mode, like fr
hey why when I try to attach a .pdf file (any size) into Claude Opus 4 - 6 It gives me this error message?
I usef battle mode wrote a prompt then got the message the prompt violates the policies, open a new one wrote the exact same prompt then it worked... wtf
Do we have a new best AI model, or do we have the downfall of benchmarks in general, as a way of capturing machine intelligence? Full breakdown of Gemini 3.1 Pro, guest-starring the new Sonnet 4.6, plus analysis from 7 papers/posts that will give you much needed context. Oh, and a new record on Simple Bench!
See I’ve been saying it
relatable 
I almost never read those in servers, unless there's some specific update I'm looking for
or like read them at least if I'm active in one place (server) for prolonged periods of time. Lately I'm somewhat less active here 👀
Have yall seen what pika released?
glory to anthropic
I don't think this is going to be related to the upload as a different error would appear, but would recommend these steps: https://help.arena.ai/articles/1645798556-lmarena-how-to-something-went-wrong-with-this-response-error-message
Yeah sorry about that, bit heavy on the announcements today
There shouldn't be anymore today though
(I think, don't hold me to that)
Why 10 minute limit exists on ai generating response just asking
You never know with these AI labs these days
It's a tech limitation
Oh the stories I could tell
Oh alright
bro battles in direct chat need to removed
Just give them your feedback man
We need independent benchmarks
Just give them your feedback be generous
The newest model get the most scrutiny
sigma from ohio
And post it a month later after the results to see if they Nerf it or not
I can feel the ram prices skyrocketing
We want critical feedback, it's important for us to hear these things
I can probably tell in 3 years 1 gb of ram will cost 5k$ atp
Go to Gemini Reddit. lol
What kind of feedback is actually the most valuable for the team?
Honest feedback
Well maybe not
I don't know but ram got really expensive
Ai made everything expensive but slop
Well obviously but feedback to how models are presented, to data presentations?
Sometimes the best type of truth isn’t something that could be articulated unto data point
Ai was made to make life easier. But no we got brainrot
Yeah, well it’s here now so we gotta deal with it
@echo aurora I resign from being a beta tester. This sucks
Stanford AI professor Bejul Somaia delivers a highly technical and historic keynote at the India AI Summit, calling for a unified science of intelligence spanning brains and machines. He highlights data efficiency, energy-efficient AI, quantum neuromorphic computing, and brain-AI integration, warning that we still barely understand how modern AI...
Look at this 😂😂😂
Everyone got it bro
Can I get small context? I don't exactly understand English well
You know the prom here is?
People comparing the human brain thinking to machine thinking
Oh alright
It’s like trying to think like a dog
A completely different way of looking at the world
Okay got it thanks
Not so much on things like model performance or things specific to the models, as the voting should reflect that. But overall feedback about our platform and things we can change we want to hear about.
It also helps when it's explained why. Saying "I like X" is helpful, but saying "I like X because Y" is more helpful.
I wonder why are some models are limited to battle mode only
What about more obvious things like slow AI responses? Thats one thing I've noticed every time I've used Arena, I know processing power is limited but waiting 13 minutes for a small response seems unreasonable right?
@echo aurora thanks for the previous help, but I just had to wait a bit.
Pineapple is typing something big I assume
Or tv
It's still good for us to hear this. A lof of the time it's things were already aware of, or things that are out of our hands to fix. But it's still helpful for us to hear as it makes it clear we're hearing this feedback. The last thing I want is for people to not bring up feedback/issues/etc with the assumption it's something already on our radar. Even though a lot of the time it already is, for the cases it's not that'd be a shame not to hear kind of thing.
Could you please tell me why 3.1 was removed from LMArena? Will it be brought back? Does it make sense to use it in Studio
It's still here
It's just at the bottom of the dropdown list
It's just at the bottom for some reason
loll thank you
🤣🤣
um..
Apparently its results are that terrible? If it’s dropped down, those who tested it – what are the pros and cons? Is 3.1 cool?
It's still a preview
Of course it's still in testing
Which I'm not sure why that's the case
will flag
Battle in Direct is so bad since it makes your chat sloppy since it uses the context of the battle in direct for future answers, and ruins the chat
I think most people who say its bad cant prompt well, it has amazing outputs on specific prompts.
Just start a new one
Does anyone else have this problem where suddenly after lots of code is generated (in 3 - 6 responses), the AI just gives up and a error comes up?
That’s gonna get annoying after a while
yes
Fair point. Especially going to the search to deleted archived chats
Like cmon I want to delete my chats immediately
I'm just asking if it's typical (that's why I asked if somebody else got this problem), not that I want to solve this issue 🤍
This what I mean
The reality on the ground is so much more different than the benchmarks
Man this server got 200k+ people yet chat seems so empty sometimes
It’s like two different worlds
Sometimes I wonder if people are looking at the same thing
Oh i just thought it would be smart to report the issue
I swear sometimes servers with 5k people seem more active
I really think prompting is a huge part of Gemini 3.1 Pro, whilest it has issues at heart (see Vending Bench 2 for example). Most issues dont appear if you prompt specifics. I've been testing this model quite thoroughly and have only has slight hickups whilest it managed to output some quite impressive outputs (Opus 4.6 or even better results)
It’s because the communities are heavily moderated
Hey @echo aurora , when are the contest results?
And people are so technical people are just keeping to themselves
Not banned but politically incorrect
Or whatever you wanna call it
People don’t take their view seriously
Ugh sometimes this site gets stuck in loops of Security verification for no reason
I wonder why do people hate on ai sometimes
And people act like they’re smarter and so it makes people less hesitant to wanna share their ideas cause they don’t wanna feel stupid
There’s a lot of reasons to hate on it lol
I don’t wanna sound like a broken record, but it’s a dirty industry, bro
And doesn’t do itself any favors
In the public perception domain
I guess you got a point
People use it how it was designed it’s working as intended
Yeah brainrot still seems not to go dry
Do you know I feel like if you really truly believe in passionate about something
You gotta be as equal as scrutinizing
Otherwise, you’re gonna get blindsided
And you gotta be able to see if it stands up to the scrutiny
And if it doesn’t then it’s bs
If this thing asks if I'm a robot one more time I'm gonna lose my mind
I'm gonna ask you myself
See this is what I mean
Are you a robot
😂😂😂
Beep boop
Welcome to the future
Sounds human enough
Where the machines scrutinize your humanity
I find that very disrespectful
A human having to verify themselves to a machine
Kind of makes me prejudice to be honest lol
We are so cooked
It’s gonna come to the public restrooms if you forget to flush, you’ll get a ticket in the mail
I haven't been able to use my chat for ten minutes cause it keeps non stop asking for verification
We don’t have to be, but yeah, we will be. I mean there’s just no way around it.
Yeah, that’s the only thing that really kind of bothers me. It is kind of super invasive.
Really high-tech and invasive
And imagine it gonna criticize us for how we poop bruh
it keeps generating image, how do i cancel? plug the wifi off?
it keeps generating image, how do i cancel? plug the wifi off?
Join us for the Royal Society Michael Faraday Prize Lecture delivered by 2025 winner Professor Michael John Wooldridge.Contemporary AI systems like ChatGPT a...
Just delete the chat ATP
is there any chance we're getting 4o back
hell nah
Ugh I tried a new chat and still getting the security loop. I'm getting mad
Login In and out
Take a chill pill I guess
Russians got gigachat. Definitely not gigachad
Kimi 2.5. Getting used to it, just not the same as 4o.
Opinion? Can't come up with proper names Social media wise unlike 4o. Seems generic
kimi is donation 💩
@echo aurora
Thank You
Thank You
Thank You
Is Anthropic from claude or something?

keeps popping out in my old 4o chats
All these scams
lol
AHHH
@echo aurora
FINAL BOSS
THANK YOU DUDE
Damn
I'm not moving from this channel for the next like hour
Scam attacks are getting too insane
💀
I'm not sure how they've been getting around the filter lately too
The problems that come with being a 200k+ community
The problem is I think a lot of these accounts have been in the server for awhile
Then, before each message you send, take an IQ test
But what if I self-ban 😭
Yeah that's the filter working!
The pineapple has a defensive position at the front
We need to embed an AI agent into the chat that will ban scammers
We have this in place, it doesn't always catch everything tho
I don't see any levels here. I dunno if you mean something else though
Gemini 3.1 pro is nerfed in app already
Like I meant lvl 5 to send an image
What the helly is happending
Ohh sorry, was thinking something else
Ai studio still has a superior version
It's okay
Mean people being mean
Thats the same XD
¯_(ツ)_/¯
@echo aurora why some chats it keeps saying "generating image"? i cleared cache, deleted cookies, nothing
Delete the chat from archives I guess
So there's a server called dr stone which is the community of the fans of the anime dr stone and before joining the server they have a question of who's the main character of Dr stone ( this is an important question ) and bots incorrectly answer and incorrect answers give u muted role
I'll have to check this out, do you have a server link by chance?
Have they mention anything yet about the i\direct chat
I do think this would help, but also a problem with these scams is they're essentially stealing other user's Discord account, a lot of these potential scam/bots could are already in the server sort of thing.
Sounds like it's still generating, no?
i have archived that chat, so i guess only delete remains? or can it be fixed by waiting?
@echo aurora dmed
Could have hit this bug - https://help.arena.ai/articles/8691588590-troubleshooting-infinite-generation
grok 4.2 is gone
@echo aurora is it possible to somehow remove this requirement for new users, I don't know how to get rid of it, specially since clicking on it brings me to the channel where we can no longer generate a video lol
Code arena is basically broken. I tried with multiple models such as opus 4.6 and Gemini 3.1 pro and always got this error: something went wrong.
ong ever since they added battles in direct everything’s working so much worse getting so much more errors frequently with no fix forcing me to make new chats, like pretty much everyone hates battles in direct chat but they won’t do anything lol
gemini 3 pro error 😤
Note that Video Arena has been removed from the server. More information can be found in this announcement. Video Arena is still available on the site -> https://arena.ai/video
You'll want to try these steps: https://help.arena.ai/articles/1645798556-lmarena-how-to-something-went-wrong-with-this-response-error-message
Not what I was asking 🙁 On Discord, for new members, there is still a "Task" they have to do (As shown in the screenshots). If I don't do that task, that black box stays there permanently until it is done.
It is something that needs to be changed in the Server Settings specifically.
It is specifically under the "Getting Started" tasks...
How big is the context-length of Gemini 3.1 pro preview in arena?
Does the arena have a limit, or does it just use the model's limit?
Is Gemini 3.1 the best google-model for immersive and realistic stories?
Have they now beaten Opus-4.6 in that area?
OH
Understood. Thank you.
No worries 😄
I LOVE CLAUDE OPUS 4.6 THINKINMG
@echo aurora my arena champion role please
same!!!
I LOVE it when it says “Something went wrong with this response, please try again.”
guys how to create video before they have
cwaude still thinking
Ai having a mental breakdown on trying to find a easier solution but ended up in a loop
Because it got moved
To the website
Welcome back! Sorry again about that 😭
I think I'm missing something here.. how do I get to generation of videos ??
why can't I see the channels?
Note that Video Arena has been removed from the server. More information can be found in this announcement. cc @raven quartz
where i can find
😲
how to create video?
@hollow ibex Note that Video Arena has been removed from the server. More information can be found in this announcement
ok thx
Where can I get my questions answered about this?
......... This is how many chats i have to go I got go through in a day in the span of maybe a few hours Max around 3. Keep in mind that all of these are individual chats🥲
Let’s see the results
fk him
WTF
YOU'RE CRAZY
Are you able to follow the steps outlined in this article from the If the problem continues section and let me know if/when a jamdev is submitted? Max shouldn't be resulting in that error. We'd love to see more information about this error.
It's because of the error that has been happening with that has only increased in frequency with the whole battle in direct chat stuff stuff
The Battles in Direct experiment shouldn't be causing an increase in errors. I assume it's the Something went wrong you've been seeing? Is this with a specific model?
Battles did create a new error tho
When you prompt your chosen model after the battle thing sometimes it doesn't work
So i have to retry until it eventually fails
(i send a message and it appears as I'm not sending anything i noticed that when i refresh the page)
☠️ did no one see the picture I posted like a day again that said something about increase amount of glitches like this ( i know the picture i show is not in battle mode but it's basically what happens and depending on what happens sometimes I can't even do a retry like what the guy above me just said0
And i keep sending something and refreshing the page until it says it failed to generate
Than i just do retry button and it continues
Could it be a screen recording with a link to the drive?
Yes, but if you could be sure to include the Dev Tools > Network Tab that'd be very helpful. The jamdev includes this which is why it's preferred.
Exactly how could I do that? (I don't know)
Wanted to cross post this message as there has been conversations happening in a few different places and I'd like for this message to travel wide -> For the Battles in Direct experiment we're in the process of rolling this experiment back. We plan to develop this experiment into a better state before releasing again (as an experiment), but the current version is being rolled back.
What browser are you using?
Chrome
- click the three dots (top right corner), select
More Tools > Developer Tools - at the top you'll see a
Network tab, open that - run a prompt in Arena where Max errors out
- you'll see a file that has the
Eval ID(Eval ID is the random set of numbers/letters in the URL) - open that and you'll see a Status Code &
Responsewindow.
Those two areas are really helpful to us to understand what is going wrong.
Reminder to only share this recording in the form in the this article, please do not share it here in this server.
Lastly, want to mention this process is not ideal and not okay. We recognize this is asking way too much for us to get better information on this error. We have plans to built out a system that can handle this much easier for the user.
M2.5 >>
I'm in hate so big, the glm 4.7 flash opus 4.5 thiking must be so smart but my laptop is a potato
Hello
I know how it feels
Here is a ryzen 7 7735hs and 24gb option, I don't know if this would run the glm 4.7 flash well
Check ram usage
I currently have only 16gb and i3 1215u, it works 100% for my use but it doesn't run glm 4.7 flash nor screwed
Maybe I should buy a notebook with 7735hs and rtx 3050, I don't know
16 GB of ram is not enough
At least 64 GB min to run "smart" AIs
Furthermore, if they are thinking models, ram usage is higher
Someone know why like 30% of my ram is allocated to my gpu ?
how can i disable or reduce this
you may be able to use a q4 quant with that as long as an insane amount of vram isn't allocated to your igpu
How y'all doing
16gb of ram though, don't try unless you want less than 1 token per second with a good quant or usable speeds but a terrible model
I already think the qwen 4b 2507 has some intelligence, the glm 4.7 flash no thinking iq4nl would be enough for me
Hi... how can i make videos?
Disabling isn't possible. Reducing it might be inadvisable especially if you'd be restricting it to prohibitively small sizes. Check your firmware setup pre-boot executable environment. Either the setup itself or another pre-boot ex env tool might hold some answers.
Would note if you're looking for the Video Arena bot it's been removed from the server.
The arrival of Chinese artificial intelligence startup DeepSeek comes at a time when US AI companies like OpenAI, Google and Anthropic have yet to see significant returns for their costly efforts to develop newer models.
Two years ago, OpenAI’s ChatGPT became the tech industry’s biggest product in years. Now, leading developers like OpenAI,...
Told you guy
yoo
we gonna have to wait more 6 months for it
Wheres geminine
ILYa was right
?
Data-hungry A.I. developers, which have already sucked up mass amounts of online information from the internet, are starting to hit roadblocks from website owners. Between 2023 and 2024, 5 percent of all data and 25 percent of data from the highest quality sources were restricted across major A.I. datasets, according to a study from the Data Provenance Initiative.
Less gains but more expensive
Look at the games from ChatGPT 2 to 3 all the way to four
Wheres geminine tho
This is a classic diminishing return To get a 10% gain in intelligence, you might need 1000% more data, but that data no longer exists on the "one internet" we have. At least the little hanging fruit.
Isn't this graph showing the opposite of that? That much of the largest task time success gains on METR are in the most recent period?
If Googleis spending $185 billion this year (which they just confirmed in their February earnings call), that graph is basically the receipt. It proves that the "Data Wall" isn't stopping progress it’s just making it incredibly expensive to "buy" your way through.


