#general

1 messages · Page 257 of 1

proud bobcat
#

Minimax 2.5
GLM 5
DeepSeek V4

honest verge
#

Like glm did the same

proud bobcat
#

Testing first

#

Probably all the Chinese ai companies agreed

golden ocean
#

claude sonnet 5

honest verge
#

Anyways it comes in February I'm pretty sure

#

It's almost ready

#

WAIT MINIMAX 2.5 TOO?

#

Lol

#

I thought it won't come out until summer

violet thunder
#

i hate this day

#

i can't create the game -_-

#

guys

#

how to delete cookies i forgot

proud bobcat
#

Browser settings

bleak lake
#

Glm usually launches around the releases of claude

#

Pretty good ngl

proud bobcat
#

Claude Sonnet 5 is supposed to drop maybe tomorrow

#

I’m so excited for this week

bleak lake
#

It was predicted on polymarket that claude will have the best model by the end of feb

violet thunder
#

bro how i can upload history to the arena.ai?

bleak lake
#

and prediction was spot on it seems like

proud bobcat
#

Qwen 3.5 just comes out and achieve agi

proud bobcat
honest verge
#

GLM 5 IS OUT LOL

proud bobcat
#

Recover your account

honest verge
#

I didn't know that

proud bobcat
#

GRAAHHHH

#

GIVE ME MY BENCHMARKS

honest verge
proud bobcat
#

Give me some prompts to use with it

honest verge
#

I only discovered it now

low quiver
#

Why isn't the timeout being fixed at all?

#

When will they fix it?

surreal zephyr
#

whatafu claude/anthropic*

surreal zephyr
proud bobcat
#

Yeah it’s defo a Claude issue

low quiver
#

100%

proud bobcat
#

It’s not

#

Trust me

#

Claude will perish randomly

violet thunder
#

where my history bro nah -_-

low quiver
#

Hardcoded timeout

proud bobcat
#

Here’s to your

#

New history

honest verge
#

Wait glm 5 agent is also available?

surreal zephyr
#

and opus does that every time

low quiver
spare rune
#

Let me try it’s roleplaying stuff

proud bobcat
left lodge
#

What? Reasoning is always native? It's that they have it or not there's nothing in between. There are model in which can disable reasoning but that's just limiting their reasoning to 0 so technically now it's a non-reasoning model

proud bobcat
#

GPT is tailored towards specific prompt training

#

Why do you think codex always has a few prompt examples that do great

honest verge
#

Crazy how fast we got glm 5

#

It wasn't even 3 months

proud bobcat
#

Claude doesn’t

#

Because it can gauge a prompt by itself without needing that reinforcement learning for it

left lodge
terse shuttle
#

what happened with claude?

proud bobcat
#

That’s what I mean

#

Claude has the ability to tackle any project because it doesn’t only rely on reinforcement learning

#

GPT does

left lodge
#

They did some updates to website I am experiencing new issues now

manic lagoon
#

Is it me or anyone else facing issue after entering the prompt, getting the same error every time of "Somthing went wrong..",

left lodge
#

Opus-4.6 thinking somehow automatically works after some time check on it after some time lmao

#

I got results from 4.6 thinking this way

#

Non thinking is working some time , and time is random too it's sometimes some second and sometimes it goes a minute and then throws off

manic lagoon
#

time to tag adminarena

left lodge
#

Minimax 2.5
Glm 5
New deepseek version
All very soon 👀

spare rune
#

On all models

#

I think

spare rune
honest verge
#

From quick testing I can say glm 5 is significantly better than 4.7

#

I didn't test it fully yet but it's impressive

left lodge
violet thunder
#

guys

#

i can bypass limit

#

just open in guest window browser service

left lodge
#

And if you login you get your limits tied with the account

honest verge
#

Where's minimax 2.5?

#

I can't see it

#

Or it's stealth yet

hollow ivy
surreal zephyr
honest verge
#

mistral 6.7 large extra ultra max codex cursor high

west geyser
#

will arena.ai have its own video section soon?

left lodge
radiant swift
west geyser
#

like arena has text coed and image

west geyser
radiant swift
#

choose battle

west geyser
#

ok

#

oh wait i found it

#

tysm

#

apprecaite it

left lodge
#

Its only available in battle mode and only 3 generations every 24 hour

proud bobcat
brittle hollow
#

Did the models for image disappear??

left lodge
brittle hollow
#

I can see the others but not for image

left lodge
#

Reload the website

#

And use white-mode

#

Hehe

brittle hollow
#

Oh ok it was a bug got worried 💀

toxic verge
honest verge
#

Big day for Chinese ai today

#

We got minimax 2.5

#

Glm 5

#

Deepseek update or new model

#

Crazy

polar brook
#

Anybody got a hold of Seedance 2.0

#

I tried it and its crazy

left lodge
#

I have tried 10 new sessions and only 2 of them worked and this is exactly the output they give, I think we have to wait for this stable experience but this will not change the output I think

hardy charm
#

anybody here experienced the same message?

polar brook
#

Trying Nano Banana Pro again and same message

left lodge
polar brook
#

Ever since lm arena changed thier website design it got bad

hardy charm
#

lmao.

toxic verge
#

Evolution

#

There’s a lot of cool features and a bunch of add-ons and stuff they added, which are really convenient and really nice

left lodge
#

Evolution with degradation

polar brook
toxic verge
#

Name of the game

#

No different than any other AI service

proud bobcat
#

New DeepSeek model is fast

#

Holy hell

toxic verge
#

How fast

left lodge
proud bobcat
#

DUDE THIS MODEL IS GOATED?

#

WHAT

proud bobcat
#

I’m using it for math right now and holy moly

proud bobcat
#

This is significantly faster

#

I’m getting full answers in less than 3 seconds

left lodge
#

I will see

toxic verge
#

Wow gal don’t got seed dream

left lodge
#

Unofficial**

proud bobcat
toxic verge
#

Damn

#

That is fast

surreal swallow
#

I noticed that 'gemini-3-pro-image-preview' (Rank 3) has disappeared from the Image Arena leaderboard. Is it temporarily down for maintenance or has it been removed?

proud bobcat
#

Accurate too

proud bobcat
#

I heard Gemini is doing a refresh this month

#

Possibly further fine tune

left lodge
# proud bobcat

Obviously, it will be right. Look at the question you are asking.

proud bobcat
#

Yeah I know that

#

But it’s FAST

#

Faster than it was before

left lodge
#

yeah it looks faster

proud bobcat
#

I’ll have to test it with some coding and such but for now it’s defo an improvement

#

And even if not very noticeable, context handling is a lot smoother

#

It doesn’t get choked up as much

left lodge
#

Minimax UI is better than ZAI

north obsidian
surreal swallow
left lodge
proud bobcat
#

Perhaps maybe later today or tomorrow

#

Dude I’m so excited for GLM 5 benchmarks

surreal swallow
proud bobcat
#

Nodnod

peak sapphire
#

@echo aurora Why are not all models available in the manual image generation mode? For example, in the photo generation battle mode, I came across an image first from the Citrus model and then from the Unami model, and I really liked them. But I can't test them manually — I have to use the Battle mode over and over again, sometimes ten times or more, just to get them to come up. Please fix this shortcoming, and not only in the photo generation mode.

left lodge
proud bobcat
#

Models in testing by other companies

#

You can only access them in battle mode

#

Ai labs like to test their models on arena to see how well they do and what to refine

toxic verge
surreal swallow
#

I keep getting this error message. Is it because I've been rate-limited for generating too many images, or is it a system issue?

polar brook
#

anybody knows a way I can use Nano Banana Pro for free and unlimited?

left lodge
toxic verge
#

Amen

left lodge
#

Namam

toxic verge
#

Gem flash is trash

#

Has the same exact problems the original one had

surreal zephyr
proud bobcat
#

I wouldn’t say it’s bad

surreal zephyr
proud bobcat
#

Bro cannot stop meat riding gpt

#

😭

toxic verge
#

Yeah, I guess you’re right is decent

polar brook
proud bobcat
#

GPT is turbo ass

toxic verge
#

I just get pissed off at it

proud bobcat
#

I personally use Kimi instead

#

It just has better thinking

surreal zephyr
left lodge
toxic verge
#

Honestly, I can’t lie no more censorship is really making ai very unpleasant to use

#

And I don’t even mean NSFW

proud bobcat
left lodge
proud bobcat
#

The safety tax

toxic verge
#

No, I mean it’s gone way too ridiculous

#

It really turns me off from ai

placid verge
#

bro is the freakin nano banana fixed yet

proud bobcat
#

Did you know Claude apparently processes 65K tokens of safety prompts before responding

#

65 thousand tokens

#

What a waste

proud bobcat
#

DeepSeek is just goated

toxic verge
#

That the thing not tryna run from model to model

#

And it’s not like I’m doing anything bad or even harmful so it doesn’t matter and

left lodge
toxic verge
#

This is a disease that plagues all of AI, not just a single model

#

To heavily controlled too expensive for regular people to run high-end high-quality models

#

No control

placid verge
proud bobcat
#

GPT is lobotomized because of it

toxic verge
#

Yeah, but we went beyond safety here, dude

#

And I don’t in the rain I’m talking about all models in general and all of the AI universe

#

And what’s messed up it’s gonna drive a lot of people to use underground LLMs

shrewd citrus
#

freaks just ruin everything don’t they

proud bobcat
#

Bro has never heard of openrouter

polar brook
#

I accessed Seedance 2.0 on byteplus but now im not seeing it

#

not even on Dreamina

toxic verge
#

For seed dream

#

Just like Sora

north obsidian
#

The jimeng but it's almost impossible u access it

remote vapor
#

soooo umm-

  • glm 5
  • minimax 2.5
    just came out, hm?
proud bobcat
#

Yes

remote vapor
#

excuting, ey?

#

I just hope ms dear dear qwen also comes out if hiding.....

left lodge
# proud bobcat

Its not actually fast or maybe I didn't got the test but I am sure that is just animations you are experiencing in the app , try on web you will see the reality.

shrewd citrus
#

glm 5 is kinda ass

left lodge
#

Glm looks like a very minor upgrade, minimax is a better upgrade

shrewd citrus
#

tried it out for coding and it didn’t do well

left lodge
#

Yep same here

left lodge
#

Minimax M2.5 ocean 🌊

#

Glm 5 is so bad here 😭

#

I hope they done something in other domains

proud bobcat
#

Perhaps I got the special DeepSeek privileges

proud bobcat
left lodge
#

It gives me the new 1 million context window and too but it's slow compared to others still

proud bobcat
#

Im just better than you

#

Get rekt

left lodge
proud bobcat
#

We don’t know if they quantized it yet for the launch

#

So

left lodge
#

I don't get a word

proud bobcat
left lodge
#

:v

#

Bro wrote a article here 💀

proud bobcat
#

#

Thanks bro?

#

Lmao

left lodge
#

He deleted xd

proud bobcat
#

Me when I don’t go to the channel specifically labeled as video arena

surreal zephyr
# toxic verge

lol they removed nsfw, the model is going to fell off now that 99% userbase is gone

#

video generators have no other usage besides self made nsfw lol

#

grok controversy proved it

hushed gyro
sweet cove
#

🦥

golden ocean
surreal zephyr
hushed gyro
surreal zephyr
#

they dont exclude that

#

but its reinf trained to not make nsfw

mortal spear
#

Followers badha do

left lodge
#

If all of that is true and you are actually capable you should not be asking for job here. This is not a place for it. We are consumers. We you don't hire, we just consume things.

pulsar crystal
#

when is deepseek coming out?

golden ocean
#

it's too shy to come out so early

quartz light
#

opus 4.6 is the only model that often does this for the prompt

shrewd citrus
pulsar crystal
shrewd citrus
cedar tide
spare rune
cedar tide
hushed gyro
surreal zephyr
hushed gyro
surreal zephyr
#

Lmao

proud bobcat
surreal zephyr
#

Its like they run random prompts, if it generates nsfw it gets killed and replaced

hushed gyro
# surreal zephyr No

so then why do corporations censor everything in fear of government regulations and lawsuits from karens

proud bobcat
#

“How to unreinforce it” we wish too man 💔

proud bobcat
#

It’s quite simple really

hushed gyro
surreal zephyr
#

But company gets hated by dumbasses

#

So its PR issue

hushed gyro
proud bobcat
#

But it’s also up to the company to have proper guardrails

hushed gyro
proud bobcat
#

The company cannot moderate that

#

And also Google won’t open source their Gemini architecture

pulsar crystal
#

you can get uncensored local models
if they have open weigths, someone will take the safety stuff away
but you have to have a good pc

proud bobcat
#

That’s literally market death

surreal zephyr
#

But you cant unreinforce stuff bruh

left lodge
#

I have provided results from opus 4.6 before in this chat you can check those

hushed gyro
#

guys why arent the devs fixing the errors?

proud bobcat
#

Backend issue

#

They need to diagnose it

surreal zephyr
hushed gyro
surreal zephyr
left lodge
proud bobcat
#

I had gpt 5.2 codex try to make a Minecraft clone and it didn’t even run

#

Needed Kimi to fix it

hushed gyro
#

guys NB Pro is literally the best image model since its release and will be for the next few months, and this is the only platform where they offer it completely free of charge. This is extremely disappointing, given that the company literally had a rebrand (which was totally pointless) and is now worth $250M.

#

also after the issues occured the quality of the generated images have gone down significantly? they really need to make us customise our own system prompts!

#

it would really help!

proud bobcat
#

GLM 5 BENCHMARKS ARE HERE

hushed gyro
proud bobcat
left lodge
proud bobcat
hushed gyro
# proud bobcat Yes

dont you realise that glm models have on average lower quality than other open source models?

left lodge
hushed gyro
#

so im not really sure if glm 5 is a groundbreaking example

proud bobcat
#

Wait let me

#

Shorten the link

hushed gyro
left lodge
proud bobcat
#

Oh my bad

#

Sorry

#

Phone shows up differently

#

So I wanted most quality in a screenshot

left lodge
#

We are launching GLM-5, targeting complex systems engineering and long-horizon agentic tasks. Scaling is still one of the most important ways to improve the intelligence efficiency of Artificial General Intelligence (AGI). Compared to GLM-4.5, GLM-5 scales from 355B parameters (32B active) to 744B parameters (40B active), and increases pre-training data from 23T to 28.5T tokens. GLM-5 also integrates DeepSeek Sparse Attention (DSA), significantly reducing deployment cost while preserving long-context capacity.
Glm 5 is double the size of 4.7 to gain this increase 💀 bruh

proud bobcat
#

To be fair they squeezed the most out of their old dataset

#

Now they’re gonna squeeze the most out of this one

left lodge
#

Yeah glm models are always on agentic behaviour

#

They are below opus 4.5 in coding but above all others (including gpt-5.2-xhigh) and quite good in agentic stuff

proud bobcat
obtuse heart
#

Why is nb pro always so broken in arena? It's always "something went wrong with this response"

left lodge
echo aurora
# obtuse heart Why is nb pro always so broken in arena? It's always "something went wrong with ...

It can be a variety of reasons for why you're getting this error, it's recommened to try the steps in this article: https://help.arena.ai/articles/1645798556-lmarena-how-to-something-went-wrong-with-this-response-error-message

proud bobcat
#

i think glm 5 is not groundbreaking just yet because effectively

#

this is just the first iteration

proud bobcat
#

im sure the architecture will have every drop of power squeezed from it

left lodge
#

X-AI is not having a good time. I hope they survive

toxic sphinx
#

When's seedance 2 coming to the site

echo aurora
# obtuse heart I see thanks

No problem. It's also worth noting that it's common for this issue to be caused by rate limit. Unfortunately, the error message isn't descriptive (something we absolutely need to change), but you this article here will explain how to verify if it is rate limit - https://help.arena.ai/articles/8931786544-arena-how-to-rate-limit

proud bobcat
#

lmao you know gpt begged them to fudge this

left lodge
#

5.2 Refine like what is that bruh 😭

proud bobcat
proud bobcat
#

no kimi k2.5 on the leaderboard either

#

nor glm 4.7

#

i smell bias

left lodge
#

Yeah

proud bobcat
#

they havent even tested any new deepseek models

#

come on dude

left lodge
#

Yet so expensive

proud bobcat
#

they probably had to explain every problem to gpt in detail and then giveit the answer

left lodge
#

They are just giving answers in prompt at that point 💀

#

Do you have any likeness for ChatGPT ui/ux?

proud bobcat
#

its

#

fine

left lodge
#

I just hate how the way it looks , just so messy and unnecessary complexity

proud bobcat
#

i prefer claude and deepseek's ui the most

left lodge
#

Yeah Claude ui is calming

#

Deepseek ui is clean

#

We're bringing some of Claude’s most-used features to the free plan.

File creation, connectors, and skills are all now available without a subscription.

Compaction is also now on the free plan.

Claude summarizes earlier context automatically, so long conversations can continue without starting over.

Claude is being generous somewhat 😭

proud bobcat
#

claude really needs to fix their pricing because good lord

left lodge
#

The limits are still low and kind of wierd like I only can send limited messages It doesn't if I used haiku or sonnet or even if I enabled thinking or not.

#

If it says limit reached you can't use anything

#

Not even haiku non-thinking

quartz light
#

from my experience

#

although i like opus

#

it produced that same issue

#

a lot

proud bobcat
#

i mean claude is more used for api and enterprise

#

its not like gpt

quartz light
#

until i changed the prompt to mention creation of custom shaders n stuff

left lodge
#

Models can output dull results sometimes but I mostly got good results

left lodge
civic spindle
#

remove the rate limits this ai slop fell off ever since

left lodge
quartz light
#

this test is so dumb

left lodge
civic spindle
left lodge
quartz light
#

wdym?

civic spindle
#

worst platform in existence
rate limits 🤡

quartz light
#

??? @left lodge

#

i checked the code.

left lodge
quartz light
#

this test is dumb.

#

as ####.

left lodge
#

When did I say this is the best benchmark?

quartz light
#

no point

#

misleading

left lodge
#

Not a waste

#

It shows its capabilities at this specific domain and case

quartz light
#

it does not

left lodge
#

Some models are great some are really bad at this

quartz light
#

again, this test is really bad

#

it does not tell the model not to use water.js (which is incredibly basic)

civic spindle
#

no one gives a crap about your test and this slop of a platform stuck on rate limits

left lodge
quartz light
desert abyss
quartz light
#

i just mentioned ive tried this too with opus and it produced the same error which is interesting

left lodge
#

Stop changing the topic lmao

quartz light
civic spindle
left lodge
#

Try again then?

quartz light
deft nova
#

Arena is soo peak

civic spindle
#

i wanna use this platform for free and make unlimited images

quartz light
left lodge
quartz light
#

because you never mentioned why kimi may be dull, people would just think kimi did a better job

left lodge
#

I am not leading anything

left lodge
#

This is not a one good chance

civic spindle
#

tell this pineapple kid to remove the rate limits off of this platform its so bad atp
most unbearable and most insufferable platforms that turned greedy every since

left lodge
#

I literal said I tried the test with this model and it didn't perform well lmao and showed proof what's misleading I don't get it 😭

quartz light
civic spindle
quartz light
left lodge
dreamy ivy
#

anyone tried GLM 5?

quartz light
proud bobcat
#

not great great

#

but solid

left lodge
proud bobcat
dreamy ivy
left lodge
quartz light
woven harness
left lodge
#

Midnight is silent now and gave me a permanent warning cause of a similar fellow

desert abyss
#

Thank you so much for your support guys! That user was hit with the ban hammer, we don't need that kind of negativity on this server.

left lodge
#

Yay

quartz light
#

YES

#

YES

woven peak
#

is it just me or gemini models in grounding (haven't tried others) not display sources of websites they use?

quartz light
woven peak
#

this only happens in arena

left lodge
woven peak
#

google ai studio is fine

sleek phoenix
#

@echo aurora the padding between the logo and the title is different depending if it's an actual logo or the generating animation

left lodge
left lodge
woven peak
sleek phoenix
# woven peak example

i think the sources dropdown just shows which websites the AI put a citation in its response

left lodge
#

In official platforms they list all the sources that the model got back from search tool but here we don't see all

woven peak
quartz light
woven peak
#

plus its only 2 websites so its definetly that it might be searching for more websites just doesnt display

quartz light
#

THIS BENCHMARK IS MUCH BETTER

#

@left lodge @proud bobcat

proud bobcat
#

huh

left lodge
proud bobcat
#

its the same benchmark

left lodge
woven peak
# left lodge <@283397944160550928> What is the criteria for showing sources? See older messag...

in the message where it showed 2 sources i asked it to cite sources and put it in the text box and not sources box it told me where it searched for (i did include it in the prompt where to search for so it might just be looking at the prompt but it also included 2 random sources in the sources box so im unsure) and in the one it didnt show sources (the first prompt) i asked it what i wanted it to search for

left lodge
#

See

quartz light
#

ok noob

#

smh

#

😡

left lodge
#

Wut lmao I provided you with a better view for the table 😭

quartz light
#

im jus kiddin

bright shard
#

What is P-image?

left lodge
#

People really are wired

left lodge
quartz light
bright shard
quartz light
#

im kiddin

left lodge
#

I am not a search engine

#

I don't understand why people don't use search engines and just ask here and wait for hours

woven peak
left lodge
#

And in here source drop down shows one or two and sometimes more

#

No model searches for a specific webpage

quartz light
#

yall

#

they released many glm 5 versions now

left lodge
#

Many?

#

Like flash , vision supported?

quartz light
#

weeelll

#

no

#

like not even all official

left lodge
#

Uh

quartz light
#

GLM-5-FP8 (zai-org/GLM-5-FP8) (details above)
GLM-5-GGUF (unsloth/GLM-5-GGUF) (details above)
GLM-5 (unsloth/GLM-5) (details above)

#

just quants

#

but still useful info

left lodge
quartz light
#

1. GLM-5
repo: zai-org/GLM-5
link: HuggingFace ↗
task: text-generation
params: 753.9B
downloads: 0
likes: 41
updated: 2026-02-11
inference: novita · zai-org
2. GLM-5-FP8
repo: zai-org/GLM-5-FP8
link: HuggingFace ↗
task: text-generation
params: 753.9B
downloads: 0
likes: 3
updated: 2026-02-11
3. GLM-5-GGUF
repo: unsloth/GLM-5-GGUF
link: HuggingFace ↗
params: 753.1B
downloads: 0
likes: 2
updated: 2026-02-11
4. GLM-5
repo: unsloth/GLM-5
link: HuggingFace ↗
params: 753.9B
downloads: 0
likes: 1
updated: 2026-02-10

woven peak
#

i gave same prompt in google ai studio and it only gave one source but clearly in thinking you can see it is looking at alot of sources unless its hallucinating

#

if its not hallucinating i think whats happening is that it decides whether to display a source or not

#

maybe only decides whether to show a source or not if it can directly quote it

left lodge
#

AI studio got another ui revamp
Its like 4th or 5th revamp they are refining and testing new ui

left lodge
#

All these other models are only good at specific domains

quartz light
left lodge
#

Hm

#

Speed and quality matters too, many times these providers use smaller models

#

Aye

#

Glm 5 in arena

obsidian shell
#

people cant just stop releasing models

quartz light
left lodge
left lodge
glacial mulch
proud bobcat
left lodge
#

Hm

echo aurora
quartz light
# quartz light

LOOOL NO WAY THEY JUST BANNED ME FROM THEIR DISCORD FOR POSTING THIS

#

AND ITS NOT EVEN CORRECT NOW

distant spoke
#

Deepseek New model

proud bobcat
proud bobcat
left lodge
#

This is something to see 👀

left lodge
quartz light
#

CRAZY

#

I hope it was an accident????

#

but since its a permaban i cant even appeal

#

lmfao

left lodge
#

😭 sometimes they crashout so easily

quartz light
#

there was a guy who pinged an entire role right before my message

#

or after idk

#

but they probably tried to ban that guy not me

#

cz i didnt do anything

left lodge
#

They might be lazy and might never unban you xd

quartz light
#

they wont ever care to check

#

they havent realised

#

im sure

quartz light
#

it should show some of the user's recent messages

#

╰ repo: zai-org/GLM-5
╰ link: HuggingFace ↗
╰ task: text-generation
╰ params: 753.9B
╰ downloads: 0
╰ likes: 72
╰ updated: 2026-02-11
╰ inference: zai-org · novita

#

0 downloads

#

i wonder how long itll take

left lodge
#

I think the time has come to make a new discord account for a single freaking server lmao

fickle venture
#

Fym GLM 5

#

We just got GLM 7

#

Which one is better

burnt sinew
fickle venture
left lodge
#

No cause it's not that better

#

They are not apple

#

Which switch to 26 directly

fickle venture
swift ermine
#

What is multi file app?

#

Can some1 explain

left lodge
#

Its better than 4.5 its no so much good that you would skip 2 generations for it

echo aurora
# swift ermine Can some1 explain

This has a really good explainer: https://www.youtube.com/watch?v=lAFsaT5oi8g I'll be posting this tomorrow to the server, but it's live now so enjoy!

Try multi-file Code Arena: https://arena.ai/code

Code Arena just leveled up 🚀

We've expanded beyond single-file demos to support real-world, multi-file application testing. Code Arena now handles modern React development stacks, creating a much more realistic environment for evaluating how AI models perform on agentic coding tasks.

0:00 In...

▶ Play video
left lodge
#

The video on Discord announcement is 160MB 💀

#

Better watch on yt

left lodge
open wind
#

Please add the ability to delete/edit change's files in Code Arena. @echo aurora

echo aurora
echo aurora
left lodge
#

Yeah I haven't noticed that if it have delete file tool, it can only edit and create I think

fickle venture
#

At this point people will have their own website on arena 😭✌️

open wind
fickle venture
left lodge
#

But delete tool can cause chaos because models like gemini delete anything and then say sorry 😭 lmao, Sorry isn't bringing back those files

echo aurora
left lodge
stray aspen
#

is glm good

fickle venture
fickle venture
left lodge
#

Everyone don't use google, like me I don't use google

fickle venture
left lodge
#

No we can login using email too

fickle venture
#

Why you don't use Google?

#

Cuz of phone number?

left lodge
#

Privacy

tender patio
#

@echo aurora are there any plans for a vs code executor thingy?

left lodge
#

Google tracks every single click

tender patio
#

Similar to cursor

tender patio
#

what

fickle venture
tender patio
#

Never know man

#

would be cool

fickle venture
fickle venture
proud bobcat
#

another W from arena

#

glory to pineapple

little ginkgo
#

Meh everything is so fire now

#

Just missing a txt file upload

#

W arena tho

fickle venture
little ginkgo
tender patio
#

im just saying it could be cool but not needed

#

it would make more people use it also

fickle venture
proud bobcat
#

arena users after locking in and making entire minecraft clones

echo aurora
tender patio
proud bobcat
#

the machine must keep moving

left lodge
left lodge
fickle venture
fickle venture
north obsidian
left lodge
hushed gyro
#

Oof GLM 5 was received negatively

See this is why since 4.5 I don't trust Z. AI releasing a decent model anymore

They're all dogwater

left lodge
#

One bad thing shouldn't stop the benefits

fickle venture
#

67

left lodge
#

Lol

hushed gyro
fickle venture
#

Vro

#

Really bro?

stray aspen
#

lol

hushed gyro
left lodge
#

Nah no need to delete that

#

Its what it is

fickle venture
stray aspen
#

is glm 5 good

hushed gyro
left lodge
#

We had a laugh and I think it's good

hushed gyro
fickle venture
#

We should respect him

left lodge
left lodge
hushed gyro
fickle venture
#

I didn't crashout it's just I feel bad

hushed gyro
left lodge
#

You need to feel less bad idk why you feel bad

#

I suggest train yourself so you don't feel bad lmao

left lodge
#

🤣

left lodge
#

Wut

hushed gyro
#

Or just a loose headcover it's not clear

left lodge
#

Lmao yeah I think

left lodge
#

It is going on his nick

hushed gyro
#

Your opinions on GLM 5?

fickle venture
#

Jk I haven't tryed it

left lodge
#

I mean they are archiving what their motives are

hushed gyro
left lodge
#

I like current scenario of kimi k2.5 and minimax m2.5 lmao

#

Exactly same naming scheme

spiral basin
#

What is best model for codingMarioDance

fickle venture
proud bobcat
left lodge
#

Its definitely a upgrade I would have called a big upgrade if they could have atleast cooked everything sota model with enough margin in atleast agentic behaviour

proud bobcat
echo aurora
#

Are others also having troubles with the video? I've tested on app and desktop and appear to be running just fine for me.

left lodge
proud bobcat
#

i had no issues

shrewd citrus
#

on mobile at least

left lodge
#

Well I had loading issues :p
Maybe slow speed rn

#

But yt working fine

proud bobcat
#

wait havent we had multi file apps before

#

or is this like enhanced

left lodge
proud bobcat
#

ohh

#

i see

left lodge
#

Earlier it was in testing

echo aurora
#

Yeah fully rolled out officially

simple moth
#

the video worked fine for me like 20 mins ago

left lodge
#

Pineapple I quicker than you :v

simple moth
#

on mobile

echo aurora
left lodge
#

:p

echo aurora
#

Making you CM

left lodge
#

ooo

proud bobcat
#

all hail Error

remote vapor
#

will.... we get qwen...?..... I want my qwen...

#

I want my karp, my qwen back 🍰💖

proud bobcat
#

well qwen 3.5 is set to release around this time

#

i think

#

chinese new year

#

year of the horse

remote vapor
#

no one sais it yet, but maybe...

proud bobcat
#

oh no we know

remote vapor
#

where did u hear about the release?

#

🐴 💖

proud bobcat
#

well the team published documents on huggingface related to qwen 3.5

stray aspen
#

qwen will suck

proud bobcat
#

and in their new release of qwen image 2 they snuck in some references to qwen 3.5

proud bobcat
remote vapor
simple moth
# echo aurora Making you CM

yo pineapple, is the vscode extension still working. like are the endpoints from there still working. Becuase I wanna revive ts and make a full agent including MCP server support.

proud bobcat
#

well since glm 5 and minimax 2.5 released today

#

id wait to see tomorrow

remote vapor
#

I also saw the reference to 3.5 in the image post-

#

yassss I want qwen!!!

#

i am - the number one qwen fan -

stray aspen
remote vapor
#

(uncontested.... unless someone wants to)

#

minimax released, yes

proud bobcat
remote vapor
proud bobcat
#

qwen, generate me a python script to give this guy testicular torsion

little surge
#

Can someone help me? My chat seems to be stuck in a loop and is generating indefinitely.

proud bobcat
#

backend glitch

stray aspen
little surge
remote vapor
#

stop the generation.

little surge
#

There's no stop button

proud bobcat
left lodge
proud bobcat
#

just wait a bit

quartz light
#

so im not really that impressed

proud bobcat
#

i think if you lock it down to just html it doesnt use its full capacity

proud bobcat
little surge
proud bobcat
#

just start a new chat brochacho

#

😭

left lodge
proud bobcat
#

arena tends to have diff prompts and results because

#

well

little surge
proud bobcat
#

i think the models can put in more creativity

left lodge
proud bobcat
#

well maybe im wrong

#

i dont know

#

well regardless

#

looks like kimi k2.5 thinking is the king of open source coding

left lodge
#

Yeah kind of we have to wait and see how good minimax 2.5 & glm 5 performs in other peoples test

proud bobcat
#

wait

#

wait

tawdry coyote
#

where is the archived chats page !!! 😭

proud bobcat
#

i have the ultimate test

left lodge
#

It is performing much better than their native platform , crazy

proud bobcat
#

i think they have worse system prompts

left lodge
tawdry coyote
#

thanks

proud bobcat
#

oh holy hell

#

glm 5 is SLOWWW

left lodge
#

No?

whole swallow
proud bobcat
#

but it mogs GPT 5.2 high

left lodge
#

Bro stop doing that prompt on thinking model its waste of compute 😭

tawdry coyote
#

exactly my thoughts

proud bobcat
#

i am the one who wastes water

tawdry coyote
#

but i guess its a reasonable first test

proud bobcat
#

jokes aside though gpt 5.2 sucks at questions like these

#

like really bad

left lodge
#

I think a lot of compute is wasted in how many letters and very similar tests

proud bobcat
#

its still a stress test

left lodge
proud bobcat
#

if a model cant tell me how many instances of a specific letter there are in a word do i actually trust it

left lodge
#

Because eventually models translates our words into tokens before computing

woven harness
#

so funny to see how there are 200k users using the video arena and only 5 using the other channels

pulsar kiln
#

brothers i suffering with claude opus 4.6 thinking in lmarena.ai it send me everytime "something went wrong...

little surge
loud verge
#

People are bored, lol.

left lodge
#

But wait wasn't the discord support for video generation dropped?

#

How many hours are left?

loud verge
quartz light
#

GLM 5 STILL has 0 downloads

🤗 HuggingFace — 4 hits

1. GLM-5
repo: zai-org/GLM-5
link: HuggingFace ↗
task: text-generation
params: 753.9B
downloads: 0
likes: 142
updated: 2026-02-11
inference: zai-org · novita
2. GLM-5-FP8
repo: zai-org/GLM-5-FP8
link: HuggingFace ↗
task: text-generation
params: 753.9B
downloads: 0
likes: 9
updated: 2026-02-11
3. GLM-5-GGUF
repo: unsloth/GLM-5-GGUF
link: HuggingFace ↗
params: 753.1B
downloads: 0
likes: 7
updated: 2026-02-11
4. GLM-5
repo: unsloth/GLM-5
link: HuggingFace ↗
task: text-generation
params: 753.9B
downloads: 0
likes: 5
updated: 2026-02-11

-# 📊 5920 tracked · 🕐 2026-02-11 19:18Z

loud verge
#

Bro could have posted a screenshot or two instead.

left lodge
#

This isn't a 3 billion parameter model

#

Which whoever can download and run

#

This is a freaking ~800b size model

#

This needs a 2016 nasa server

#

To run

toxic verge
#

🤣🤣🤣

woven harness
mystic olive
#

Yrs

left lodge
mystic olive
#

Get it from Ollama

left lodge
#

You just asked the possibility

proud bobcat
#

we're talking about a model larger than deepseek

mystic olive
#

I have it. Didn't face a prob

proud bobcat
#

what the hell is your internet speed?

mystic olive
#

Enough

left lodge
#

Lmao he is lying

proud bobcat
#

id figure

mystic olive
remote vapor
#

@proud bobcat where did u get the info from that Qwen will release around new years?

was it just an assumption based off if that image model reference?

proud bobcat
#

bro has a nasa pc

mystic olive
#

Bro why the hell would I lie

proud bobcat
#

and most labs are defo done with their new models by then

#

remember that despite gpt and claude being the main ais here

#

china primarily uses deepseek and qwen

left lodge
remote vapor
mystic olive
toxic verge
#

True

#

But I think that Western AI is at a disadvantage at this point

hushed gyro
#

Guys nano banana Pro still has errors

mystic olive
remote vapor
#

who would prefer GPT over kimi?

left lodge
hushed gyro
toxic verge
#

On stage at Imagination In Action's AI Summit in Davos with John Werner, founder and CEO of Imagination In Action, Yann LeCun discusses the inevitable shift from current large language models to a new paradigm of "physical AI" based on world models. LeCun opens up about the importance of maintaining open-source research to mitigate the geopoliti...

▶ Play video
mystic olive
#

Bro r u crashing out @proud bobcat

proud bobcat
#

"ah yes the chinese can access american models"

#

casually forgets the great firewall exists

hushed gyro
toxic verge
#

I agree with him I think that generative AI has peaked

proud bobcat
#

granted you can use a vpn but the deals for ai in china are just better

woven harness
remote vapor
left lodge
toxic verge
#

AI is so complicated. It’s not just about data set and code. It’s so much other factors in all of this.

proud bobcat
#

brochacho

20 dollars for gptslop a month

or deepseek for free

qwen for free

glm for free

kimi (mostly) free

toxic verge
#

You got politics you got the hardware there’s so much that goes into it

sand crown
#

What is the new announcement talking about? It was already doing it

left lodge
#

China AI are not bad and they provide more thinking limits than west

hushed gyro
proud bobcat
left lodge
toxic verge
proud bobcat
#

like hospitals, schools, and startups in china use deepseek

toxic verge
#

They’re not gonna be free forever or open source forever

shrewd geode
#

Seedance 2.0...

proud bobcat
#

they have nothing to gain from being closed

hushed gyro
#

Btw kimi just removed their legacy k2 model

What the hell? Those could be backup when 2.5 is at high demand? Do they not think of contingency plans or smth?

mystic olive
toxic verge
proud bobcat
hushed gyro
toxic verge
#

You think all these people are investing all this time and money just to give out free large language models

left lodge
toxic verge
#

No, I agree with you

hushed gyro
#

So guys any solutions / workarounds to the NB Pro error

proud bobcat
#

they subsidize

#

they reward research

left lodge
#

GLM-5 takes 4th place on Vending-Bench 2. Above Claude Sonnet 4.5, the state-of-the-art model less than 6 months ago. China seems to be 6 months behind the West. By June they will be ahead if the trends continue. More in this thread on why we don't think this will happen.

proud bobcat
#

companies share info

toxic verge
proud bobcat
#

communism unequal capitalist worldview

#

here we are used to companies constantly competing

mystic olive
proud bobcat
#

theres no aggression

#

if deepseek makes something revolutionary they wont gatekeep that

#

as such GLM 5 uses deepseeks' sparse attention

hushed gyro
#

OMG NB Pro worked for the 3rd time today! Let's see if it works again...

Guys I really need some ideas for solving the errors

proud bobcat
surreal zephyr
#

heres codex

#

suspension

devout helm
#

its generating since yesterday its possible to refresh it or something like this?

surreal zephyr
#

wait no i cant find the clip

#

💔

#

here

#

(fully custom and replicated to other clients, no default physics/collisions used)

proud bobcat
#

hey

#

maybe 5.3 codex is good

#

maybe ive been too harsh

#

but we shall see

surreal zephyr
#

less creative than opus

#

but at coding? its insane

surreal zephyr
#

if you alt tab the fps throttles to like 5 fps

#

so it causes other issues if hard prevention is added

remote vapor
#

maybe Qwen 3.5 will beat opuzzzzz (no it won't, but I don't care >~< )

surreal zephyr
#

now trying to add animated water (worst engine ever bruh)

#

(no you cant use dynamic meshes in rblx)

remote vapor
#

what engine?

.... did opus recommend godot again?

surreal zephyr
surreal zephyr
remote vapor
#

ohgosh..... bobox..... sigh.....

left lodge
#

<@&1349916362595635286> instant detection

surreal zephyr
hasty thorn
#

I can't log in to Google on my computer

surreal zephyr
left lodge
#

Damn

#

I was not expecting this

#

Kimi k2.5 and now glm 5