#general | Arena | Page 250

hollow imp Feb 5, 2026, 11:28 PM

#

Because it's exceeding the thinking limit set by arena

echo aurora Feb 5, 2026, 11:34 PM

#

We're collecting these possible false positives in this thread: #1447983134426660894 could you also share the prompt used?

verbal nimbus Feb 5, 2026, 11:34 PM

#

👀 I wonder how it does

uneven peak Feb 5, 2026, 11:35 PM

#

I hope we get Opus 4.6 thinking 32k 🥀😔

honest verge Feb 5, 2026, 11:37 PM

#

Is it just me or opus 4.6 thinking makes less detailed results than opus 4.6 no thinking?

uneven peak Feb 5, 2026, 11:38 PM

#

honest verge Is it just me or opus 4.6 thinking makes less detailed results than opus 4.6 no ...

Nah 4.6 thinking make more detailed results

gilded shell Feb 5, 2026, 11:39 PM

#

HAY

uneven peak Feb 5, 2026, 11:39 PM

#

Ka

icy yew Feb 5, 2026, 11:41 PM

#

honest verge Is it just me or opus 4.6 thinking makes less detailed results than opus 4.6 no ...

Ehh idk

#

Can you show like examples

#

.

crystal rapids Feb 5, 2026, 11:44 PM

#

hollow imp Because it's exceeding the thinking limit set by arena

Why have a model up if it always exceeds the limit, also on the 2nd attempt it crashed after it's thinking so rather a token limit perhaps. I would be guessing just instability, everyone trying to use it all at the same time, ect.

summer hound Feb 6, 2026, 12:04 AM

#

Yup that limit is screwing almost every prompt I try. I guess this model thinks so much more

keen beacon Feb 6, 2026, 12:06 AM

#

I can’t even accept the terms 🙁 I think it was because the captcha was bugged 🙁

patent oracle Feb 6, 2026, 12:31 AM

#

Q: Has anyone ever managed to convert a painting into a photoreal image using Nana Banana Pro? I can't seem to get to work tried all kind of prompts
Thank you!

summer hound Feb 6, 2026, 12:34 AM

#

yea dude this has to get fixed. unusable

loud verge Feb 6, 2026, 12:38 AM

#

Guys

#

Does grok 4 search not work anymore?

toxic verge Feb 6, 2026, 12:57 AM

#

echo aurora We're collecting these possible false positives in this thread: <#14479831344266...

This is an image edit no prompt

echo aurora Feb 6, 2026, 1:01 AM

#

summer hound yea dude this has to get fixed. unusable

We are looking into this.

thick pawn Feb 6, 2026, 1:01 AM

#

summer hound yea dude this has to get fixed. unusable

It's been a problem for ages and it still hasn't been fixed

echo aurora Feb 6, 2026, 1:03 AM

#

keen beacon I can’t even accept the terms 🙁 I think it was because the captcha was bugged �...

Can you try a different browser? With all of the issues you've seen today something is for sure off.

echo aurora Feb 6, 2026, 1:04 AM

#

thick pawn It's been a problem for ages and it still hasn't been fixed

It is worth noting this something went wrong error message is the generic error message that'll trigger for many different reasons (including rate limit).

thick pawn Feb 6, 2026, 1:09 AM

#

echo aurora It is worth noting this `something went wrong` error message is the generic erro...

I get it a lot, sometimes I get that message, but I can just refresh the page and the image is actually generating perfectly fine. Other times I get the message for seemingly no reason multiple times in a row, and I have to wait a few minutes before it works again, and that's not due to me hitting my rate limit

burnt sinew Feb 6, 2026, 1:22 AM

#

anyone here know how to get free ide usage for claude opus 4.6?

toxic verge Feb 6, 2026, 1:26 AM

#

echo aurora It is worth noting this `something went wrong` error message is the generic erro...

Idk how you guys will be able to fix captcha it does it’s job good if you loosen it it will be easier to bot. It’s a double edge sword

frosty shuttle Feb 6, 2026, 1:29 AM

#

summer hound yea dude this has to get fixed. unusable

Same thing here. The model thinks for a long time and is interrupted by the response limit when it's time to give the answer. I think only removing the response limit will make it work correctly.

toxic verge Feb 6, 2026, 1:33 AM

#

patent oracle Q: Has anyone ever managed to convert a painting into a photoreal image using Na...

#

stone cape Feb 6, 2026, 1:39 AM

#

burnt sinew anyone here know how to get free ide usage for claude opus 4.6?

kiro ide you can sub for $0 for first month i think if lucky

toxic verge Feb 6, 2026, 1:43 AM

#

I don’t really follow social media. I don’t have a Twitter or anything like that and I kind of stopped using Reddit but you know how I knew that anthropic was about to release a new model?

stone cape Feb 6, 2026, 1:44 AM

#

This seems VERY interesting https://discord.com/channels/1340554757349179412/1469144854264021174

toxic verge Feb 6, 2026, 1:44 AM

#

https://youtu.be/tjW_gms7CME?si=9z9_yVZfgLZTV39j

YouTube

NBC News

Anthropic CEO speaks about 'powerful' AI risks and regulation

Dario Amodei, the CEO of the AI company Anthropic, joined "Top Story" to discuss his new essay "The Adolescence of Technology: Confronting and Overcoming the Risks of Powerful A.I." In the essay, Amodei warns of the risks that come with artificial intelligence and also spoke with NBC News' Tom Llamas about AI regulation and control.

For more co...

▶ Play video

#

Cause he always goes on the news before they release a new model and always starts preaching

primal orbit Feb 6, 2026, 1:47 AM

#

opus 4.6 easily gonna be top 1 model on lmarena by a large margin

toxic verge Feb 6, 2026, 1:48 AM

#

Ofc it’s new

#

His tripping if he think it’s gunna replace software 🤣

#

It’s like saying ai video model will replace directors

north obsidian Feb 6, 2026, 1:51 AM

#

primal orbit opus 4.6 easily gonna be top 1 model on lmarena by a large margin

In code, but in text? I don't think so

shrewd citrus Feb 6, 2026, 1:54 AM

#

toxic verge His tripping if he think it’s gunna replace software 🤣

nah ai will definitely replace software engineers tho

#

like out of all the things any ai learnt

#

what was the thing it mastered first

#

it’s coding

#

apparently coding is the easiest thing for an ai to do

#

im gonna say that by the end of this year ai wld be better than 60-70% of engineers currently working

delicate fable Feb 6, 2026, 2:06 AM

#

shrewd citrus im gonna say that by the end of this year ai wld be better than 60-70% of engine...

It already is I feel like with opus 4.6

toxic verge Feb 6, 2026, 2:30 AM

#

Ya’all crazy

cloud zinc Feb 6, 2026, 2:44 AM

#

proud bobcat Feb 6, 2026, 2:48 AM

#

My favorite moment is when OpenAI released GPT 5.3 codex but didn’t set up any api for it

#

This model was so rushed

#

Really makes me sad cause I thought they’d take their time and not just rush out a release

frosty lava Feb 6, 2026, 2:59 AM

#

is the gpt 5.3 codex is available on lmarena ?

#

very interested to do a comparison between this one and the opus 4.6 with same prompt

proud bobcat Feb 6, 2026, 2:59 AM

#

frosty lava is the gpt 5.3 codex is available on lmarena ?

Not yet since OpenAI hasn’t released it on their api

frosty lava Feb 6, 2026, 3:00 AM

#

oh alright

#

thank you

nimble snow Feb 6, 2026, 3:01 AM

#

😢

#

i cant keep doin dis every prompt

cloud zinc Feb 6, 2026, 3:01 AM

#

you are robot

frosty lava Feb 6, 2026, 3:01 AM

#

if you have vpn it might be cause of it

nimble snow Feb 6, 2026, 3:02 AM

#

frosty lava if you have vpn it might be cause of it

i dont

#

im js cooked

frosty lava Feb 6, 2026, 3:02 AM

#

then idk

#

the opus 4.6 non thinking was already better than the 4.5 thinking in my test so i wonder what the thinking version can do

nimble snow Feb 6, 2026, 3:02 AM

#

frosty lava then idk

what u play on rovblox mud

frosty lava Feb 6, 2026, 3:03 AM

#

nimble snow what u play on rovblox mud

Nothing cause i can't find any good games for now

#

i already played every good existing one and im tired

toxic verge Feb 6, 2026, 3:05 AM

#

proud bobcat This model was so rushed

Why
. 5 series all cursed

nimble snow Feb 6, 2026, 3:05 AM

#

frosty lava i already played every good existing one and im tired

u play demon hunter?

toxic verge Feb 6, 2026, 3:06 AM

#

nimble snow 😢

Maybe be your browser. Try login in ans out your account

maiden fulcrum Feb 6, 2026, 3:12 AM

#

What is the rate limit for battle mode in the arena and when does it reset?

frosty lava Feb 6, 2026, 3:16 AM

#

nimble snow u play demon hunter?

no sorry

broken elm Feb 6, 2026, 3:44 AM

#

is lm down ?

past trail Feb 6, 2026, 3:52 AM

#

Is there any ai or website that can build full stack apps for free ...without any paywall

sturdy mica Feb 6, 2026, 3:57 AM

#

nimble snow what u play on rovblox mud

i literally only play armored patrol on robiox

robust sonnet Feb 6, 2026, 4:16 AM

#

Yo guys

#

What is the rate limit on opus 4.6 in code mode?

maiden fulcrum Feb 6, 2026, 4:32 AM

#

These reCAPTCHAS are so annoying, It doesn't go away at all

#

I selected all fire hydrants, yet it is telling me to try again

toxic verge Feb 6, 2026, 4:35 AM

#

Login in log out clear history and try different browsers

frosty lava Feb 6, 2026, 4:38 AM

#

i think gpt 5.3 is a big improvement actually and is better than opus 4.6

#

can't wait to see even stronger model but the improvement is already cool

stoic solar Feb 6, 2026, 4:44 AM

#

Heyy! Anyone have opus 4.5 free trick? I really need it like we can share files too

frosty lava Feb 6, 2026, 4:45 AM

#

opus 4.6 seems to do exactly what your asking but its almost every time less aesthetically pleasing so idk

stoic solar Feb 6, 2026, 4:49 AM

#

Where can i access opus 4.6?

frosty lava Feb 6, 2026, 4:49 AM

#

stoic solar Where can i access opus 4.6?

both claude website and lmarena

stoic solar Feb 6, 2026, 4:50 AM

#

Okay, thanks buddy. Does it have file upload and vision?

frosty lava Feb 6, 2026, 4:50 AM

#

Sorry i don't know actually

stoic solar Feb 6, 2026, 4:50 AM

#

Oki

#

I checked it, it doesn't have 😞

frosty lava Feb 6, 2026, 4:52 AM

#

stoic solar I checked it, it doesn't have 😞

its here, try on lmarena

#

oh my bad

#

i just understand what you said

stoic solar Feb 6, 2026, 4:52 AM

#

No I mean the upload and vision capabilities

frosty lava Feb 6, 2026, 4:52 AM

#

yeah sorry

stoic solar Feb 6, 2026, 4:52 AM

#

Yeah not an issue

#

Do you guys use antigravity?

frosty lava Feb 6, 2026, 4:53 AM

#

Yes but im mad at it cause even with the option "always proceed" it will ask me to confirm every prompt, ive seen other people with the same issue and can't find a fix

stoic solar Feb 6, 2026, 4:53 AM

#

Yes and the rate limits sucks

frosty lava Feb 6, 2026, 4:53 AM

#

yh

stoic solar Feb 6, 2026, 4:54 AM

#

Any other good ide?

#

Other than vs code with Co pilot

#

Cursor

#

?

frosty lava Feb 6, 2026, 4:54 AM

#

I use cursor so i don't know if there's a better one

stoic solar Feb 6, 2026, 4:54 AM

#

Depends on model

thick pawn Feb 6, 2026, 5:30 AM

#

Videos are currently having an infinite generation issue it seems

maiden fulcrum Feb 6, 2026, 5:31 AM

#

toxic verge Login in log out clear history and try different browsers

It didn't work

frosty lava Feb 6, 2026, 5:41 AM

#

lol i tried opus 4.6 its been thinking sooooooo much ive never saw that before

#

brainstorming on my prompt

thorn lantern Feb 6, 2026, 5:42 AM

#

thoughts on opus 4.6 vs. opus 4.5 for software engineering tasks for actual software engineers? In terms of real world usage? I noticed opus 4.5 edged out 4.6 in 2-3 metrics on the official release document, but opus 4.6 did better on most.

But in terms of real world use, what are people's thoughts comparing the two thus far? I've read that opus 4.6 can be "too agentic", but not sure if that's a universal opinion or just a one-off

frosty lava Feb 6, 2026, 5:43 AM

#

i don't know why but its been 6 minute thinking non stop on my single prompt

#

no lag its just really thinking

#

too much

thorn lantern Feb 6, 2026, 5:43 AM

#

I read that's expected, but I bet that's eating a ton of tokens..

#

and for simpler stuff, that's probably not needed

#

For example for devs who like to take control of the process and implement step-by-step with less "do it all for me"

frosty lava Feb 6, 2026, 5:44 AM

#

if you want i can send you the whole thinking for a single prompt (its not done yet)

#

its impressive

#

how much it actually think

thorn lantern Feb 6, 2026, 5:44 AM

#

That would be valuable to see, if you don't mind. you can dm me

frosty lava Feb 6, 2026, 5:44 AM

#

yeah i send you that

thorn lantern Feb 6, 2026, 5:45 AM

#

I've used 4.5 a lot, but haven't experimented with 4.6 yet

frosty lava Feb 6, 2026, 5:45 AM

#

i sent you the thinking

thorn lantern Feb 6, 2026, 5:46 AM

#

Got it, thanks! I'll respond in dm

burnt sinew Feb 6, 2026, 6:08 AM

#

stone cape kiro ide you can sub for $0 for first month i think if lucky

literally nevermind i forgot how good good chat + bad cli is

toxic verge Feb 6, 2026, 6:43 AM

#

maiden fulcrum It didn't work

I’ve been testing this out for a little

#

Try making a new account not Gmail

maiden fulcrum Feb 6, 2026, 6:43 AM

#

hmm

#

why

toxic verge Feb 6, 2026, 6:44 AM

#

I’m not sure exactly but it helps sometimes

#

Unless your sending requests to fast to model

#

Then your going to get them

#

Also try resetting ur modem if all else fails

frosty lava Feb 6, 2026, 6:47 AM

#

Okay i figured out something, opus 4.6 for some reason, when given a complex work to do, it will think without actually writing code, you can see a huge thinking during 5 / 6 minute then error from lmarena, but no code written, so you have to tell it to actually write code

#

its weird but yeah it is how it is

burnt sinew Feb 6, 2026, 6:52 AM

#

When's opus 4.6 going to be preliminary on leaderboard

echo aurora Feb 6, 2026, 6:53 AM

#

burnt sinew When's opus 4.6 going to be preliminary on leaderboard

We'll be sure to post an announcement when the leaderboards get an update with opus 4.6

slim spire Feb 6, 2026, 6:55 AM

#

echo aurora We'll be sure to post an announcement when the leaderboards get an update with o...

Does max use opus 4.6?

frosty lava Feb 6, 2026, 6:56 AM

#

echo aurora We'll be sure to post an announcement when the leaderboards get an update with o...

Hey, is it an issue from lmarena that opus 4.6 actually think so much it end up saying an error happened please try again ? it's actually very very often, the model will think 4 / 5 minute then just "an error happened"

#

its when given a complex work

#

code

left lodge Feb 6, 2026, 6:59 AM

#

New feature in testing. 👀

#

#

Take a screenshot
is just not working

#

Why is it even there? To take screenshots of the current session?

slim spire Feb 6, 2026, 7:04 AM

#

it's still in testing

left lodge Feb 6, 2026, 7:04 AM

#

And the transparent error popup is nice but why is it at the bottom?
It should be somewhere the bg is clear

slim spire Feb 6, 2026, 7:04 AM

#

it's not even out it's still in testing so it might not work that correctly yet

#

don't expect anything that is still being tested to work instantly

left lodge Feb 6, 2026, 7:05 AM

#

They shipped that means it should work

#

Atleast somewhat

#

Models in arena are executing commands! Theya re installating packages?!

echo aurora Feb 6, 2026, 7:11 AM

#

slim spire Does max use opus 4.6?

I believe it has the potential to use all models in Battle (excluding the codenamed models).

left lodge Feb 6, 2026, 7:12 AM

#

I hope these tools come to text modality or a completely new modality where it have all these tools available to use :>

#

Without the system prompt of code modality

echo aurora Feb 6, 2026, 7:12 AM

#

frosty lava Hey, is it an issue from lmarena that opus 4.6 actually think so much it end up ...

There is going to be a some instability with the model today as it's getting a lot of use as you can imagine.

frosty lava Feb 6, 2026, 7:14 AM

#

echo aurora There is going to be a some instability with the model today as it's getting a l...

Thank you, and do you know when gpt 5.3 will actually be available on lmarena ?

echo aurora Feb 6, 2026, 7:14 AM

#

left lodge New feature in testing. 👀

Nex experiment! https://help.arena.ai/articles/4680044864-arena-experiments-image-to-code

Arena Experiments: Image to Code

We are currently experimenting with a new feature: Image to Code. This feature allows users to upload an Image to Code Arena. Your prompts will now

icy yew Feb 6, 2026, 7:14 AM

#

frosty lava Thank you, and do you know when gpt 5.3 will actually be available on lmarena ?

When the API is out

#

There is no API for arena to use them right now

echo aurora Feb 6, 2026, 7:15 AM

#

frosty lava Thank you, and do you know when gpt 5.3 will actually be available on lmarena ?

For the most part I won't be sharing details about if/when new features/models/etc. are landing.

frosty lava Feb 6, 2026, 7:15 AM

#

icy yew There is no API for arena to use them right now

Oh i see

echo aurora Feb 6, 2026, 7:15 AM

#

left lodge > Take a screenshot is just not working

Thanks for sharing this, will flag. blobthanks

echo aurora Feb 6, 2026, 7:16 AM

#

left lodge > Take a screenshot is just not working

Any particular steps to repro this? I wasn't able to get this error.

rigid holly Feb 6, 2026, 7:16 AM

#

So whats the opinion about opus 4.6?

Cuz i just found out about it like this second

icy yew Feb 6, 2026, 7:17 AM

#

rigid holly So whats the opinion about opus 4.6? Cuz i just found out about it like this se...

Good

#

Dif better then 4.5

#

But benchmarks say the new gpt 5.3 codex is better

#

But until the API is fully out we don't know

rigid holly Feb 6, 2026, 7:18 AM

#

I mean... its out on openrouter so who knows

frosty lava Feb 6, 2026, 7:25 AM

#

opus 4.6 did a decent 3d world much better than opus 4.5

#

but gpt 5.3 might be even better

#

at coding and overall i guess

left lodge Feb 6, 2026, 7:31 AM

#

echo aurora Any particular steps to repro this? I wasn't able to get this error.

Just click the button?
Maybe cause of brave browser, try to replicate it with brave browser on android.

#

Hey pineapple can you create a seprate coding channels?

#

Guess the models behind these.

https://019c31d4-e917-7510-a7a6-f593cc0fcf35.arena.site/

https://019c31d4-e917-7b02-b61c-dff4a28a6d74.arena.site/

#

They are made by sota models of the same lab

icy yew Feb 6, 2026, 7:34 AM

#

frosty lava but gpt 5.3 might be even better

We don't know but we do know it's better in terminal bench which is like a important benchmark so the rest I think Claude wins on it

left lodge Feb 6, 2026, 7:34 AM

#

But the difference in one version upgrade is so much like bruh what

#

It could be this single instance but we will see

icy yew Feb 6, 2026, 7:36 AM

#

Made this with Claude 4.6 thinking
https://019c31d8-ccde-7e39-afa3-188c58cd1868.arena.site/
What do y'all think

Weather Dashboard

Check out what I built in Arena's Code Arena - Content is user-generated and unverified

toxic verge Feb 6, 2026, 7:38 AM

#

Who here is a nano pro?

toxic verge Feb 6, 2026, 7:40 AM

#

echo aurora Nex experiment! https://help.arena.ai/articles/4680044864-arena-experiments-imag...

I tried

#

atomic lagoon Feb 6, 2026, 7:46 AM

#

echo aurora Nex experiment! https://help.arena.ai/articles/4680044864-arena-experiments-imag...

How do I get this? Is it slowly rolling out?

royal crater Feb 6, 2026, 7:49 AM

#

Hi

#

I want to work with my GitHub repo. But it doesn't support github connector like perplexity. So wht to do now ?

#

Like I want the ai to create pr make changes put commit etc

icy yew Feb 6, 2026, 7:57 AM

#

echo aurora Any particular steps to repro this? I wasn't able to get this error.

Do you know what type of thinking the thinking mode of Claude 4.6 on arena is?

Low medium high or max

undone saffron Feb 6, 2026, 8:26 AM

#

Me using Opus 4.6 after the announcement:

sleek phoenix Feb 6, 2026, 8:35 AM

#

undone saffron Me using Opus 4.6 after the announcement:

why did you edit it

#

it was correct

royal crater Feb 6, 2026, 8:35 AM

#

I want to work with my GitHub repo. But it doesn't support github connector like perplexity. So wht to do now ?

undone saffron Feb 6, 2026, 8:35 AM

#

sleek phoenix why did you edit it

Vencord
Good

hollow imp Feb 6, 2026, 8:40 AM

#

crystal rapids Why have a model up if it always exceeds the limit, also on the 2nd attempt it c...

I see that daily with opus 4.5 32k. Finally found someone that struggles with me

golden ocean Feb 6, 2026, 8:54 AM

#

sleek phoenix why did you edit it

lmfao

#

real

austere sundial Feb 6, 2026, 9:04 AM

#

Omg lmarena became Light...
You can't for explosions, you can't for soldiers falling off the horse....
Drastic even chatgpt does that

fresh urchin Feb 6, 2026, 9:16 AM

#

Guys how good is opus 4.6?

sleek crow Feb 6, 2026, 9:32 AM

#

@echo aurora

#

the thinking model dosent work

gilded kiln Feb 6, 2026, 9:33 AM

#

Please add copilot too!!

golden ocean Feb 6, 2026, 9:44 AM

#

I'll let my work speak for itself http://localhost:8000/index.html

undone saffron Feb 6, 2026, 9:46 AM

#

sleek crow <@283397944160550928>

Check if you get [this error](#general message) too

spare rune Feb 6, 2026, 9:50 AM

#

toxic verge

That’s..

#

Interesting

fickle venture Feb 6, 2026, 9:50 AM

#

sleek crow <@283397944160550928>

Copy the prompt you sent him and refresh the website and send the prompt you copied

fickle venture Feb 6, 2026, 9:51 AM

#

golden ocean I'll let my work speak for itself http://localhost:8000/index.html

https://tenor.com/view/how-do-we-tell-him-mr-krabs-spongebob-meme-spongebob-meme-gif-24063814

Tenor

spare rune Feb 6, 2026, 9:51 AM

#

sleek crow <@283397944160550928>

Thinking is only for code right now

#

Right

fickle venture Feb 6, 2026, 9:51 AM

#

spare rune Thinking is only for code right now

No it's on text

spare rune Feb 6, 2026, 9:51 AM

#

Ohh

fickle venture Feb 6, 2026, 9:51 AM

#

Code and text

spare rune Feb 6, 2026, 9:51 AM

#

I might have to test it for creativiry

fickle venture Feb 6, 2026, 9:51 AM

#

spare rune I might have to test it for creativiry

Take advantage NOW because there is no limit

spare rune Feb 6, 2026, 9:51 AM

#

I lost my laptop so I lost all the motivation to do anything else 😭

spare rune Feb 6, 2026, 9:51 AM

#

fickle venture Take advantage NOW because there is no limit

No

#

I’m gonna use it 1 time

compact flame Feb 6, 2026, 9:52 AM

#

Guys what glasses mean on ai models

fickle venture Feb 6, 2026, 9:52 AM

#

spare rune No

I was using it many times yesterday 😭 still didn't hit limits

spare rune Feb 6, 2026, 9:52 AM

#

compact flame Guys what glasses mean on ai models

Smart or something

icy yew Feb 6, 2026, 9:53 AM

#

gilded kiln Please add copilot too!!

Copilot is just chatgpt

fickle venture Feb 6, 2026, 9:53 AM

#

gilded kiln Please add copilot too!!

Copilot doesn't exist

icy yew Feb 6, 2026, 9:54 AM

#

compact flame Guys what glasses mean on ai models

Vision

#

Like it can see images or something

compact flame Feb 6, 2026, 9:54 AM

#

icy yew Like it can see images or something

I think vision is the file icon

icy yew Feb 6, 2026, 9:54 AM

#

compact flame I think vision is the file icon

The file icon is pdf

compact flame Feb 6, 2026, 9:54 AM

#

Oh alright

icy yew Feb 6, 2026, 9:55 AM

#

fickle venture I was using it many times yesterday 😭 still didn't hit limits

Wait Claude 4.6 doesn't have rate limit?

fickle venture Feb 6, 2026, 9:56 AM

#

compact flame I think vision is the file icon

Sunglasses is vision
File is pdf upload

compact flame Feb 6, 2026, 9:56 AM

#

fickle venture Sunglasses is vision File is pdf upload

Alright thanks

fickle venture Feb 6, 2026, 9:56 AM

#

icy yew Wait Claude 4.6 doesn't have rate limit?

I mean I used it many times YESTERDAY still didn't hit limit

icy yew Feb 6, 2026, 9:56 AM

#

fickle venture I mean I used it many times YESTERDAY still didn't hit limit

Like how much tho

compact flame Feb 6, 2026, 9:56 AM

#

fickle venture I mean I used it many times YESTERDAY still didn't hit limit

Maybe thinking version only has limits?

icy yew Feb 6, 2026, 9:56 AM

#

dot3 dot4

fickle venture Feb 6, 2026, 9:56 AM

#

compact flame Maybe thinking version only has limits?

I was using thinking

compact flame Feb 6, 2026, 9:56 AM

#

Oh alr

fickle venture Feb 6, 2026, 9:56 AM

#

icy yew Like how much tho

Uhh I think 10-20 messages

compact flame Feb 6, 2026, 9:57 AM

#

So how good is opus 4.6

icy yew Feb 6, 2026, 9:57 AM

#

I mean I spammed it yesterday day night and didn't hit any limit

compact flame Feb 6, 2026, 9:57 AM

#

Or is it just same as 4.5

icy yew Feb 6, 2026, 9:57 AM

#

compact flame Or is it just same as 4.5

Smarter but 5.3 codex wins in one benchmark

fickle venture Feb 6, 2026, 9:57 AM

#

compact flame So how good is opus 4.6

I love it, I fixed a project that been working for while no other ai fixed it. Thanks opus 4.6

icy yew Feb 6, 2026, 9:58 AM

#

icy yew Smarter but 5.3 codex wins in one benchmark

But every other one Claude wins

compact flame Feb 6, 2026, 9:58 AM

#

icy yew Smarter but 5.3 codex wins in one benchmark

I mean one benchmark is no big difference but still good

fickle venture Feb 6, 2026, 9:58 AM

#

icy yew I mean I spammed it yesterday day night and didn't hit any limit

Probably rn it's free but today or tomorrow will add limits

fickle venture Feb 6, 2026, 9:58 AM

#

compact flame I mean one benchmark is no big difference but still good

compact flame Feb 6, 2026, 9:59 AM

#

fickle venture

Well damn

icy yew Feb 6, 2026, 9:59 AM

#

fickle venture

Also I think terminal bench is like a important one for agentic coding

#

🫀

fickle venture Feb 6, 2026, 9:59 AM

#

icy yew Also I think terminal bench is like a important one for agentic coding

Yeah it is

#

I think like the way it run commands on terminal

compact flame Feb 6, 2026, 9:59 AM

#

First time seeing chatgpt cook

icy yew Feb 6, 2026, 10:00 AM

#

compact flame First time seeing chatgpt cook

They released the codex 5.3 the nano second claude released 4.6

#

code_arena

compact flame Feb 6, 2026, 10:00 AM

#

I wonder if they'll add search to text arena for better results maybe

#

Like not cutting off ai from internet

icy yew Feb 6, 2026, 10:01 AM

#

compact flame I wonder if they'll add search to text arena for better results maybe

Grok is probably the best for up to date information

fickle venture Feb 6, 2026, 10:01 AM

#

The heck are these models on arena

icy yew Feb 6, 2026, 10:01 AM

#

fickle venture The heck are these models on arena

The hell

fickle venture Feb 6, 2026, 10:01 AM

#

compact flame I wonder if they'll add search to text arena for better results maybe

Search already exist

icy yew Feb 6, 2026, 10:02 AM

#

fickle venture The heck are these models on arena

What ai is named beluga😭

compact flame Feb 6, 2026, 10:02 AM

#

fickle venture Search already exist

I mean like search for the text arena not the separate thing

rigid holly Feb 6, 2026, 10:02 AM

#

So does anyone know the data training cutoff for this model?

Opus 4.5 i think it was early 2025

icy yew Feb 6, 2026, 10:02 AM

#

compact flame I mean like search for the text arena not the separate thing

Ye it's kinda annoying

fickle venture Feb 6, 2026, 10:03 AM

#

compact flame I mean like search for the text arena not the separate thing

If I remember I told a model to search online and it actually did while thinking on text arena

https://arena.ai/?chat-modality=search&mode=direct

fickle venture Feb 6, 2026, 10:04 AM

#

rigid holly So does anyone know the data training cutoff for this model? Opus 4.5 i think i...

Probably 2024

rigid holly Feb 6, 2026, 10:04 AM

#

Thats less than before tho

fickle venture Feb 6, 2026, 10:04 AM

#

Idk you can ask it and it will answer

compact flame Feb 6, 2026, 10:05 AM

#

I wonder why are the trainings are even cutoff

shrewd citrus Feb 6, 2026, 10:06 AM

#

because they can’t learn like that much reliable data or

rigid holly Feb 6, 2026, 10:06 AM

#

Yeah well 4.5 says its early 2025. And 4.6 isnt out yet

shrewd citrus Feb 6, 2026, 10:07 AM

#

like Claude says oh it has reliable data up to April 2025 but can still get info up to July or something

icy yew Feb 6, 2026, 10:07 AM

#

compact flame I wonder why are the trainings are even cutoff

Glm 5 is trying to fix this so it has access and gets info without even searching because it searched before

fickle venture Feb 6, 2026, 10:07 AM

#

compact flame I wonder why are the trainings are even cutoff

To not know what happened today so they won't take control of the world

fickle venture Feb 6, 2026, 10:08 AM

#

rigid holly Yeah well 4.5 says its early 2025. And 4.6 isnt out yet

4.6 is out...

icy yew Feb 6, 2026, 10:08 AM

#

rigid holly Yeah well 4.5 says its early 2025. And 4.6 isnt out yet

Your late

#

It's out

#

🫀 🫀 🫀

rigid holly Feb 6, 2026, 10:08 AM

#

Not on the model list it aint

shrewd citrus Feb 6, 2026, 10:08 AM

#

It is just search it

fickle venture Feb 6, 2026, 10:08 AM

#

rigid holly Not on the model list it aint

It's the last model just scroll all the way down or search opus

icy yew Feb 6, 2026, 10:09 AM

#

rigid holly Not on the model list it aint

It literally is though

#

Scroll down

fickle venture Feb 6, 2026, 10:09 AM

#

rigid holly Not on the model list it aint

icy yew Feb 6, 2026, 10:09 AM

#

Like the last model in the selection

rigid holly Feb 6, 2026, 10:09 AM

#

Ah it was hidden at the bottom

Expected it at the top

My bad

fickle venture Feb 6, 2026, 10:10 AM

#

rigid holly Ah it was hidden at the bottom Expected it at the top My bad

They did that on purpose I just can't prove it

left lodge Feb 6, 2026, 10:10 AM

#

Broo

#

#

💀

fickle venture Feb 6, 2026, 10:10 AM

#

Anthropic being themselves

icy yew Feb 6, 2026, 10:10 AM

#

fickle venture They did that on purpose I just can't prove it

https://tenor.com/view/james-doakes-doakes-curious-suspicious-questioning-gif-14390669560130901665

Tenor

golden ocean Feb 6, 2026, 10:11 AM

#

https://cdn.discordapp.com/attachments/559403230438883351/1468572366748778734/attachment.gif?ex=69867c58&is=69852ad8&hm=b13740d4b33b98af98027a4ee7c86990f84f61b5d5a0ce1d05516d4faa980145&

compact flame Feb 6, 2026, 10:11 AM

#

I guess nobody is safe from greed

bright spade Feb 6, 2026, 10:12 AM

#

the exacution of code dont work ?

left lodge Feb 6, 2026, 10:12 AM

#

Its on the last because its not on the leaderboard rn.
Sorthing is same as leaderboard and models not on the leaderboard are at the last

rigid holly Feb 6, 2026, 10:12 AM

#

Well its still early 2025

Slightly dissapointing

Was hoping for more recent stuff to be more available

Like knowing who the new pope is

fickle venture Feb 6, 2026, 10:13 AM

#

rigid holly Well its still early 2025 Slightly dissapointing Was hoping for more recent st...

I think they did that because they know ai can see the future before us

#

https://tenor.com/view/ش-gif-9093367425263329574

Tenor

fickle venture Feb 6, 2026, 10:14 AM

#

golden ocean https://cdn.discordapp.com/attachments/559403230438883351/1468572366748778734/at...

Well atleast I have a domain

http://localhost:8000/index.html

rigid holly Feb 6, 2026, 10:14 AM

#

Ok but still. Not even a middle 2025

fickle venture Feb 6, 2026, 10:14 AM

#

rigid holly Ok but still. Not even a middle 2025

They might do it on the next model because Opus 4.6 was in training in 2025

#

Probably

light sleet Feb 6, 2026, 10:15 AM

#

Will gpt 5.3 better than Opus 4.6?

sterile tartan Feb 6, 2026, 10:15 AM

#

fickle venture They might do it on the next model because Opus 4.6 was in training in 2025

Training for these Minor Improvements?

#

💀

sterile tartan Feb 6, 2026, 10:15 AM

#

light sleet Will gpt 5.3 better than Opus 4.6?

Probably Not

rigid holly Feb 6, 2026, 10:16 AM

#

That is a good point and yeah

The 4.1 and sonnet 4 had 2024 data before the upgrade

icy yew Feb 6, 2026, 10:16 AM

#

light sleet Will gpt 5.3 better than Opus 4.6?

Only in terminal bench

fickle venture Feb 6, 2026, 10:16 AM

#

light sleet Will gpt 5.3 better than Opus 4.6?

Nothing confirmed if GPT 5.3 is coming this month just don't believe anything you see. But my guess is No

rare fractal Feb 6, 2026, 10:16 AM

#

Where does the arena get the money to pay for all these expensive models?

sterile tartan Feb 6, 2026, 10:16 AM

#

fickle venture Nothing confirmed if GPT 5.3 is coming this month just don't believe anything yo...

But Codex 5.3 is already Out

light sleet Feb 6, 2026, 10:16 AM

#

fickle venture Nothing confirmed if GPT 5.3 is coming this month just don't believe anything yo...

Kk

fickle venture Feb 6, 2026, 10:16 AM

#

rare fractal Where does the arena get the money to pay for all these expensive models?

See their blog

icy yew Feb 6, 2026, 10:16 AM

#

sterile tartan But Codex 5.3 is already Out

But not the api

sterile tartan Feb 6, 2026, 10:16 AM

#

rare fractal Where does the arena get the money to pay for all these expensive models?

SECRET

sterile tartan Feb 6, 2026, 10:16 AM

#

icy yew But not the api

True

rigid holly Feb 6, 2026, 10:17 AM

#

I remember that in decdmber it was thoughg sonnet 4.6 or 4.7 was coming out that month

fickle venture Feb 6, 2026, 10:17 AM

#

sterile tartan But Codex 5.3 is already Out

Well idk why but I think they trying to improve something on Gpt 5.3

rigid holly Feb 6, 2026, 10:17 AM

#

Same deal?

sterile tartan Feb 6, 2026, 10:17 AM

#

fickle venture Well idk why but I think they trying to improve something on Gpt 5.3

I C

#

Makes Sense

golden ocean Feb 6, 2026, 10:17 AM

#

fickle venture Well atleast I have a domain http://localhost:8000/index.html

SO TRUE

fickle venture Feb 6, 2026, 10:18 AM

#

rigid holly I remember that in decdmber it was thoughg sonnet 4.6 or 4.7 was coming out that...

I mean internet is just full of lies

left lodge Feb 6, 2026, 10:19 AM

#

left lodge Broo

Now i think if this is true , what will they release in place of actual next sucessor of sonnet 4.5??

sterile tartan Feb 6, 2026, 10:19 AM

#

fickle venture I mean internet is just full of lies

Like Our Profiles

#

💀

left lodge Feb 6, 2026, 10:19 AM

#

Haiku???

rigid holly Feb 6, 2026, 10:19 AM

#

What about haiku?

fickle venture Feb 6, 2026, 10:21 AM

#

Haiku is boring no one cares

fickle venture Feb 6, 2026, 10:22 AM

#

sterile tartan Like Our Profiles

Its okay They hide our identity so yeah

icy yew Feb 6, 2026, 10:22 AM

#

left lodge Haiku???

Never even tried that

#

Probably dog water

fickle venture Feb 6, 2026, 10:22 AM

#

Same lol

sterile tartan Feb 6, 2026, 10:22 AM

#

fickle venture Its okay They hide our identity so yeah

Truth

toxic verge Feb 6, 2026, 10:30 AM

#

sterile tartan SECRET

They get funding

sterile tartan Feb 6, 2026, 10:30 AM

#

toxic verge They get funding

Yeah from A Crypto Venture

toxic verge Feb 6, 2026, 10:31 AM

#

https://www.prnewswire.com/news-releases/lmarena-secures-100m-in-seed-funding-to-bring-scientific-rigor-to-ai-reliability-302462025.html

LMArena Secures $100M in Seed Funding to Bring Scientific Rigor to ...

/PRNewswire/ -- LMArena, the open community platform for evaluating the best AI models, has secured $100 million in seed funding led by a16z and UC Investments...

#

They’re burning some money though

#

Just like most companies

#

Except they’re not technically a business somewhat

#

#

Lightspeed, Laude Ventures

#

next ivy Feb 6, 2026, 10:37 AM

#

ngl i thought it was sonnet 5 instead of opus 4.6, did not expect that

toxic verge Feb 6, 2026, 10:37 AM

#

#

https://deepmind.us.org/blog/lmarena-funding-hits-150m-at-1-7b-valuation

DeepMind

LMArena Funding Hits $150M at $1.7B Valuation

LMArena has secured $150 million in funding at a $1.7 billion valuation to advance its AI benchmarking platform. The article details the company's crowdsourced evaluation methods, challenges with traditional benchmarks, top AI model rankings, and how developers use the service for real-world testing and insights.

rigid holly Feb 6, 2026, 10:38 AM

#

So

In terms of writing long texts do we still need to wait for it to polished or what

Cuz it keeps crashing

Does it still need those funky numbers at tge end of the model?

left lodge Feb 6, 2026, 10:48 AM

#

icy yew Never even tried that

Haiku is good combination of speed and intelligence

icy yew Feb 6, 2026, 10:54 AM

#

left lodge Haiku is good combination of speed and intelligence

Wasn't that for sonnet

#

Or is it way faster

high dirge Feb 6, 2026, 11:01 AM

#

is there a way to find out the exact model the max routed to not just the organization

#

since it could be helpful to see what models are best at what prompts

spare rune Feb 6, 2026, 11:10 AM

#

Hahahaha omg this is so real pls sen me the link

#

😂😂😂

covert iris Feb 6, 2026, 11:15 AM

#

gimme money KBBQ_woww

uneven lance Feb 6, 2026, 11:26 AM

#

In lmarena leaderboard Gemini 3 pro ranks 1st while in Artificial Analysis leaderboard Gpt 5.2 pro tanks first

#

Is Artificial Analysis biased?

#

They only rank using trust me bro benchmarks...

golden ocean Feb 6, 2026, 11:29 AM

#

crack bench

echo dome Feb 6, 2026, 11:29 AM

#

idk if this one works

echo dome Feb 6, 2026, 11:30 AM

#

fickle venture The heck are these models on arena

who invited beluga into arena bro

frigid tusk Feb 6, 2026, 11:38 AM

#

why is opus 4.6 the lowest here

north obsidian Feb 6, 2026, 11:40 AM

#

echo dome idk if this one works

Yes, this AI just take ur prompt and put the best AI for it to answer u or speak

icy yew Feb 6, 2026, 11:40 AM

#

left lodge Haiku is good combination of speed and intelligence

To be honest it's actually decent

#

I wou still go for sonnet for balance tho

north obsidian Feb 6, 2026, 11:40 AM

#

It can be since google model until xAI

golden ocean Feb 6, 2026, 11:40 AM

#

frigid tusk why is opus 4.6 the lowest here

so that their money burned less fast because less attention

frigid tusk Feb 6, 2026, 11:41 AM

#

golden ocean so that their money burned less fast because less attention

but its the best of all?

echo dome Feb 6, 2026, 11:41 AM

#

frigid tusk why is opus 4.6 the lowest here

mr krabs

golden ocean Feb 6, 2026, 11:41 AM

#

🦀 money 🦀 money 🦀 money

echo dome Feb 6, 2026, 11:42 AM

#

(that's the answer)

frigid tusk Feb 6, 2026, 11:42 AM

#

echo dome mr krabs

https://tenor.com/view/ahoklollmao-of-course-ohhhh-oh-that-makes-sense-gif-11896286840118452668

Tenor

golden ocean Feb 6, 2026, 11:42 AM

#

REAL

left lodge Feb 6, 2026, 11:43 AM

#

icy yew I wou still go for sonnet for balance tho

Yeah sonnet is for everyday tasks and you can haiku for fast and quick answer without being too conscious about it being hallucinated or wrong, it is fastest model by claude and is seemed better compared to other similiar size models

left lodge Feb 6, 2026, 11:44 AM

#

left lodge Broo

It might be true lmao

#

It shows improvements from sonnet 4.5 but not from opus 4.5

echo dome Feb 6, 2026, 11:45 AM

#

wait what just happened to sonnet creator

echo dome Feb 6, 2026, 11:46 AM

#

left lodge It might be true lmao

wait gemini 3 flash thinking have thinking? i forgor

fickle venture Feb 6, 2026, 11:59 AM

#

left lodge It might be true lmao

Dam Gemini 3 Flash is the easiest one to jailbreak lol

left lodge Feb 6, 2026, 12:03 PM

#

echo dome wait gemini 3 flash thinking have thinking? i forgor

Gemini 3 flash literally doesn't have a non thinking variant

glacial dock Feb 6, 2026, 12:04 PM

#

I’m getting non stop time outs with opus 4.6 thinking, meanwhile with the same question 4.5 has never timed out …is this a bug or is it because it’s brand new and needs more time ?

left lodge Feb 6, 2026, 12:04 PM

#

fickle venture Dam Gemini 3 Flash is the easiest one to jailbreak lol

Cause of its hallucinations and behaviour

#

Make minecraft with touch controls

Opus 4.6 thinking
https://019c32cb-de47-77c5-aaf3-30b23c910a2f.arena.site/

Gpt 5.1 codex max ultra premium pro high figh wifi bluetooth
https://019c32cb-de47-78f8-8f93-6c88f6728a2d.arena.site/

MiniCraft - Voxel Builder

Check out what I built in Arena's Code Arena - Content is user-generated and unverified

Touchcraft: Tiny Minecraft

Check out what I built in Arena's Code Arena - Content is user-generated and unverified

#

Wth is 5.1 even doing wth is that

north obsidian Feb 6, 2026, 12:10 PM

#

left lodge > Make minecraft with touch controls Opus 4.6 thinking https://019c32cb-de47-77...

What model made this?

#

It's good but the floor

hollow ivy Feb 6, 2026, 12:11 PM

#

north obsidian What model made this?

Anthropic's Claude (Opus)

left lodge Feb 6, 2026, 12:12 PM

#

Specifically opus 4.6 thinking

#

I just said Make minecraft with touch controls

#

One prompt

north obsidian Feb 6, 2026, 12:13 PM

#

I liked it

left lodge Feb 6, 2026, 12:13 PM

#

Gpt 5.2 codex
https://019c32dc-b394-79d2-a3f1-08e74bf4e7ae.arena.site/

BlockCraft Touch Builder

Check out what I built in Arena's Code Arena - Content is user-generated and unverified

#

Gpt models are so weird rn

hollow ivy Feb 6, 2026, 12:15 PM

#

opus 4.6 takes forever to answer a simple prompt in arena

hollow ivy Feb 6, 2026, 12:15 PM

#

left lodge Gpt 5.2 codex https://019c32dc-b394-79d2-a3f1-08e74bf4e7ae.arena.site/

OAI is history

left lodge Feb 6, 2026, 12:15 PM

#

hollow ivy opus 4.6 takes forever to answer a simple prompt in arena

Cause its not made for simple prompts ¯_(ツ)_/¯

north obsidian Feb 6, 2026, 12:15 PM

#

left lodge Gpt 5.2 codex https://019c32dc-b394-79d2-a3f1-08e74bf4e7ae.arena.site/

Yeah

wind ember Feb 6, 2026, 12:16 PM

#

now even direct chat has captcha?

hollow ivy Feb 6, 2026, 12:16 PM

#

left lodge Gpt models are so weird rn

Anthropic (and Deepmind) have defeated OpenAI

wind ember Feb 6, 2026, 12:16 PM

#

like come on ...#

left lodge Feb 6, 2026, 12:16 PM

#

hollow ivy opus 4.6 takes forever to answer a simple prompt in arena

Its literally made to think and reason for the hardest problems

hollow ivy Feb 6, 2026, 12:16 PM

#

-# later, xAI might join the victors

left lodge Feb 6, 2026, 12:16 PM

#

wind ember like come on ...#

It already had from the starting

wind ember Feb 6, 2026, 12:16 PM

#

this is annoying

north obsidian Feb 6, 2026, 12:16 PM

#

left lodge Gpt models are so weird rn

I never had like it, I prefer Gemini and Claude

wind ember Feb 6, 2026, 12:17 PM

#

left lodge It already had from the starting

it was only on voting session

left lodge Feb 6, 2026, 12:17 PM

#

north obsidian I never had like it, I prefer Gemini and Claude

Yeah gpt lost it with gpt 5 launch

ocean ferry Feb 6, 2026, 12:18 PM

#

can anyone try this prompt for Gemini 3 Pro GA?

Create a nice looking and rich SaaS about Gemini 3 Pro GA by Google Deepmind, it must has a mock about the Gemini 3 Pro Preview which is so lazy and it's fixed on Gemini 3 Pro GA, output in single html, must use tailwind css(cdn) and i don't want shiity website, should be really cool and good and should never use emoji in the html.

#

https://x.com/i/status/2019458279856877818

Chetaslua (@chetaslua)

🚨 How to Access new Gemini 3GA

go to @arena
battle mode - write your prompt - chances are set at 25%
how to difference
gemini-3-pro with google logo is normal
gemini-3-pro without google logo is GA

show me your demo now

#

i never get it bro

#

i only get gemini with google logo bruh

#

so please any1 try it

wind ember Feb 6, 2026, 12:19 PM

#

ocean ferry so please any1 try it

not working anymore

hollow ivy Feb 6, 2026, 12:20 PM

#

ocean ferry https://x.com/i/status/2019458279856877818

does this guy favor Lua language?

left lodge Feb 6, 2026, 12:20 PM

#

wind ember it was only on voting session

No, google captchas were already on direct chats

toxic verge Feb 6, 2026, 12:21 PM

#

I don’t know how they’re gonna replace them. They need it for anti bot

#

Because it’s really effective

wind ember Feb 6, 2026, 12:22 PM

#

left lodge No, google captchas were already on direct chats

they need to do smth about it

toxic verge Feb 6, 2026, 12:22 PM

#

That’s the thing I don’t know what alternatives there are I can’t think of any

left lodge Feb 6, 2026, 12:22 PM

#

Wait i just noticed chat titles are not first prompt

#

Hmm

golden ocean Feb 6, 2026, 12:23 PM

#

10 gallons of water per custom chat title

toxic verge Feb 6, 2026, 12:24 PM

#

Fr

wind ember Feb 6, 2026, 12:24 PM

#

golden ocean 10 gallons of water per custom chat title

a lot

left lodge Feb 6, 2026, 12:24 PM

#

Thats not correct bro 😭

wind ember Feb 6, 2026, 12:24 PM

#

not sending any prompt anymore

toxic verge Feb 6, 2026, 12:24 PM

#

Oh yeah, it’s not correct but there’s a bitter drop of truth in there in a general sense

#

Exaggerated

left lodge Feb 6, 2026, 12:25 PM

#

Claude opus 4.5

non thinking

https://019c32dd-76b3-7fd8-8986-8e3397f6fb27.arena.site/

Minecraft Touch

Check out what I built in Arena's Code Arena - Content is user-generated and unverified

#

This one is nice one too ↑

#

Gemini 3 pro
https://019c32dd-76b2-7edd-b2a2-750076c5c961.arena.site/

Arena Web Dev App

Check out what I built in Arena's Code Arena - Content is user-generated and unverified

#

It isnt even rerendering anything 😭

ocean ferry Feb 6, 2026, 12:30 PM

#

wind ember not working anymore

the ga deleted?

zealous sparrow Feb 6, 2026, 12:30 PM

#

wind ember not working anymore

actually it does

#

It's just a gemini 3 pro model

#

One is direct with logo
another is stealth with logo

left lodge Feb 6, 2026, 12:31 PM

#

I literally have zero interest in gemini models cause of their hallucinations and attitude issues

wind ember Feb 6, 2026, 12:31 PM

#

zealous sparrow One is direct with logo another is stealth with logo

its only available in battle mode?

zealous sparrow Feb 6, 2026, 12:31 PM

#

wind ember its only available in battle mode?

probably so

left lodge Feb 6, 2026, 12:31 PM

#

wind ember its only available in battle mode?

Yes

zealous sparrow Feb 6, 2026, 12:31 PM

#

i have a gen with 2 gemini 3 pros

wind ember Feb 6, 2026, 12:32 PM

#

mm i see

ocean ferry Feb 6, 2026, 12:34 PM

#

zealous sparrow One is direct with logo another is stealth with logo

can u try it with my prompt plz

#

i have tried it for 2+ hours and i only get the gemini 3 pro with logo

zealous sparrow Feb 6, 2026, 12:35 PM

#

ocean ferry i have tried it for 2+ hours and i only get the gemini 3 pro with logo

the GA is also with logo, you can only tell by quality

shrewd citrus Feb 6, 2026, 12:36 PM

#

does anyone else have the problem where opus 4.6 thinking just thinks for too long

#

and then stops working

left lodge Feb 6, 2026, 12:36 PM

#

Just three looks wierd they should have extensions showing

shrewd citrus Feb 6, 2026, 12:37 PM

#

like 4.5 would think for a minute max before it starts outputting something

north obsidian Feb 6, 2026, 12:37 PM

#

left lodge Claude opus 4.5 > non thinking https://019c32dd-76b3-7fd8-8986-8e3397f6fb27.ar...

Better than Claude lol

left lodge Feb 6, 2026, 12:37 PM

#

It is by claude

#

👀

north obsidian Feb 6, 2026, 12:37 PM

#

Hmmm

ocean ferry Feb 6, 2026, 12:37 PM

#

zealous sparrow the GA is also with logo, you can only tell by quality

bruh😭

north obsidian Feb 6, 2026, 12:37 PM

#

I see it now

#

Claude 4.5 was better than Claude 4.6 thinking 🤔

north obsidian Feb 6, 2026, 12:38 PM

#

left lodge Claude opus 4.5 > non thinking https://019c32dd-76b3-7fd8-8986-8e3397f6fb27.ar...

It made a floor

toxic verge Feb 6, 2026, 12:39 PM

#

left lodge I literally have zero interest in gemini models cause of their hallucinations an...

Attitude?👀
.

uneven lance Feb 6, 2026, 12:42 PM

#

Gpt 5.3 codex is so bad at frontend

#

But backend it shines

left lodge Feb 6, 2026, 12:45 PM

#

toxic verge Attitude?👀 .

Yeah i dont like its character, its lazy, doesn't accept its own mistakes, doesn't follow instructions, a literal karen

#

When i say dont be lazy, instead of doing work it literally says i am not lazy 😭

uneven lance Feb 6, 2026, 12:46 PM

#

I had to beg with Gemini flash so it follows my lead

left lodge Feb 6, 2026, 12:46 PM

#

Yeah flash is even worse

uneven lance Feb 6, 2026, 12:47 PM

#

Within thinking it's the worst

left lodge Feb 6, 2026, 12:47 PM

#

They dont have any reliability

uneven lance Feb 6, 2026, 12:47 PM

#

It removes features

#

When I ask it to add a feature it removes the previous version of the code too 😡

#

So now I just use GLM for coding

left lodge Feb 6, 2026, 12:48 PM

#

Glm is good

#

Its tool use capability i like it but it outputs literal articles even for simple questions

uneven lance Feb 6, 2026, 12:50 PM

#

A system prompt makes it give one line answer...

#

I have to instruct it how to use its tools

#

The model is so agreeable too

#

Isn't gpt 5.3 codex assisted by 5.2 codex in it's creation?

#

No wonder the front-end capability is meh

wind ember Feb 6, 2026, 12:53 PM

#

left lodge Glm is good

#

left lodge Feb 6, 2026, 12:53 PM

#

wind ember

Good doesn't mean sota

#

Good is good not perfect

wind ember Feb 6, 2026, 12:53 PM

#

what is it good at?

#

frontend?

left lodge Feb 6, 2026, 12:54 PM

#

Try yourself

wind ember Feb 6, 2026, 12:54 PM

#

i did

#

im asking you what is it good a t

#

maybe im missing something

spare rune Feb 6, 2026, 12:54 PM

#

wind ember maybe im missing something

Same

wind ember Feb 6, 2026, 12:54 PM

#

although i still think glm 4.7 is the best chinese model yet

left lodge Feb 6, 2026, 12:55 PM

#

I haven't used it for front-end

wind ember Feb 6, 2026, 12:55 PM

#

alongside deepseek v3.2

left lodge Feb 6, 2026, 12:55 PM

#

What about Kimi 2.5?

wind ember Feb 6, 2026, 12:56 PM

#

left lodge What about Kimi 2.5?

meh

#

starting to look more like gemini 3 clone

#

its heavily trained on gemini outputs

spare mango Feb 6, 2026, 12:58 PM

#

Gemini, the "best" chatbot.

#

"in 2024".

zealous sparrow Feb 6, 2026, 12:58 PM

#

what model did you use

#

Fast or thinking

spare mango Feb 6, 2026, 12:58 PM

#

Pro.

zealous sparrow Feb 6, 2026, 12:59 PM

#

it knows what year it is, just does silly messups..

spare mango Feb 6, 2026, 12:59 PM

#

If it knows what year it is, then why does it do the silly messup.

#

That's like me saying I sometimes forget my name.

#

That would reduce my credibility and reliability by a lot.

north obsidian Feb 6, 2026, 1:03 PM

#

2 y ago I saw a brutal AI error in math, was about meta AI it made all the equation but in the final it was like 255 + 1 it said 257

shrewd citrus Feb 6, 2026, 1:08 PM

#

spare mango Gemini, the "best" chatbot.

i hate it when models say “you are absolutely right”

#

like why can’t they be right when i first asked the question

spare mango Feb 6, 2026, 1:09 PM

#

shrewd citrus like why can’t they be right when i first asked the question

Honestly same.

#

I have to argue with, and debunk the AI's false claims, after which I realize I just wasted my time arguing with an AI.

icy yew Feb 6, 2026, 1:11 PM

#

spare mango Pro.

Gork the best in time lines tbh

golden ocean Feb 6, 2026, 1:11 PM

#

You're absolutely right

icy yew Feb 6, 2026, 1:12 PM

#

It doesn't really get years wrong or dates

icy yew Feb 6, 2026, 1:14 PM

#

spare mango Pro.

Gemini 3 when it doesn't hallucinate absolutely cooks especially in language

#

Sadly it hallucinates like crazy

hollow ivy Feb 6, 2026, 1:25 PM

#

icy yew Feb 6, 2026, 1:26 PM

#

Claude ofc

hollow ivy Feb 6, 2026, 1:26 PM

#

icy yew Claude ofc

I agree, but i still want to see, if Claude Opus manages to get 100% in this poll, and which model lands second place.

keen beacon Feb 6, 2026, 1:28 PM

#

how to fix it?

icy yew Feb 6, 2026, 1:29 PM

#

keen beacon how to fix it?

It just happens I think

#

Unstable

keen beacon Feb 6, 2026, 1:30 PM

#

🙁

hollow ivy Feb 6, 2026, 1:31 PM

#

hollow ivy Feb 6, 2026, 1:31 PM

#

hollow ivy

(..the 2nd poll)

keen beacon Feb 6, 2026, 1:31 PM

#

icy yew It just happens I think

but does it go back to normal? because I already tried closing the browser and everything, and it didn’t work , it’s still giving this same problem

hollow ivy Feb 6, 2026, 1:35 PM

#

eternal saffron Feb 6, 2026, 1:36 PM

#

website is down?

icy yew Feb 6, 2026, 1:37 PM

#

eternal saffron website is down?

Uh no

eternal saffron Feb 6, 2026, 1:38 PM

#

keen beacon Feb 6, 2026, 1:41 PM

#

🙁

icy yew Feb 6, 2026, 1:44 PM

#

eternal saffron

Could be a problem on your side

#

It works for me

eternal saffron Feb 6, 2026, 1:49 PM

#

alr mate

frozen osprey Feb 6, 2026, 2:06 PM

#

Does nano banana pro work

#

It keeps giving errors

somber sky Feb 6, 2026, 2:08 PM

#

did you use multiple accounts like me?

frozen osprey Feb 6, 2026, 2:08 PM

#

somber sky did you use multiple accounts like me?

Nah

somber sky Feb 6, 2026, 2:12 PM

#

or generate any feminine related?!?!!?

#

huh????!?!?!?

sterile tartan Feb 6, 2026, 2:18 PM

#

What are Opus Rate Limits on Arena?

plucky sparrow Feb 6, 2026, 2:24 PM

#

has anyone tried this on opus?

modest prism Feb 6, 2026, 2:32 PM

#

Opus 4.6 thinking gives a timeout when thinking longer than a certain time. Any plan to fix it

vast fern Feb 6, 2026, 2:35 PM

#

@echo aurora are there any plans for adding gpt 5.3

icy yew Feb 6, 2026, 2:37 PM

#

vast fern <@283397944160550928> are there any plans for adding gpt 5.3

They can't

#

The API for 5.3 codex isn't out

golden ocean Feb 6, 2026, 2:37 PM

#

is there an api for talking to @icy yew

icy yew Feb 6, 2026, 2:37 PM

#

After the API comes out

icy yew Feb 6, 2026, 2:38 PM

#

golden ocean is there an api for talking to <@1272512245284474953>

https://cdn.discordapp.com/attachments/1283423642579107905/1302654122008252446/image0.gif

vast fern Feb 6, 2026, 2:47 PM

#

keen beacon how to fix it?

login

icy yew Feb 6, 2026, 2:50 PM

#

golden ocean is there an api for talking to <@1272512245284474953>

Screenshot_2026-02-06-17-20-12-237_com.android.chrome-edit.jpg

#

https://tenor.com/view/respect-moment-gif-15169320514465779908

Tenor

keen beacon Feb 6, 2026, 3:02 PM

#

vast fern login

dont work

frosty shuttle Feb 6, 2026, 3:11 PM

#

The new Claude program is still unusable for me; it thinks for a long time before giving an answer, but is interrupted by the site's limit. Has anyone managed to use it yet?

shrewd citrus Feb 6, 2026, 3:13 PM

#

frosty shuttle The new Claude program is still unusable for me; it thinks for a long time befor...

exact same thing is happening to me too

fleet lintel Feb 6, 2026, 3:17 PM

#

do we have gemini pro ga candidate on LMArena?

#

What is the ranking according to this group?

claude 4.6 > gpt 5.3 > gemini pro ga
OR
claude 4.6 > gemini pro ga > gpt 5.3

i believe these are the only two possibilities 🙂

timber iris Feb 6, 2026, 3:19 PM

#

https://discord.com/channels/1340554757349179412/1469259617661091964

glass perch Feb 6, 2026, 3:31 PM

#

Why is opus4.6 thinking forever man

#

I swear it never stops thinking

icy yew Feb 6, 2026, 3:31 PM

#

glass perch Why is opus4.6 thinking forever man

At most for me it takes the maximum 30 seconds

#

For hard stuff

glass perch Feb 6, 2026, 3:32 PM

#

Im tryna get it to make a story game

#

That lasts like 10 mins

#

Which might explain it

north obsidian Feb 6, 2026, 3:43 PM

#

fleet lintel What is the ranking according to this group? claude 4.6 > gpt 5.3 > gemini pr...

My ranking Claude > Gemini > Chatgpt

north obsidian Feb 6, 2026, 3:43 PM

#

fleet lintel What is the ranking according to this group? claude 4.6 > gpt 5.3 > gemini pr...

Code?

plucky basalt Feb 6, 2026, 3:43 PM

#

dude why cant i test any model on this website

#

it reasons for 5 mins and it breaks

burnt sinew Feb 6, 2026, 4:01 PM

#

fleet lintel What is the ranking according to this group? claude 4.6 > gpt 5.3 > gemini pr...

What's ga

icy yew Feb 6, 2026, 4:04 PM

#

burnt sinew What's ga

General availability or something

sturdy mica Feb 6, 2026, 4:08 PM

#

hollow ivy

wow you guys havent touched gpt 5.3 codex

obsidian cargo Feb 6, 2026, 4:08 PM

#

I'd definitely rank claude 4.6 over gemini 3 at this point, esp with gemini 3 being so terse

celest orchid Feb 6, 2026, 4:09 PM

#

icy yew

lol

mystic olive Feb 6, 2026, 4:14 PM

#

obsidian cargo I'd definitely rank claude 4.6 over gemini 3 at this point, esp with gemini 3 be...

Gemini 3 is still better overall tho

icy yew Feb 6, 2026, 4:15 PM

#

mystic olive Gemini 3 is still better overall tho

The main problem is it hallucinates like a lot

obsidian cargo Feb 6, 2026, 4:16 PM

#

idk my big problem with gemini 3 is it doesn't do long outputs. also it always wants to name characters Elara Vance

fickle venture Feb 6, 2026, 4:23 PM

#

obsidian cargo idk my big problem with gemini 3 is it doesn't do long outputs. also it *always*...

cosmic falcon Feb 6, 2026, 4:26 PM

#

Can u guys release a premium version of the arena , so theres dedicated support , most of the time server crashes on my experience , just an opinion btw

icy yew Feb 6, 2026, 4:36 PM

#

cosmic falcon Can u guys release a premium version of the arena , so theres dedicated support ...

Wdym server crashes

#

Does the website not work or the ais

acoustic garden Feb 6, 2026, 4:37 PM

#

What's causing this error? I haven't used it at all today, I don't have a limit.

icy yew Feb 6, 2026, 4:37 PM

#

acoustic garden What's causing this error? I haven't used it at all today, I don't have a limit.

With Claude?

acoustic garden Feb 6, 2026, 4:37 PM

#

icy yew With Claude?

yes

icy yew Feb 6, 2026, 4:37 PM

#

acoustic garden yes

Oh no that's normal

#

Claude code with opus 4.6 is unstable

acoustic garden Feb 6, 2026, 4:38 PM

#

I use 4.5

icy yew Feb 6, 2026, 4:38 PM

#

Well
Idk

#

🫀

acoustic garden Feb 6, 2026, 4:38 PM

#

(

sturdy mica Feb 6, 2026, 4:48 PM

#

obsidian cargo idk my big problem with gemini 3 is it doesn't do long outputs. also it *always*...

Eli Vance half life

toxic verge Feb 6, 2026, 4:51 PM

#

mystic olive Gemini 3 is still better overall tho

Gemini sucks. All though it’s all I use right now only because of nano banana

#

Well not sucks but has issues

frosty shuttle Feb 6, 2026, 4:59 PM

#

toxic verge Gemini sucks. All though it’s all I use right now only because of nano banana

It's always the same, Google nerfs models after 5 days of release, the same thing happened with Gemini 2.5.

junior spoke Feb 6, 2026, 4:59 PM

#

Nano banana lagging rn right

burnt sinew Feb 6, 2026, 5:11 PM

#

icy yew General availability or something

Ah so would ai studio one count

burnt sinew Feb 6, 2026, 5:15 PM

#

mystic olive Gemini 3 is still better overall tho

For chatting sure maybe. For coding? Not in the slightest compared to opus 4.6 only thing going for it now is that its free on cli

pulsar crystal Feb 6, 2026, 5:34 PM

#

i love max
why does max not tell me what model responded?
it would be useful to know

i guess max does does not know it?
because athropic is internally routing?

echo aurora Feb 6, 2026, 5:38 PM

#

pulsar crystal i love max why does max not tell me what model responded? it would be useful to ...

Going to respond in #ask-here in a bit blobthumbsup

echo aurora Feb 6, 2026, 5:40 PM

#

plucky basalt dude why cant i test any model on this website

Are you able to create a post in #1343291835845578853 and explain a bit more what's going on? What modalities are you using, is this mobile or desktop issue, is there an error message, etc. Anything you think is relevant would be helpful to know.

echo aurora Feb 6, 2026, 5:41 PM

#

frosty shuttle The new Claude program is still unusable for me; it thinks for a long time befor...

It is going to be a bit unstable, but it is working. I'd recommend trying the steps in this article when you run into this error message.

Arena Troubleshooting: Something went wrong with this response... e...

You may sometimes see the error message: “Something went wrong with this response, please try again.”
This is a general error message. It can

zealous sparrow Feb 6, 2026, 5:41 PM

#

acoustic garden What's causing this error? I haven't used it at all today, I don't have a limit.

Python in codearena?

dense pumice Feb 6, 2026, 5:46 PM

#

why lmarena isn't opening

burnt pulsar Feb 6, 2026, 5:54 PM

#

Opus-4.6-Thinking is too unstable for me with longer tasks, 4.5-Thinking-32K is way better.

toxic verge Feb 6, 2026, 6:01 PM

#

frosty shuttle It's always the same, Google nerfs models after 5 days of release, the same thin...

Ya gem 3 is sad to see what it became

icy yew Feb 6, 2026, 6:02 PM

#

toxic verge Ya gem 3 is sad to see what it became

Google says It makes these insane codes and when you actually try it it ain't the same level as they showed

burnt sinew Feb 6, 2026, 6:27 PM

#

burnt pulsar Opus-4.6-Thinking is too unstable for me with longer tasks, 4.5-Thinking-32K is ...

Yes for me it times out. I have been copying its thinking context and pasting it back to it so it can have the very long thinking times. Otherwise it will just keep trying and failing

proud bobcat Feb 6, 2026, 6:28 PM

#

babe wake up

#

gpt 5.3

echo aurora Feb 6, 2026, 6:28 PM

#

burnt pulsar Opus-4.6-Thinking is too unstable for me with longer tasks, 4.5-Thinking-32K is ...

This has been flagged to the team btw cc @burnt sinew

burnt sinew Feb 6, 2026, 6:29 PM

#

echo aurora This has been flagged to the team btw cc <@506708936167260160>

Tbh I thought it was an issue with every model... even gemini 3 officially errors out at around 10 minutes

burnt pulsar Feb 6, 2026, 6:29 PM

#

Thanks, I usually analyze/optimize Mesa/Linux Kernel files of around 2000 lines of code. Opus 4.6-Thinking really struggles there.

burnt sinew Feb 6, 2026, 6:30 PM

#

burnt pulsar Thanks, I usually analyze/optimize Mesa/Linux Kernel files of around 2000 lines ...

Wouldn't you use 4.6 non thinking then?

#

Or... what i said earlier with copying thinking context manually before it errors

burnt pulsar Feb 6, 2026, 6:31 PM

#

It usually errors out within the thinking process already. But sometimes it finishes thinking but then only gets not that far with the answer.

wind ember Feb 6, 2026, 6:32 PM

#

proud bobcat babe wake up

thats glm 5

proud bobcat Feb 6, 2026, 6:33 PM

#

....

#

every model that has alpha in its name

#

was a cloaked

#

openai model

#

oh my god the joke flew over my head

#

im such an idiot

burnt sinew Feb 6, 2026, 6:34 PM

#

burnt pulsar It usually errors out within the thinking process already. But sometimes it fini...

If you copy the thinking it'll pick up from there

burnt sinew Feb 6, 2026, 6:35 PM

#

burnt pulsar It usually errors out within the thinking process already. But sometimes it fini...

Does the server in your tag really have free ai usage like lmarena?

burnt pulsar Feb 6, 2026, 6:36 PM

#

burnt sinew If you copy the thinking it'll pick up from there

Yeah, that was a workaround that I've used so far.

burnt pulsar Feb 6, 2026, 6:37 PM

#

burnt sinew Does the server in your tag really have free ai usage like lmarena?

Well, yupp gives out credits and each prompt costs credits, but you get credity by reviewieng the output (with some gamification, so sometimes you lose some, sometimes you get much more credits per review).

#

Payout to EU has been suspended though, hence I didn't make any money there.

burnt sinew Feb 6, 2026, 6:38 PM

#

burnt pulsar Payout to EU has been suspended though, hence I didn't make any money there.

Like they don't give credits to eu?

burnt pulsar Feb 6, 2026, 6:38 PM

#

You still get credits, but you cannot cash out via Paypal to EU at the moment. But I am more in there for science and the access to the latest models.

burnt sinew Feb 6, 2026, 6:38 PM

#

burnt pulsar You still get credits, but you cannot cash out via Paypal to EU at the moment. B...

Cash out what??

#

You can make money from there?

burnt pulsar Feb 6, 2026, 6:39 PM

#

Credits = Money -> Cashing out your earned credits.

#

Yeah, 1000 Credits are 0,90 EUR at the moment.

burnt sinew Feb 6, 2026, 6:39 PM

#

burnt pulsar Well, yupp gives out credits and each prompt costs credits, but you get credity ...

How much free credits do they give you

#

Did leaderboards just update?

#

No announcement yet

burnt pulsar Feb 6, 2026, 6:41 PM

#

burnt sinew How much free credits do they give you

There is a base of free credits at the start. But you need to earn more credits to pay for each chat. The model costs vary. The earnings vary a lot, too.

burnt sinew Feb 6, 2026, 6:41 PM

#

Crazy

burnt pulsar Feb 6, 2026, 6:41 PM

#

It somehow works, though.

burnt sinew Feb 6, 2026, 6:41 PM

#

1502 THINKING MODEL to 1576 NON thinking

#

Aye there's the announcement

burnt pulsar Feb 6, 2026, 6:42 PM

#

But as I wasn't able to cash out at the moment (and it might take many more months to resolve it), it is more interesting for people outside of the EU.

burnt sinew Feb 6, 2026, 6:43 PM

#

@echo aurora What's the difference between code and text->coding

mighty surge Feb 6, 2026, 6:45 PM

#

wich is better rn? Opus 4.6 or Codex 5.3?

quartz pike Feb 6, 2026, 6:46 PM

#

opus absolutelly murdured the leaderboards lol

#

even tho it failed in my benchmark

burnt sinew Feb 6, 2026, 6:47 PM

#

From what

quartz pike Feb 6, 2026, 6:47 PM

#

to gemini 3 pro

burnt sinew Feb 6, 2026, 6:47 PM

#

quartz pike even tho it failed in my benchmark

What is

#

I mean what is that from polymarket?

#

Doesn't look like it

#

Ah

quartz pike Feb 6, 2026, 6:47 PM

#

burnt sinew What is

making a portfolio website

burnt sinew Feb 6, 2026, 6:48 PM

#

burnt sinew <@283397944160550928> What's the difference between code and text->coding

@echo aurora Also what was the point of not including thinking 4.6 and have just normal 4.6?

echo aurora Feb 6, 2026, 6:48 PM

#

burnt sinew <@283397944160550928> What's the difference between code and text->coding

Code is going to be Code Arena leaderboard & Text->Coding is coding tasks done in Text modality, leaderboard here.

echo aurora Feb 6, 2026, 6:48 PM

#

burnt sinew <@283397944160550928> Also what was the point of not including thinking 4.6 and ...

We're still gathering votes for thinking version.

burnt sinew Feb 6, 2026, 6:49 PM

#

echo aurora We're still gathering votes for thinking version.

Shouldnt there be more votes on the thinking one? I would assume more people would use that one

echo aurora Feb 6, 2026, 6:49 PM

#

burnt sinew Shouldnt there be more votes on the thinking one? I would assume more people wou...

Votes are generated via Battle, not Side by Side (where you can manually select a model)

burnt sinew Feb 6, 2026, 6:49 PM

#

echo aurora Code is going to be [Code Arena leaderboard](<https://arena.ai/leaderboard/code>...

Are there currently plans to make code arena eventually include non pure front-end tasks?

zealous sparrow Feb 6, 2026, 6:50 PM

#

@echo aurora I think 4.6 thinking has higher error rates

burnt sinew Feb 6, 2026, 6:50 PM

#

zealous sparrow <@283397944160550928> I think 4.6 thinking has higher error rates

It does he already forwarded it

zealous sparrow Feb 6, 2026, 6:50 PM

#

burnt sinew It does he already forwarded it

ah gotcha

echo aurora Feb 6, 2026, 6:51 PM

#

I wouldn't want to share more info about future plans until we're ready to, but overall our team is wanting to bring a lot more features to Code Arena

quartz pike Feb 6, 2026, 6:51 PM

#

echo aurora I wouldn't want to share more info about future plans until we're ready to, but ...

oooo

burnt sinew Feb 6, 2026, 6:51 PM

#

echo aurora I wouldn't want to share more info about future plans until we're ready to, but ...

Yeah thats fine

limber panther Feb 6, 2026, 6:52 PM

#

yo

#

4.6 opus is really good, the only issue it has is no access to external textures and libraries

#

when coding

#

im really excited for claude 5 sonnet tho

#

its supposed to be huge and even better at coding tasks than opus

stray aspen Feb 6, 2026, 6:54 PM

#

whats the rate limit of claude 4.6 think

limber panther Feb 6, 2026, 6:54 PM

#

stray aspen whats the rate limit of claude 4.6 think

i think like 10 prompts

#

or 15

stray aspen Feb 6, 2026, 6:55 PM

#

great

limber panther Feb 6, 2026, 6:55 PM

#

someone made a farm game using 4.6 opus

#

its good

stray aspen Feb 6, 2026, 6:55 PM

#

send it

burnt sinew Feb 6, 2026, 6:55 PM

#

limber panther 4.6 opus is really good, the only issue it has is no access to external textures...

Thats your job

limber panther Feb 6, 2026, 6:55 PM

#

burnt sinew Thats your job

u cant do that in lmarena

burnt sinew Feb 6, 2026, 6:56 PM

#

limber panther u cant do that in lmarena

Yeah you can provide it links to assets

limber panther Feb 6, 2026, 6:56 PM

#

i meant if it had access to search and browser websites that would be really great

limber panther Feb 6, 2026, 6:56 PM

#

burnt sinew Yeah you can provide it links to assets

does it work on the current models in arena?

#

if u provide links, it cannot open them or extract anything

burnt sinew Feb 6, 2026, 6:56 PM

#

limber panther does it work on the current models in arena?

It works on every single model anywhere..

stray aspen Feb 6, 2026, 6:57 PM

#

how do i send opus 4.6 think images

burnt sinew Feb 6, 2026, 6:57 PM

#

limber panther if u provide links, it cannot open them or extract anything

Yeah but it can include them as assets??

burnt sinew Feb 6, 2026, 6:57 PM

#

stray aspen how do i send opus 4.6 think images

Nope

limber panther Feb 6, 2026, 6:57 PM

#

burnt sinew Yeah but it can include them as assets??

so you send the links and say to put them in the assets?

#

@zealous sparrow

#

https://019c344b-e8de-7dd0-ae14-f03fa0c3273e.arena.site/

PlayStation 2

Check out what I built in Arena's Code Arena - Content is user-generated and unverified

burnt sinew Feb 6, 2026, 6:58 PM

#

limber panther if u provide links, it cannot open them or extract anything

You don't need to open a link to use assets in code

burnt sinew Feb 6, 2026, 6:58 PM

#

limber panther so you send the links and say to put them in the assets?

Yeah direct link to the images

#

And say the dimensions of the image

icy yew Feb 6, 2026, 6:58 PM

#

echo aurora I wouldn't want to share more info about future plans until we're ready to, but ...

What's the rate limit for Claude 4.6 thinking

topaz epoch Feb 6, 2026, 6:58 PM

#

i neeed help which is best opus 4.6 or 4.6 thinking for python coding?

icy yew Feb 6, 2026, 6:59 PM

#

topaz epoch i neeed help which is best opus 4.6 or 4.6 thinking for python coding?

Thinking ofc

burnt sinew Feb 6, 2026, 6:59 PM

#

Like I made flappy bird 1:1 clone using that it just took all asset links

limber panther Feb 6, 2026, 6:59 PM

#

burnt sinew You don't need to open a link to use assets in code

yea i realized, im dum lol

#

i just need to search for the links and ask it to put them as assets

topaz epoch Feb 6, 2026, 6:59 PM

#

icy yew Thinking ofc

thank dude

limber panther Feb 6, 2026, 7:00 PM

#

burnt sinew Like I made flappy bird 1:1 clone using that it just took all asset links

althought gemini 3 pro could put the assets itself, like planet textures without even asking for it

#

i think gemini 3 pro training data is pretty good

burnt sinew Feb 6, 2026, 7:03 PM

#

limber panther althought gemini 3 pro could put the assets itself, like planet textures without...

Yep

#

It did that for flappy bird

#

But it used external asset links

uneven peak Feb 6, 2026, 7:04 PM

#

limber panther Feb 6, 2026, 7:04 PM

#

burnt sinew But it used external asset links

yeah, my point is that gemini 3 pro has alot of assets in its training data and can use them without the user noticing

burnt sinew Feb 6, 2026, 7:04 PM

#

limber panther yeah, my point is that gemini 3 pro has alot of assets in its training data and ...

Yeah sure or just try asking opus to use its own asset links

iron laurel Feb 6, 2026, 7:09 PM

#

Capture_decran_2026-02-06_a_20.09.01.png

#

What is the level of thinking for 4.6 Thinking?

#

< or > than 32K?

honest verge Feb 6, 2026, 7:09 PM

#

iron laurel

16k I think

iron laurel Feb 6, 2026, 7:09 PM

#

honest verge 16k I think

So it's better to use Thinking 32K

honest verge Feb 6, 2026, 7:10 PM

#

They don't have it for now

rancid turtle Feb 6, 2026, 7:10 PM

#

Hello

honest verge Feb 6, 2026, 7:10 PM

#

iron laurel So it's better to use Thinking 32K

It's only for opus 4.5

#

Not for 4.6

iron laurel Feb 6, 2026, 7:10 PM

#

honest verge It's only for opus 4.5

Yes that's what I meant

echo aurora Feb 6, 2026, 7:11 PM

#

icy yew What's the rate limit for Claude 4.6 thinking

We don't have these rate limits publicly listed. Although this is something we're considering.

rancid turtle Feb 6, 2026, 7:11 PM

#

is arena ai downloaded for ios or no?

inner relic Feb 6, 2026, 7:12 PM

#

Did they fix the response bug

icy yew Feb 6, 2026, 7:12 PM

#

rancid turtle is arena ai downloaded for ios or no?

It's a website

echo aurora Feb 6, 2026, 7:12 PM

#

rancid turtle is arena ai downloaded for ios or no?

Nope, there isn't an app, but this may change one day.

echo aurora Feb 6, 2026, 7:12 PM

#

inner relic Did they fix the response bug

Which bug are you referring to?

inner relic Feb 6, 2026, 7:13 PM

#

"Something went wrong"

rancid turtle Feb 6, 2026, 7:13 PM

#

echo aurora Nope, there isn't an app, but this may change one day.

Okay, thank you

inner relic Feb 6, 2026, 7:13 PM

#

echo aurora Which bug are you referring to?

i am talking about the bug that cuts the response off.

burnt sinew Feb 6, 2026, 7:14 PM

#

inner relic "Something went wrong"

You mean after it thinks for a while?

echo aurora Feb 6, 2026, 7:15 PM

#

inner relic i am talking about the bug that cuts the response off.

I'd encourage you to scan #1343291835845578853 for a similar post and share/tag me there. Or if there isn't one that directly lines up with the problems you're having create a new post.

icy yew Feb 6, 2026, 7:17 PM

#

echo aurora I'd encourage you to scan <#1343291835845578853> for a similar post and share/ta...

Btw is Claude 4.6 dynamic thinking is the same on arena

rigid holly Feb 6, 2026, 7:18 PM

#

Alright i tried the 4.6 model in writing stories

Dont know what to think honestly

Like the writing is not BAD

But it feels drier in dialogue for one than previous models

Can't speak about code or other such things. I Dont use ai models for such things as code or image generation

inner relic Feb 6, 2026, 7:19 PM

#

I am using claude opus non thinking and this happens..

rigid holly Feb 6, 2026, 7:21 PM

#

Oh yeah that happened to me to in writing. I was using thinking tho. 4k words work i think cuz i also had it work on shorter chapters, but 8k words get crashed

honest verge Feb 6, 2026, 7:21 PM

#

rigid holly Oh yeah that happened to me to in writing. I was using thinking tho. 4k words wo...

I think opus 4.6 has some limits

#

It says every time "I have to make it in my limit of 20 steps"

#

When it thinks

rigid holly Feb 6, 2026, 7:22 PM

#

Tf does 20 steps even mean

icy yew Feb 6, 2026, 7:22 PM

#

honest verge It says every time "I have to make it in my limit of 20 steps"

What's 20 steps

rigid holly Feb 6, 2026, 7:24 PM

#

I will say this tho. The restrictions are more loose in what it rejects from writing than the previous model

latent merlin Feb 6, 2026, 7:24 PM

#

Hello I am new here

honest verge Feb 6, 2026, 7:27 PM

#

rigid holly Tf does 20 steps even mean

Idk

#

But it says 20 steps

icy yew Feb 6, 2026, 7:32 PM

#

honest verge I think opus 4.6 has some limits

Soo the thinking is set to high could be the cause of the thinking limit making it get a error

topaz epoch Feb 6, 2026, 7:32 PM

#

Bro i was using 4.6 thinking and it keep getting stuck in the middle because of thinking

icy yew Feb 6, 2026, 7:32 PM

#

Idk

topaz epoch Feb 6, 2026, 7:32 PM

#

icy yew Soo the thinking is set to high could be the cause of the thinking limit making ...

Exactly

prisma cipher Feb 6, 2026, 7:57 PM

#

icy yew Soo the thinking is set to high could be the cause of the thinking limit making ...

The lmarena team needs to increase the context limit in all responses for models of this type; this way, that frequent "error" is avoided. It even avoids giving the model instructions for response limitations, while also saving time and writing.

#

The typical limit is approximately 8350 words per answer in Claude, but lmarena has to increase the limit until everything is completely finished and not limited.

honest verge Feb 6, 2026, 8:01 PM

#

prisma cipher The typical limit is approximately 8350 words per answer in Claude, but lmarena ...

You can do it without this but it will require splitting prompts

prisma cipher Feb 6, 2026, 8:04 PM

#

honest verge You can do it without this but it will require splitting prompts

Mine requires a detailed and complete answer; therefore, it is necessary to apply these restrictions, instead of putting everything in a single response.

honest verge Feb 6, 2026, 8:22 PM

#

Finally opus 4.6 thinking actually think for some time not just for 1 second

echo aurora Feb 6, 2026, 8:43 PM

#

prisma cipher The lmarena team needs to increase the context limit in all responses for models...

This has been flagged to the team btw.

icy yew Feb 6, 2026, 8:44 PM

#

echo aurora This has been flagged to the team btw.

This is probably the core issue regarding the errors when using Claude 4.6

#

Hope it gets fix soon

gleaming roost Feb 6, 2026, 8:47 PM

#

evilcat

honest verge Feb 6, 2026, 8:48 PM

#

gleaming roost <:evilcat:1161027214645665842>

Idk it doesn't happen to me now

gleaming roost Feb 6, 2026, 8:49 PM

#

Perhaps it's just a matter of luck

icy yew Feb 6, 2026, 8:50 PM

#

gleaming roost <:evilcat:1161027214645665842>

It's fixed for me

gleaming roost Feb 6, 2026, 8:54 PM

#

#

peepoCry

granite tide Feb 6, 2026, 8:55 PM

#

opus opus let me use opus

honest verge Feb 6, 2026, 8:56 PM

#

Please arena I need opus 4.6 thinking 32k

#

My opus 4.6 thinking is kinda homeless

#

I live with my opus 4.5 32k

honest verge Feb 6, 2026, 8:57 PM

#

gleaming roost

Maybe it can't make too much code or text

#

Because of the limit

prisma cipher Feb 6, 2026, 9:04 PM

#

echo aurora This has been flagged to the team btw.

Thank you for the notification.

uneven peak Feb 6, 2026, 9:08 PM

#

@echo aurora how old are you? NGS_smiles

icy yew Feb 6, 2026, 9:09 PM

#

uneven peak <@283397944160550928> how old are you? <:NGS_smiles:931827407780970536>

Asking a manger his age

#

https://cdn.discordapp.com/attachments/1276881295447691330/1354443134377005128/attachment-7.gif

uneven peak Feb 6, 2026, 9:10 PM

#

Ayo chill fam 😭

limber panther Feb 6, 2026, 9:23 PM

#

@echo aurora why do i get an error that corrupts the whole project, when I use the coding arena?

#

i tried this in many chats, and 50% of the chats get corrupted at the end of coding when publishing the app

prisma cipher Feb 6, 2026, 9:26 PM

#

limber panther <@283397944160550928> why do i get an error that corrupts the whole project, whe...

Read the following:

#general message

limber panther Feb 6, 2026, 9:27 PM

#

prisma cipher Read the following: https://canary.discord.com/channels/1340554757349179412/134...

so its the context limit destroying the whole project when its done coding?

prisma cipher Feb 6, 2026, 9:29 PM

#

limber panther so its the context limit destroying the whole project when its done coding?

It is the limit of the context that prevents the work from being finished, that is why the error appears.

prisma cipher Feb 6, 2026, 9:31 PM

#

limber panther so its the context limit destroying the whole project when its done coding?

You can limit it to 8350 words per response and see if it actually finishes the job. You can try it.

limber panther Feb 6, 2026, 9:33 PM

#

prisma cipher You can limit it to 8350 words per response and see if it actually finishes the ...

but my specific prompt was to produce long and complete code...

#

seems like arena.ai wont allow that

prisma cipher Feb 6, 2026, 9:34 PM

#

limber panther seems like arena.ai wont allow that

Use the model in text mode and generate the code to copy it later. Use reasoning mode.

limber panther Feb 6, 2026, 9:35 PM

#

prisma cipher Use the model in text mode and generate the code to copy it later. Use reasoning...

code arena is the only affected part of the context limit?

golden ocean Feb 6, 2026, 9:35 PM

#

icy yew

i'm a new soul, i came to this strange world, hoping i could learn a bit about how to give and take but since i came here felt the joy and the fear finding myself making every possible mistake

prisma cipher Feb 6, 2026, 9:36 PM

#

limber panther code arena is the only affected part of the context limit?

No. Text mode is included.

limber panther Feb 6, 2026, 9:36 PM

#

prisma cipher No. Text mode is included.

yeah it does cut mid coding

#

but it doesnt corrupt the whole chat

#

like code arena...

prisma cipher Feb 6, 2026, 9:39 PM

#

limber panther like code arena...

It includes instructions for dividing the answer into parts, with each part having a maximum of 8350 words. It's useful.

#

The other thing is to write the code directly, clean, without comments, without artificial simplification, and completely unified. It's a very powerful instruction.

red meadow Feb 6, 2026, 9:46 PM

#

why am i always getting an error when claude 4.6 gets done with its task in code mode?

echo aurora Feb 6, 2026, 9:47 PM

#

red meadow why am i always getting an error when claude 4.6 gets done with its task in code...

We are looking into these reported problems, but it's worth trying these steps in the meantime as they may help: https://help.arena.ai/articles/1645798556-lmarena-how-to-something-went-wrong-with-this-response-error-message

Arena Troubleshooting: Something went wrong with this response... e...

You may sometimes see the error message: “Something went wrong with this response, please try again.”
This is a general error message. It can

icy yew Feb 6, 2026, 9:50 PM

#

Now this is impressive
W Claude
https://vt.tiktok.com/ZSaw8kyst/

TikTok

TikTok · AstroKobi

11.3K likes, 139 comments. “NASA just let AI drive their $3 Billion Mars Rover!”

hollow snow Feb 6, 2026, 9:51 PM

#

where is this opus 4.6 think in the leaderboard

prisma cipher Feb 6, 2026, 9:54 PM

#

limber panther yeah it does cut mid coding

Include these instructions at the end of your prompt:

Each response must be consistent with all of the above and without deviations, proactively correcting anything without waiting for explicit instructions from the user.```

This instruction is very powerful, especially when it is something serious and in production mode, but useful for testing the model's capabilities.

toxic verge Feb 6, 2026, 9:54 PM

#

red meadow why am i always getting an error when claude 4.6 gets done with its task in code...

Time it out space out requests

prisma cipher Feb 6, 2026, 9:54 PM

#

Good luck.

limber panther Feb 6, 2026, 9:54 PM

#

prisma cipher Include these instructions at the end of your prompt: ```Generate the definitiv...

thanks for helping man

stray aspen Feb 6, 2026, 9:55 PM

#

#

claude4.6 gave me a pretty nice roblocks camera system

limber panther Feb 6, 2026, 10:05 PM

#

stray aspen claude4.6 gave me a pretty nice roblocks camera system

can it code an nice GUI

#

for roadblocks

#

💀

stray aspen Feb 6, 2026, 10:07 PM

#

yes

#

its actually cooking lol

#

and its not even the thinking version

limber panther Feb 6, 2026, 10:07 PM

#

stray aspen its actually cooking lol

show me the image, cuz i wanna see

limber panther Feb 6, 2026, 10:08 PM

#

stray aspen and its not even the thinking version

i used 4.6 opus in html games today, it cooks good

viral cedar Feb 6, 2026, 10:08 PM

#

how is claude 4.6 opus like 5x better than gemini 3 pro

limber panther Feb 6, 2026, 10:08 PM

#

i made a good solar system simulator with assets

limber panther Feb 6, 2026, 10:08 PM

#

viral cedar how is claude 4.6 opus like 5x better than gemini 3 pro

dw, sonnet 5 is gonna be even better

#

sonnet 5 is the master of coding

prisma cipher Feb 6, 2026, 10:10 PM

#

limber panther i made a good solar system simulator with assets

Do you have online samples so I can see how it actually works?

limber panther Feb 6, 2026, 10:10 PM

#

prisma cipher Do you have online samples so I can see how it actually works?

its not complete yet, but i can give u the arena.ai test link

#

cuz it has a minor bug in loading Earth's texture

#

https://019c34b5-2776-7bb5-b2a6-4d6ba40b36d1.arena.site/

3D Planet Earth Simulator

Check out what I built in Arena's Code Arena - Content is user-generated and unverified

viral cedar Feb 6, 2026, 10:12 PM

#

limber panther dw, sonnet 5 is gonna be even better

google lowkey kinda disappoting me rn

#

they gotta catch up

limber panther Feb 6, 2026, 10:12 PM

#

viral cedar google lowkey kinda disappoting me rn

google is cooking, we just wait

viral cedar Feb 6, 2026, 10:12 PM

#

limber panther google is cooking, we just wait

like idk how benchmarks show that claude 4.6 opus is like 3% better or sum

#

but its even evident w/ webdev and documentation making

#

i gave it requirements saying make me documentation for so and so programming lang, and it completely half asses it and ignores half of my instructions.

limber panther Feb 6, 2026, 10:13 PM

#

viral cedar like idk how benchmarks show that claude 4.6 opus is like 3% better or sum

4.6 opus is way better than 4.5 opus, if you test them both you gonna see a big difference

viral cedar Feb 6, 2026, 10:13 PM

#

meanwhile claude 4.6 opus basically turns into slave and acts like its being held at gunpoint

limber panther Feb 6, 2026, 10:13 PM

#

viral cedar meanwhile claude 4.6 opus basically turns into slave and acts like its being hel...

lol

viral cedar Feb 6, 2026, 10:14 PM

#

or even a simple half-assed prompt saying make me UI like palantir

#

gemini and 4.5 opus will just half-ass it as usual

#

4.6 opus will immiedately cook up and make u whole UI lib that actually looks decent and is bug free for most part

prisma cipher Feb 6, 2026, 10:16 PM

#

limber panther https://019c34b5-2776-7bb5-b2a6-4d6ba40b36d1.arena.site/

It's a start. What I like is space warfare and first-person perspective.

#

I have in mind that Opus 4.6 will help me create a unique and realistic universe to integrate my character into, but I will do that at some point if possible.

#

Console, PC, and mobile games are linear and feature repetitive stories. My universe will be more than that.

#

It's just for playing around for a while, not for getting addicted.

toxic verge Feb 6, 2026, 10:33 PM

#

viral cedar i gave it requirements saying make me documentation for so and so programming la...

All ai does this . This is the way they’re optimized.

atomic lagoon Feb 6, 2026, 10:47 PM

#

viral cedar i gave it requirements saying make me documentation for so and so programming la...

Because you need to go more in depth for it. Also tell it to not be basic or be lazy whatsoever or it will be by default. AI does what’s fastest not what’s best quality

solar hollow Feb 6, 2026, 10:47 PM

#

poll_question_text

is opus 4.6 an improvement on 4.5?

victor_answer_votes

7

total_votes

11

victor_answer_id

1

victor_answer_text

yes

limber panther Feb 6, 2026, 10:49 PM

#

4.6 is significantly better than 4.5 once you test it yourself

stray aspen Feb 6, 2026, 10:55 PM

#

we need image uploads for opus 4.6

north obsidian Feb 6, 2026, 10:55 PM

#

stray aspen we need image uploads for opus 4.6

Up

echo aurora Feb 6, 2026, 11:02 PM

#

stray aspen we need image uploads for opus 4.6

Big +1

#

This is being worked on

steep jewel Feb 6, 2026, 11:07 PM

#

limber panther https://019c34b5-2776-7bb5-b2a6-4d6ba40b36d1.arena.site/

this is terrible

steep jewel Feb 6, 2026, 11:08 PM

#

steep jewel this is terrible

i could pitch you like 100 much better ways to simulate a plent's atmosphere. i made procedural textures in blender in like 5 minutes that look 100x better than this. not to mention the shadows just dont work

limber panther Feb 6, 2026, 11:09 PM

#

steep jewel this is terrible

its non thinking, and my prompt was pretty simple

steep jewel Feb 6, 2026, 11:09 PM

#

limber panther its non thinking, and my prompt was pretty simple

yeah fair enough

limber panther Feb 6, 2026, 11:09 PM

#

steep jewel i could pitch you like 100 much better ways to simulate a plent's atmosphere. i ...

yeah but, you're not my AI assistant who responds less than 5 seconds..

#

lol

steep jewel Feb 6, 2026, 11:10 PM

#

limber panther yeah but, you're not my AI assistant who responds less than 5 seconds..

yes, and what i make also isnt a steaming pile of dogshit

limber panther Feb 6, 2026, 11:10 PM

#

steep jewel yes, and what i make also isnt a steaming pile of dogshit

yeah ik devs are better than AI at coding, but this is impressive for someone who doesn't know a bit about coding languages

steep jewel Feb 6, 2026, 11:10 PM

#

limber panther https://019c34b5-2776-7bb5-b2a6-4d6ba40b36d1.arena.site/

nobody would choose this over something made by a human. its only value comes from being extremely easy to have an ai make for you and free, which is not negligible but necessary to understand you're pitching quantity > quality

limber panther Feb 6, 2026, 11:11 PM

#

steep jewel nobody would choose this over something made by a human. its only value comes fr...

sonnet 5 is also reported to be much better than this 4.6 opus

steep jewel Feb 6, 2026, 11:11 PM

#

limber panther sonnet 5 is also reported to be much better than this 4.6 opus

at coding? no lol

stray aspen Feb 6, 2026, 11:11 PM

#

how do i use sonnet 5

limber panther Feb 6, 2026, 11:11 PM

#

steep jewel at coding? no lol

which is said to be released late feb early march

steep jewel Feb 6, 2026, 11:12 PM

#

stray aspen how do i use sonnet 5

you cant yet, its not released

limber panther Feb 6, 2026, 11:12 PM

#

steep jewel at coding? no lol

at 83% SWE bench, its way better

steep jewel Feb 6, 2026, 11:12 PM

#

limber panther at 83% SWE bench, its way better

oh ok i didnt see

#

did they put out the numbers

#

weird they'd make a sonnet model super good at coding when coding is quality > quantity

limber panther Feb 6, 2026, 11:13 PM

#

steep jewel did they put out the numbers

from leaks, it seems to be close to that number. no one knows anythin yet

steep jewel Feb 6, 2026, 11:13 PM

#

and opus is supposed to be good at complex, structured tasks

steep jewel Feb 6, 2026, 11:13 PM

#

limber panther from leaks, it seems to be close to that number. no one knows anythin yet

leaks are usually pretty terrible sources of information. this goes for anything

limber panther Feb 6, 2026, 11:14 PM

#

steep jewel leaks are usually pretty terrible sources of information. this goes for anything

yup, there are no actual sources on it yet

steep jewel Feb 6, 2026, 11:14 PM

#

literally any time one of the big ai companies does anything now you have 60 wojaks on twitter saying its agi superintelligence from the preview builds they've sent out

limber panther Feb 6, 2026, 11:14 PM

#

when 4.6 opus launched, it was pretty decent not too impressive

steep jewel Feb 6, 2026, 11:14 PM

#

yeah i tried it. its pretty good

limber panther Feb 6, 2026, 11:15 PM

#

people expected sonnet 5 with better coding and stuff, but it got delayed

#

also sonnet is pretty cheap at $3 per input / $15 per output compared to opus

steep jewel Feb 6, 2026, 11:16 PM

#

i've been thinking about a system of fine tuning over the top of the base model where you have a few elo based examples the ai is trained to respond like to specific criteria. essentially what is already done with safety but for code

#

i also believe you can create a "perceived prompt" that the ai sees and the stupid half-thought-out prompt given by the human. you have an intermediary ai that goes in and edits the prompt so its good and leaves little to the stochastic imagination

#

nano banana already does this, as well as hunyuan, qwen, and most other ai companies

proud bobcat Feb 6, 2026, 11:25 PM

#

K2.5 instant is really strong

#

Damn

#

Thinking mogs it but it’s nice to see

prisma cipher Feb 6, 2026, 11:29 PM

#

limber panther also sonnet is pretty cheap at $3 per input / $15 per output compared to opus

What is known about the context? Is it 1 million or more?

modest prism Feb 6, 2026, 11:31 PM

#

Please help how do I fix opus 4.6 thinking timeout error

verbal nimbus Feb 6, 2026, 11:32 PM

#

proud bobcat K2.5 instant is really strong

What's surprising is how well it scores on long context

toxic verge Feb 6, 2026, 11:33 PM

#

I think that’s the key here

#

It could also be argued and a case can be made that perhaps is actually occurring isn’t necessarily an improved model as much as it could be improved memory and hardware on their end

verbal nimbus Feb 6, 2026, 11:34 PM

#

It's the only open source model that's competitive on long context reading comprehension:

#

Gemini 3 Flash's score is insane, but Opus 4.6 scores higher in MRCR needle-in-a-haystack. Opus is still not on the above benchmark yet though.

thorny drum Feb 6, 2026, 11:35 PM

#

was opus 4.6 an anon model first

toxic verge Feb 6, 2026, 11:35 PM

#

Gemini is fraud

#

Probably has the worst memory issues of all the models

#

That’s how I feel when I use Gemini

verbal nimbus Feb 6, 2026, 11:36 PM

#

toxic verge Probably has the worst memory issues of all the models

On the app, yes, I think they do prune parts of the context. You can actually test this by asking the fast model to output a transcript of the entire chat so far. If you closely, it actually leaves parts out (not entirely sure if it's just the model, but probably not).

toxic verge Feb 6, 2026, 11:37 PM

#

I don’t even bother for one reason only I don’t code and I don’t see the reason for long text because you’re still dealt with the problem of the models all hedging hard

verbal nimbus Feb 6, 2026, 11:38 PM

#

I guess I can test it more but it's a bit time consuming to recreate a long convo.

toxic verge Feb 6, 2026, 11:38 PM

#

It’s like musical chairs

#

They alter the words and meanings of the semantics and hedging is one of the most messed up things about AI in my opinion

#

Grant more authority to model than it does to users intent

#

#

Here’s an example

proud bobcat Feb 6, 2026, 11:47 PM

#

verbal nimbus It's the only open source model that's competitive on long context reading compr...

Another day another Chinese banger

toxic verge Feb 6, 2026, 11:48 PM

#

#

You see how it alters the words now imagine with a long context

#

It completely stripped away the emotion, the individuality, the uniqueness of expression from my statement into

#

#

Look kimi instant

#

#

Va thinking

#

stray tusk Feb 7, 2026, 12:18 AM

#

Hi

molten robin Feb 7, 2026, 12:19 AM

#

i hate this endless generating bug so much.

main nexus Feb 7, 2026, 12:21 AM

#

molten robin i hate this endless generating bug so much.

ong 😭

molten robin Feb 7, 2026, 12:22 AM

#

main nexus ong 😭

it slows down my LUA experiments SO MUCH

verbal nimbus Feb 7, 2026, 12:37 AM

#

toxic verge

Wow, what a prompt 🤣

rugged abyss Feb 7, 2026, 12:40 AM

#

What model is beluga? Is this an alias or do I just not know that model?

hazy forge Feb 7, 2026, 12:41 AM

#

show the output we may be able to tell

rugged abyss Feb 7, 2026, 12:51 AM

#

I got it again, its from Amazon
https://codepen.io/Emilio-the-encoder/pen/raLZKRo

hazy forge Feb 7, 2026, 1:02 AM

#

seem to be actually pretty good

shrewd citrus Feb 7, 2026, 1:05 AM

#

rugged abyss I got it again, its from Amazon https://codepen.io/Emilio-the-encoder/pen/raLZKR...

wait when did bezoz step down from ceo 💀

green yacht Feb 7, 2026, 1:06 AM

#

shrewd citrus wait when did bezoz step down from ceo 💀

5 years ago ish

echo aurora Feb 7, 2026, 1:06 AM

#

shrewd citrus wait when did bezoz step down from ceo 💀

July 2021 😏

shrewd citrus Feb 7, 2026, 1:07 AM

#

so for the past 5 years I’ve been thinking that Jeff was still the ceo 😭

balmy mist Feb 7, 2026, 1:36 AM

#

which company is pony alpha??

frosty lava Feb 7, 2026, 2:00 AM

#

balmy mist which company is pony alpha??

where do you found it ?

stray aspen Feb 7, 2026, 2:00 AM

#

<@&1349916362595635286>

glacial dock Feb 7, 2026, 2:44 AM

#

How the quack is everyone doing

hardy lion Feb 7, 2026, 2:49 AM

#

glacial dock How the quack is everyone doing

pretty good, how about you Ducky?

copper cape Feb 7, 2026, 2:52 AM

#

is here anybody looking for the developer?

glacial dock Feb 7, 2026, 3:07 AM

#

hardy lion pretty good, how about you Ducky?

Just working to pay that duck support

toxic verge Feb 7, 2026, 3:22 AM

#

spare rune Feb 7, 2026, 3:22 AM

#

molten robin it slows down my LUA experiments SO MUCH

Lua or luau

molten robin Feb 7, 2026, 3:23 AM

#

spare rune Lua or luau

glua

spare rune Feb 7, 2026, 3:23 AM

#

Oh

#

dot1

molten robin Feb 7, 2026, 3:23 AM

#

I do GMod lua ai experiments

spare rune Feb 7, 2026, 3:23 AM

#

Ok

#

Omg broo

#

Why is ro * lox a banned world

#

I’m gonna die

strange sluice Feb 7, 2026, 3:26 AM

#

spare rune Why is ro * lox a banned world

because life isnt always roadblocks

toxic verge Feb 7, 2026, 3:27 AM

#

Cuz of scammers

spare rune Feb 7, 2026, 3:44 AM

#

toxic verge Cuz of scammers

Elon Musk

#

See

#

It works

#

free

#

Money

#

Free money

#

bitcoin

#

Btw

old garden Feb 7, 2026, 3:51 AM

#

strange sluice because life isnt always roadblocks

https://tenor.com/view/grilled-chicken-grilled-chicken-eat-grilled-chicken-eat-gif-9823209876167517923

Tenor

strange sluice Feb 7, 2026, 4:04 AM

#

old garden https://tenor.com/view/grilled-chicken-grilled-chicken-eat-grilled-chicken-eat-g...

hi

old garden Feb 7, 2026, 4:04 AM

#

strange sluice hi

toxic verge Feb 7, 2026, 4:22 AM

#

Who is that

fiery gull Feb 7, 2026, 4:27 AM

#

toxic verge Who is that

Let's say a guy who liked children a lot

toxic verge Feb 7, 2026, 4:29 AM

#

Like cheese pizza?

austere sundial Feb 7, 2026, 5:03 AM

#

OMG Lmarena stopped Someone is work with some function that I probably won't use

sturdy mica Feb 7, 2026, 5:49 AM

#

old garden

diddy baszucki

#

hell no

old garden Feb 7, 2026, 5:49 AM

#

fiery gull Let's say a guy who liked children a lot

david baszuki?

old garden Feb 7, 2026, 5:49 AM

#

sturdy mica hell no

what about it

old garden Feb 7, 2026, 5:49 AM

#

sturdy mica hell no

https://tenor.com/view/i'm-new-here-say-hi-discord-im-new-here-say-hi-im-new-here-say-hi-discord-new-member-discord-new-gif-8065649195492145845

Tenor

sturdy mica Feb 7, 2026, 5:49 AM

#

websim is cancer

#

dude you have so much stuff

old garden Feb 7, 2026, 5:50 AM

#

ik websim is lowk going bankrupt or somethingf

#

i have 1.2k folowers on ther i think

sturdy mica Feb 7, 2026, 5:50 AM

#

old garden https://tenor.com/view/i%27m-new-here-say-hi-discord-im-new-here-say-hi-im-new-h...

i joined this server like months ago i just rejoined after leaving for a while you can search my name

old garden Feb 7, 2026, 5:50 AM

#

ok

sturdy mica Feb 7, 2026, 5:54 AM

#

holy

#

your websim page is full of slop

#

nobody is playing this bro 🙏

sturdy mica Feb 7, 2026, 5:58 AM

#

old garden ok

old garden Feb 7, 2026, 5:58 AM

#

sturdy mica nobody is playing this bro 🙏

ik

#

iwas just testing

#

if the ai knew how to make btools system

#

i never released that game to the public

#

many of my projects are unrelased

#

like 99% of them

sturdy mica Feb 7, 2026, 5:58 AM

#

yes you did

#

i was playing it

#

all your private games are public

#

theres a lot

old garden Feb 7, 2026, 5:59 AM

#

https://websim.com/@Trey6383/sonic-2d/50 i mean ive been working on this

Sonic 2D

#

lately

#

and this https://websim.com/@Trey6383/earthbound/83

earthbound

sturdy mica Feb 7, 2026, 5:59 AM

#

its too fast

old garden Feb 7, 2026, 5:59 AM

#

ik

sturdy mica Feb 7, 2026, 5:59 AM

#

if you stay still you glide lol

#

wow this game is awesome

old garden Feb 7, 2026, 5:59 AM

#

ive just been so caught up with my more important projects that

#

i havent had time to work on

#

the quality ones

old garden Feb 7, 2026, 6:00 AM

#

sturdy mica its too fast

you can see though that i was working oj changing the speed, the sliders of speed, and accel rates

#

#

sturdy mica Feb 7, 2026, 6:00 AM

#

cool

#

what game is that

old garden Feb 7, 2026, 6:01 AM

#

earthbound

#

https://en.wikipedia.org/wiki/EarthBound

EarthBound

EarthBound, originally released in Japan as Mother 2: Gīgu no Gyakushū, is a 1994 role-playing video game developed by Ape Inc. (now Creatures Inc.) and HAL Laboratory and published by Nintendo for the Super Nintendo Entertainment System. The second entry in the Mother series, it follows a young boy named Ness and his party of Paula, Jeff and ...

sturdy mica Feb 7, 2026, 6:01 AM

#

yeah i mean on websim

old garden Feb 7, 2026, 6:02 AM

#

o

old garden Feb 7, 2026, 6:02 AM

#

old garden and this https://websim.com/@Trey6383/earthbound/83

this ^

#

the intro is bad rn

#

i havent had time to fix it

#

but recently a lot of websim staff have been fired
free credits have been removed
some of the other popular users are just quitting

sturdy mica Feb 7, 2026, 6:04 AM

#

websim has always been slop

#

lol

old garden Feb 7, 2026, 6:08 AM

#

i personally dont agree with that statement
it was so good
when free users got 50 free gens a day
and the team gave out free max subscriptions (i was one of the first to get one)

sturdy mica Feb 7, 2026, 6:16 AM

#

old garden i personally dont agree with that statement it was so good when free users got...

no im talking about the cheap AI generated games on the front page

#

nothing was ever fun on that platform

old garden Feb 7, 2026, 6:16 AM

#

thats not really websims fault