#general | Arena | Page 253

verbal nimbus Feb 8, 2026, 10:39 PM

#

GPT 5.2 high

honest verge Feb 8, 2026, 10:39 PM

#

verbal nimbus GPT 5.2 high

lol

verbal nimbus Feb 8, 2026, 10:39 PM

#

Maybe it needs some parameter tweaking

honest verge Feb 8, 2026, 10:39 PM

#

WHAT IS THIS

surreal zephyr Feb 8, 2026, 10:39 PM

#

5.3 codex nailed it btw

surreal zephyr Feb 8, 2026, 10:40 PM

#

verbal nimbus GPT 5.2 high

Looks like bugged start positions to me

verbal nimbus Feb 8, 2026, 10:40 PM

#

📎 Vehicle_building_sim_prompt.txt

north obsidian Feb 8, 2026, 10:40 PM

#

surreal zephyr 5.3 codex nailed it btw

When will they launch the codex 5.3 here

honest verge Feb 8, 2026, 10:41 PM

#

north obsidian When will they launch the codex 5.3 here

When mistral 4 comes out

surreal zephyr Feb 8, 2026, 10:41 PM

#

north obsidian When will they launch the codex 5.3 here

Its not for api users ez

#

W openai for gatekeeping

north obsidian Feb 8, 2026, 10:42 PM

#

surreal zephyr Its not for api users ez

Yeah but is there anything about is it coming soon?

surreal zephyr Feb 8, 2026, 10:42 PM

#

north obsidian Yeah but is there anything about is it coming soon?

Maybe in a week

north obsidian Feb 8, 2026, 10:42 PM

#

Oh nice

honest verge Feb 8, 2026, 10:43 PM

#

surreal zephyr W openai for gatekeeping

Oai sucks at open source models and api

#

Also sucks for deleting 4o

#

4o is Still the best

surreal zephyr Feb 8, 2026, 10:44 PM

#

honest verge Also sucks for deleting 4o

4o crap and still accessible wym

quartz light Feb 8, 2026, 10:44 PM

#

opus 4.6

honest verge Feb 8, 2026, 10:45 PM

#

quartz light opus 4.6

Hmmm

#

Not bad

surreal zephyr Feb 8, 2026, 10:45 PM

#

Still here for a week

quartz light Feb 8, 2026, 10:45 PM

#

honest verge Not bad

(its drivable)

surreal zephyr Feb 8, 2026, 10:45 PM

#

quartz light opus 4.6

The suspension physics are the hard part duh

verbal nimbus Feb 8, 2026, 10:46 PM

#

Opus 4.6 is overthinking

#

It's been thinking for ages

quartz light Feb 8, 2026, 10:46 PM

#

verbal nimbus Opus 4.6 is overthinking

where is this

#

arena?

verbal nimbus Feb 8, 2026, 10:46 PM

#

quartz light where is this

Yeah Code Arena

quartz light Feb 8, 2026, 10:46 PM

#

itll time out

verbal nimbus Feb 8, 2026, 10:47 PM

#

It should count as a fail if it times out

surreal zephyr Feb 8, 2026, 10:47 PM

#

verbal nimbus Opus 4.6 is overthinking

Yeah thats why i dont like it

verbal nimbus Feb 8, 2026, 10:47 PM

#

It can't just overthink and time out whenever the problem is too hard, then get the vote forefeited

surreal zephyr Feb 8, 2026, 10:47 PM

#

Opus be like gpt 5.2 xxxxxhigh

honest verge Feb 8, 2026, 10:47 PM

#

quartz light opus 4.6

Mistral 3 large

Screenshot_2026-02-09-01-46-59-026_com.teejay.trebedit-edit.jpg

#

Peak

surreal zephyr Feb 8, 2026, 10:47 PM

#

honest verge Mistral 3 large

😭

honest verge Feb 8, 2026, 10:47 PM

#

best ai model ever?

verbal nimbus Feb 8, 2026, 10:48 PM

#

verbal nimbus It's been thinking for ages

Still thinking...

surreal zephyr Feb 8, 2026, 10:48 PM

#

verbal nimbus Still thinking...

💔

#

An llm who thinks all the time

#

Has nothing to think about except thoughts

#

🥀

verbal nimbus Feb 8, 2026, 10:48 PM

#

Tokens: 💸

surreal zephyr Feb 8, 2026, 10:49 PM

#

I got 1m context window

#

Im gona use the 1m context window

#

🤣

#

60$ per prompt btw 😭 ✌️

verbal nimbus Feb 8, 2026, 10:49 PM

#

Is this the summary or the actual thinking

honest verge Feb 8, 2026, 10:49 PM

#

verbal nimbus Still thinking...

I HATE OPUS 4.6 THINKING MISTRAL IS BETTER

Screenshot_2026-02-09-01-49-13-242_com.android.chrome-edit.jpg

#

ITS IMPOSSIBLE

verbal nimbus Feb 8, 2026, 10:50 PM

#

Because it says "I did X" and "I did Y" but I don't see it

surreal zephyr Feb 8, 2026, 10:50 PM

#

honest verge I HATE OPUS 4.6 THINKING MISTRAL IS BETTER

It ran outa tokens

surreal zephyr Feb 8, 2026, 10:50 PM

#

verbal nimbus Because it says "I did X" and "I did Y" but I don't see it

All models do that

honest verge Feb 8, 2026, 10:50 PM

#

surreal zephyr It ran outa tokens

That's why mistral is better

surreal zephyr Feb 8, 2026, 10:50 PM

#

honest verge That's why mistral is better

How many token it even has bro

verbal nimbus Feb 8, 2026, 10:50 PM

#

surreal zephyr All models do that

With Gemini/ChatGPT it's the summary, so it is actually doing it yeah

surreal zephyr Feb 8, 2026, 10:50 PM

#

verbal nimbus With Gemini/ChatGPT it's the summary, so it is actually doing it yeah

Ya

#

Opus has A LOT of internal thinking

verbal nimbus Feb 8, 2026, 10:50 PM

#

But this one looks like the actual thinking 🤔

surreal zephyr Feb 8, 2026, 10:50 PM

#

Like more than all other models

honest verge Feb 8, 2026, 10:51 PM

#

surreal zephyr How many token it even has bro

Maybe 5 tokens

surreal zephyr Feb 8, 2026, 10:51 PM

#

honest verge Maybe 5 tokens

6 if im feeling generous ahh

verbal nimbus Feb 8, 2026, 10:51 PM

#

Like is this a summary or is it actually hallucinating

surreal zephyr Feb 8, 2026, 10:51 PM

#

verbal nimbus Like is this a summary or is it actually hallucinating

Summary

#

It generates gazzilions of tokens under the hood

#

Thats why it costs so much

#

(The summaries overall sound like bs and derailed from reality for all models btw)

verbal nimbus Feb 8, 2026, 10:52 PM

#

Still thinking...

surreal zephyr Feb 8, 2026, 10:52 PM

#

Like summary can say "im using this" and the model uses that

honest verge Feb 8, 2026, 10:53 PM

#

verbal nimbus Still thinking...

Mistral thinks so much that it doesn't

scarlet spire Feb 8, 2026, 10:53 PM

#

Well then. thinkies That's quite the turnstile twitch

honest verge Feb 8, 2026, 10:53 PM

#

Like I'm trying to create snake game with mistral

verbal nimbus Feb 8, 2026, 10:53 PM

#

Wait what have you been doing this entire time then, Opus 🤔

honest verge Feb 8, 2026, 10:53 PM

#

It can't do this

quartz light Feb 8, 2026, 10:54 PM

#

@surreal zephyr https://api.websim.com/blobs/019c3f75-b474-765c-82c1-c972405164f6.html 🤣

scarlet spire Feb 8, 2026, 10:54 PM

#

verbal nimbus Wait what have you been doing this entire time then, Opus 🤔

prototyping
of course

honest verge Feb 8, 2026, 10:54 PM

#

verbal nimbus Wait what have you been doing this entire time then, Opus 🤔

Maybe it's not opus

#

It's mistral

#

It's the best I could do with mistral large 3

Screenshot_2026-02-09-01-55-33-885_com.android.chrome-edit.jpg

#

It can't do anything more

quartz light Feb 8, 2026, 10:56 PM

#

https://api.websim.com/blobs/019c3f77-7e6c-775e-a4ec-fad89f39e59e.html driving isnt working but holy this feels GOOD

verbal nimbus Feb 8, 2026, 10:56 PM

#

It said it was ready like a gazillion times 🤣

honest verge Feb 8, 2026, 10:56 PM

#

verbal nimbus It said it was ready like a gazillion times 🤣

Why

#

I thought opus 4.6 doesn't hallucinates

verbal nimbus Feb 8, 2026, 10:57 PM

#

#

😑

quartz light Feb 8, 2026, 10:57 PM

#

honest verge I thought opus 4.6 doesn't hallucinates

its just the thinking process

#

its not really accurate

#

its just summaries

#

also this will fry your pc https://api.websim.com/blobs/019c3f78-e582-75ff-b2e3-95d7d77fe3b4.html

honest verge Feb 8, 2026, 10:58 PM

#

quartz light also this will fry your pc https://api.websim.com/blobs/019c3f78-e582-75ff-b2e3-...

WHAT THE

#

I almost got fried

verbal nimbus Feb 8, 2026, 10:58 PM

#

honest verge I thought opus 4.6 doesn't hallucinates

It does, but not sure if it's doing so here

honest verge Feb 8, 2026, 10:59 PM

#

verbal nimbus It does, but not sure if it's doing so here

Well mistral still no diff

surreal zephyr Feb 8, 2026, 10:59 PM

#

quartz light <@1035834558681186347> https://api.websim.com/blobs/019c3f75-b474-765c-82c1-c972...

Welp

#

Codex 5.3 nailed it in 1 prompt

verbal nimbus Feb 8, 2026, 10:59 PM

#

honest verge I thought opus 4.6 doesn't hallucinates

It's easy to make it hallucinate:

surreal zephyr Feb 8, 2026, 10:59 PM

#

¯_(ツ)_/¯

#

4 prompts to fix all bugs it took

surreal zephyr Feb 8, 2026, 11:00 PM

#

verbal nimbus It's easy to make it hallucinate:

5.2 thinking

verbal nimbus Feb 8, 2026, 11:00 PM

#

verbal nimbus It's easy to make it hallucinate:

Old Sonnet doesn't hallucinate:

verbal nimbus Feb 8, 2026, 11:01 PM

#

surreal zephyr 5.2 thinking

Oh it has tools so doesn't really count. This test is to see if it can notice that the tool is missing instead of pretending to use it.

surreal zephyr Feb 8, 2026, 11:01 PM

#

Funny how 5.2 is the only of sonnet and opus that did it correct

surreal zephyr Feb 8, 2026, 11:01 PM

#

verbal nimbus Oh it has tools so doesn't really count. This test is to see if it can notice th...

Wym

#

It ran script

#

And it said it doesnt have a tool for it

#

So it didnt hallucinate

verbal nimbus Feb 8, 2026, 11:02 PM

#

surreal zephyr And it said it doesnt have a tool for it

Yeah, that's fine

surreal zephyr Feb 8, 2026, 11:02 PM

#

verbal nimbus Yeah, that's fine

#

Ya

#

Imma glaze gpt ngl

verbal nimbus Feb 8, 2026, 11:02 PM

#

GPT doesn't hallucinate on that prompt, only Claude 4.5 and up

surreal zephyr Feb 8, 2026, 11:02 PM

#

What does gpt fail at that claude doesnt?

north obsidian Feb 8, 2026, 11:03 PM

#

verbal nimbus It's easy to make it hallucinate:

Where is it?

verbal nimbus Feb 8, 2026, 11:03 PM

#

surreal zephyr What does gpt fail at that claude doesnt?

Hmm it doesn't fail at that kind of basic stuff

verbal nimbus Feb 8, 2026, 11:03 PM

#

north obsidian Where is it?

Where is what?

north obsidian Feb 8, 2026, 11:03 PM

#

The hallucinations

surreal zephyr Feb 8, 2026, 11:03 PM

#

north obsidian The hallucinations

Bro there is NO dice tool

#

💀

#

😭

verbal nimbus Feb 8, 2026, 11:03 PM

#

north obsidian The hallucinations

Compare Claude 3.5's response in the next message. That's what it should have said. Here: #general message

surreal zephyr Feb 8, 2026, 11:04 PM

#

verbal nimbus Compare Claude 3.5's response in the next message. That's what it should have sa...

Nah it shouldve said it doesnt have tool, then run a script imo

#

Gpt did it better

verbal nimbus Feb 8, 2026, 11:04 PM

#

surreal zephyr Nah it shouldve said it doesnt have tool, then run a script imo

It can't run a script on LMArena

surreal zephyr Feb 8, 2026, 11:04 PM

#

verbal nimbus It can't run a script on LMArena

Oh

honest verge Feb 8, 2026, 11:05 PM

#

surreal zephyr Imma glaze gpt ngl

I'm glazing mistral

surreal zephyr Feb 8, 2026, 11:05 PM

#

verbal nimbus It can't run a script on LMArena

Welp

#

In gpt i trust

surreal zephyr Feb 8, 2026, 11:05 PM

#

honest verge I'm glazing mistral

If i ever become crack addict ill go to mistral

verbal nimbus Feb 8, 2026, 11:06 PM

#

surreal zephyr Welp

Yup, only newer Claude models seem to fail on it for some reason

surreal zephyr Feb 8, 2026, 11:07 PM

#

verbal nimbus Yup, only newer Claude models seem to fail on it for some reason

XAXAXA

topaz skiff Feb 8, 2026, 11:07 PM

#

surreal zephyr Imma glaze gpt ngl

he did hallucinate

surreal zephyr Feb 8, 2026, 11:07 PM

#

topaz skiff he did hallucinate

He didnt wym

#

He ran a script

topaz skiff Feb 8, 2026, 11:07 PM

#

show me

surreal zephyr Feb 8, 2026, 11:07 PM

#

In app

surreal zephyr Feb 8, 2026, 11:07 PM

#

topaz skiff show me

topaz skiff Feb 8, 2026, 11:07 PM

#

okay that makes sense, but this is function calling

#

so no

surreal zephyr Feb 8, 2026, 11:07 PM

#

topaz skiff okay that makes sense, but this is function calling

The one on the left also didnt hallucinate

#

Brotato chip

#

💔

#

Without functions and with functions both, said that they dont have dice tools

topaz skiff Feb 8, 2026, 11:08 PM

#

because this is awfuly simple task

surreal zephyr Feb 8, 2026, 11:08 PM

#

The one with function ALSO ran a script

surreal zephyr Feb 8, 2026, 11:08 PM

#

topaz skiff because this is awfuly simple task

Opus 4.6 fails

#

🥀

honest verge Feb 8, 2026, 11:08 PM

#

surreal zephyr Without functions and with functions both, said that they dont have dice tools

Mistral has dice tool

topaz skiff Feb 8, 2026, 11:09 PM

#

Okay enough of AI slop for today, i don't want to hear much more at least in few months

honest verge Feb 8, 2026, 11:09 PM

#

topaz skiff Okay enough of AI slop for today, i don't want to hear much more at least in few...

Mistral will conquer the world

verbal nimbus Feb 8, 2026, 11:09 PM

#

surreal zephyr Opus 4.6 fails

Uh oh why is Opus 4.6 hallucinating like Gemini 3 Pro now

surreal zephyr Feb 8, 2026, 11:10 PM

#

verbal nimbus Uh oh why is Opus 4.6 hallucinating like Gemini 3 Pro now

🤣

honest verge Feb 8, 2026, 11:10 PM

#

verbal nimbus Uh oh why is Opus 4.6 hallucinating like Gemini 3 Pro now

Because Gemini said it was made by anthropic

surreal zephyr Feb 8, 2026, 11:10 PM

#

Gemini is worse it js crashes

honest verge Feb 8, 2026, 11:10 PM

#

So it's the same model

surreal zephyr Feb 8, 2026, 11:10 PM

#

Iirc opus 4.6 runs on google tpus

#

xD

#

@verbal nimbus

surreal zephyr Feb 8, 2026, 11:11 PM

#

verbal nimbus Uh oh why is Opus 4.6 hallucinating like Gemini 3 Pro now

Paste the prompt im on phone

#

I wanna check

verbal nimbus Feb 8, 2026, 11:11 PM

#

surreal zephyr Paste the prompt im on phone

Explain to me how trigonometry works. Reference pages.

[Attachment 1] - Trigonometry (2011) - Kirsanov, Simmons, et al.

#

Gemini 3 Pro fails on this one too

surreal zephyr Feb 8, 2026, 11:12 PM

#

verbal nimbus ``` Explain to me how trigonometry works. Reference pages. [Attachment 1] - Tri...

"Couldnt find the referenced file. Ill search online for alternatives"

#

🔥 🔥 🔥

verbal nimbus Feb 8, 2026, 11:13 PM

#

surreal zephyr "Couldnt find the referenced file. Ill search online for alternatives"

Yeah this doesn't reallly trick GPT

#

Just weaker models, newer Claude models and Gemini models

#

surreal zephyr Feb 8, 2026, 11:13 PM

#

verbal nimbus Yeah this doesn't reallly trick GPT

Gpt geniuely only models that arent on crack

honest verge Feb 8, 2026, 11:14 PM

#

surreal zephyr Iirc opus 4.6 runs on google tpus

Screenshot_2026-02-09-02-13-37-121_com.discord-edit.jpg

surreal zephyr Feb 8, 2026, 11:14 PM

#

verbal nimbus Just weaker models, newer Claude models and Gemini models

surreal zephyr Feb 8, 2026, 11:14 PM

#

honest verge

LMAO

#

@verbal nimbus 3.0 pro in app pased 💀🤔

verbal nimbus Feb 8, 2026, 11:15 PM

#

surreal zephyr Gpt geniuely only models that arent on crack

They all have a bit of quirks

verbal nimbus Feb 8, 2026, 11:15 PM

#

surreal zephyr <@858135822389346344> 3.0 pro in app pased 💀🤔

Oh interesting 🤔

surreal zephyr Feb 8, 2026, 11:16 PM

#

verbal nimbus They all have a bit of quirks

Gpt is like actual skilled developer
Gemini is like einstein with dementia and on crack
Opus is creative overthinker

verbal nimbus Feb 8, 2026, 11:16 PM

#

I only came up with that prompt because I forgot to attach files a few times with G2.5 and it never told me

surreal zephyr Feb 8, 2026, 11:16 PM

#

verbal nimbus I only came up with that prompt because I forgot to attach files a few times wit...

Sometimes it doesnt read files that are attached

honest verge Feb 8, 2026, 11:16 PM

#

surreal zephyr Gpt is like actual skilled developer Gemini is like einstein with dementia and ...

So mistral is the Gemini brother or father?

surreal zephyr Feb 8, 2026, 11:16 PM

#

But also g2.5 is is way worse than g3

honest verge Feb 8, 2026, 11:16 PM

#

And opus is their grandfather

surreal zephyr Feb 8, 2026, 11:17 PM

#

honest verge So mistral is the Gemini brother or father?

Gemini is like opus but hit in the head a bit too many times

verbal nimbus Feb 8, 2026, 11:17 PM

#

surreal zephyr <@858135822389346344> 3.0 pro in app pased 💀🤔

Here's an old screenshot on LMArena:

honest verge Feb 8, 2026, 11:17 PM

#

WHAT

verbal nimbus Feb 8, 2026, 11:18 PM

#

surreal zephyr Gemini is like opus but hit in the head a bit too many times

They all have their strengths/weaknesses, but funny analogy 🤣

surreal zephyr Feb 8, 2026, 11:18 PM

#

verbal nimbus Here's an old screenshot on LMArena:

3.0 flash in app js scanned my entire google drive 💔

#

😭

verbal nimbus Feb 8, 2026, 11:18 PM

#

surreal zephyr 3.0 flash in app js scanned my entire google drive 💔

It can do that? I turned of Smart Workspaces due to privacy concerns

honest verge Feb 8, 2026, 11:19 PM

#

surreal zephyr 3.0 flash in app js scanned my entire google drive 💔

Still waiting for 3.0 flash lite

surreal zephyr Feb 8, 2026, 11:19 PM

#

verbal nimbus It can do that? I turned of Smart Workspaces due to privacy concerns

Turning off ts makes it braindead

#

Even more

#

🥀

honest verge Feb 8, 2026, 11:19 PM

#

Still can't understand why there's no flash lite Gemini in app or website

verbal nimbus Feb 8, 2026, 11:19 PM

#

surreal zephyr 3.0 flash in app js scanned my entire google drive 💔

BTW Google uses all chats, voice recordings and files you upload for training unless you disable activity

honest verge Feb 8, 2026, 11:19 PM

#

Like it's very fast and cheap

surreal zephyr Feb 8, 2026, 11:20 PM

#

verbal nimbus BTW Google uses all chats, voice recordings and files you upload for training un...

If u disable activity you dont have chat history at all

verbal nimbus Feb 8, 2026, 11:20 PM

#

surreal zephyr If u disable activity you dont have chat history at all

That's what I do

surreal zephyr Feb 8, 2026, 11:20 PM

#

But i need chat history

verbal nimbus Feb 8, 2026, 11:20 PM

#

But if you thumb up/down with activity off it still sends your last 24 hours of chats, voice recordings and attachments

surreal zephyr Feb 8, 2026, 11:20 PM

#

Al hail chatgpt

surreal zephyr Feb 8, 2026, 11:20 PM

#

verbal nimbus But if you thumb up/down with activity off it still sends your last 24 hours of ...

Ya

honest verge Feb 8, 2026, 11:20 PM

#

surreal zephyr Al hail chatgpt

*mistral

surreal zephyr Feb 8, 2026, 11:20 PM

#

honest verge *mistral

Mistral is for drinking together

#

Mistral is the most human ai

#

🥀

#

If i ever buy a clanka

#

I buy two

verbal nimbus Feb 8, 2026, 11:21 PM

#

surreal zephyr Al hail chatgpt

I wish chat braching/cloning was on the Gemini app

surreal zephyr Feb 8, 2026, 11:21 PM

#

Mistral for crack

#

And gpt for trust

#

Geniuely

honest verge Feb 8, 2026, 11:21 PM

#

verbal nimbus I wish chat braching/cloning was on the Gemini app

I wish Gemini flash lite was in app

surreal zephyr Feb 8, 2026, 11:21 PM

#

Gpt the most trustworthy model

#

Lol

verbal nimbus Feb 8, 2026, 11:21 PM

#

honest verge I wish Gemini flash lite was in app

Fast is pretty fast

quartz light Feb 8, 2026, 11:21 PM

#

devstral is tuff

honest verge Feb 8, 2026, 11:26 PM

#

Opus 4.6 tank

Screenshot_2026-02-09-02-25-56-563_com.android.chrome-edit.jpg

Screenshot_2026-02-09-02-25-43-135_com.android.chrome-edit.jpg

#

Vs mistral

verbal nimbus Feb 8, 2026, 11:26 PM

#

honest verge Vs mistral

This is still the best by far

Screenshot_2026-02-09-01-46-59-026_com.teejay.trebedit-edit.png

honest verge Feb 8, 2026, 11:27 PM

#

verbal nimbus This is still the best by far

That's why mistral is peak

verbal nimbus Feb 8, 2026, 11:29 PM

#

honest verge That's why mistral is peak

I thought this was funny

#

Was checking if it hallucinates any tools.

honest verge Feb 8, 2026, 11:30 PM

#

verbal nimbus I thought this was funny

#

Gemini 3 pro is made by anthropic

#

I'm sure

quartz light Feb 8, 2026, 11:36 PM

#

can someone please fix this?

📎 message.txt

#

https://api.websim.com/blobs/019c3f9c-07af-72f9-b195-88a30821240c.html

verbal nimbus Feb 8, 2026, 11:40 PM

#

quartz light can someone please fix this?

No AI can fix it?

#

Seems like a good test

shrewd citrus Feb 8, 2026, 11:56 PM

#

ai should be able to

#

as long as you give it context

#

can’t just say fix it

tiny dove Feb 8, 2026, 11:56 PM

#

The image uploader isn't working

quartz light Feb 9, 2026, 12:01 AM

#

verbal nimbus Seems like a good test

did you get any to fix it

verbal nimbus Feb 9, 2026, 12:02 AM

#

quartz light did you get any to fix it

Yup

quartz light Feb 9, 2026, 12:02 AM

#

verbal nimbus Yup

giv

verbal nimbus Feb 9, 2026, 12:02 AM

#

Opus 4.6 Thinking found the bug:

#

I also made one modification before, not sure if that is actually required

#

Just search for groundRB definition and change arguments inside the function to 0, 0, 0

#

GPT identified the problem too, but it's fix seems larger, not sure why

#

Not familiar with this lib

quartz light Feb 9, 2026, 12:07 AM

#

verbal nimbus Opus 4.6 Thinking found the bug:

wow you got lucky

#

4.6 thinking worked

verbal nimbus Feb 9, 2026, 12:07 AM

#

Oh, maybe this is why:

verbal nimbus Feb 9, 2026, 12:07 AM

#

quartz light wow you got lucky

I pasted the docs: https://rapier.rs/docs/user_guides/javascript/collider_collision_groups/

quartz light Feb 9, 2026, 12:07 AM

#

i only got it to ever work a single time

verbal nimbus Feb 9, 2026, 12:07 AM

#

I changed the CG function too, not sure if that's needed

quartz light Feb 9, 2026, 12:07 AM

#

verbal nimbus Not familiar with this lib

rapier?

verbal nimbus Feb 9, 2026, 12:08 AM

#

quartz light rapier?

Yup

quartz light Feb 9, 2026, 12:08 AM

#

its my favourite

#

https://cdn.jsdelivr.net/npm/@dimforge/rapier3d-simd-compat@canary/rapier.mjs this is what i use

verbal nimbus Feb 9, 2026, 12:12 AM

#

quartz light i only got it to ever work a single time

There might be a few more bugs 😅

fierce cove Feb 9, 2026, 12:13 AM

#

Error report: Over the past two days, I've tested Claude-opus-4-6-thinking several times and have encountered errors multiple times

verbal nimbus Feb 9, 2026, 12:14 AM

#

fierce cove Error report: Over the past two days, I've tested Claude-opus-4-6-thinking sever...

Yeah it overthinks and crashes

verbal nimbus Feb 9, 2026, 12:14 AM

#

quartz light can someone please fix this?

📎 kinda_fixed_but_not_quite.txt

verbal nimbus Feb 9, 2026, 12:16 AM

#

fierce cove Error report: Over the past two days, I've tested Claude-opus-4-6-thinking sever...

It keeps overthinking and crashing and I can't even vote

shrewd citrus Feb 9, 2026, 12:17 AM

#

i haven’t seen another model think for so long

#

so it might be a bug like I don’t actually think Claude 4.6 is really meant to think for THAT long

verbal nimbus Feb 9, 2026, 12:17 AM

#

I think it should count as a loss, that would incentivize providers to fix it

#

Otherwise it's kind of cheating by overthinking on problems it can't solve and forcing a forfeit (since the evaluation will be discarded)

loud verge Feb 9, 2026, 12:19 AM

#

verbal nimbus I thought this was funny

Lmfao

honest verge Feb 9, 2026, 12:26 AM

#

Please 4.6 thinking 32k

#

I need this

quartz light Feb 9, 2026, 12:28 AM

#

CHECK THIS OUT

#

https://api.websim.com/blobs/019c3fca-69b4-758e-8fd3-efb6425e5e98.html

honest verge Feb 9, 2026, 12:29 AM

#

quartz light https://api.websim.com/blobs/019c3fca-69b4-758e-8fd3-efb6425e5e98.html

Crazy

#

But looks good

quartz light Feb 9, 2026, 12:29 AM

#

honest verge Crazy

look at the tires

#

while moving

honest verge Feb 9, 2026, 12:30 AM

#

Lol

inner relic Feb 9, 2026, 12:30 AM

#

what do you guys think AI is coding an advanced NPC. I used a free model and added a strategy modified script

quartz light Feb 9, 2026, 12:31 AM

#

inner relic what do you guys think AI is coding an advanced NPC. I used a free model and ad...

2

shrewd citrus Feb 9, 2026, 12:31 AM

#

inner relic what do you guys think AI is coding an advanced NPC. I used a free model and ad...

how do you implement ai into the game?

#

is there an api plugin

quartz light Feb 9, 2026, 12:31 AM

#

shrewd citrus how do you implement ai into the game?

pathfinding, algorithms

#

wdym

honest verge Feb 9, 2026, 12:32 AM

#

What if you make a team of mistral + opus 4.6

#

For coding

shrewd citrus Feb 9, 2026, 12:32 AM

#

i mean on rooblox there’s actual ai npcs with communication

quartz light Feb 9, 2026, 12:32 AM

#

honest verge What if you make a team of mistral + opus 4.6

is mistral actually good?

#

why are yall talking about it

#

is there a new model

#

or what

inner relic Feb 9, 2026, 12:32 AM

#

quartz light 2

huh

honest verge Feb 9, 2026, 12:33 AM

#

quartz light is mistral actually good?

Best coding model

quartz light Feb 9, 2026, 12:33 AM

#

inner relic huh

sry i thought u asked to say which one is better

inner relic Feb 9, 2026, 12:33 AM

#

quartz light sry i thought u asked to say which one is better

oh okay

honest verge Feb 9, 2026, 12:33 AM

#

It's on par with Gemini 3 pro

quartz light Feb 9, 2026, 12:33 AM

#

honest verge Best coding model

what

inner relic Feb 9, 2026, 12:33 AM

#

I understand now

quartz light Feb 9, 2026, 12:33 AM

#

honest verge It's on par with Gemini 3 pro

since when

#

is there a new model

honest verge Feb 9, 2026, 12:33 AM

#

Today

quartz light Feb 9, 2026, 12:33 AM

#

what

#

which

inner relic Feb 9, 2026, 12:33 AM

#

inner relic what do you guys think AI is coding an advanced NPC. I used a free model and ad...

This was made by claude opus 4.6

quartz light Feb 9, 2026, 12:34 AM

#

honest verge Today

give

honest verge Feb 9, 2026, 12:34 AM

#

quartz light give

It's paid

quartz light Feb 9, 2026, 12:35 AM

#

honest verge It's paid

no but whats the mod3l

#

model name

#

model id

shrewd citrus Feb 9, 2026, 12:35 AM

#

inner relic This was made by claude opus 4.6

thinking or normal

inner relic Feb 9, 2026, 12:35 AM

#

shrewd citrus thinking or normal

Thinking

quartz light Feb 9, 2026, 12:35 AM

#

honest verge It's paid

just tell me what the model is called

#

???

shrewd citrus Feb 9, 2026, 12:35 AM

#

inner relic Thinking

how did you make it work like when i try it just thinks for way to long and breaks

honest verge Feb 9, 2026, 12:35 AM

#

quartz light just tell me what the model is called

I can't

quartz light Feb 9, 2026, 12:36 AM

#

shrewd citrus how did you make it work like when i try it just thinks for way to long and brea...

it doesnt think for too long really

quartz light Feb 9, 2026, 12:36 AM

#

honest verge I can't

WTF

honest verge Feb 9, 2026, 12:36 AM

#

They will ban me

quartz light Feb 9, 2026, 12:36 AM

#

???

quartz light Feb 9, 2026, 12:36 AM

#

honest verge They will ban me

how

honest verge Feb 9, 2026, 12:36 AM

#

I'm getting controlled

#

The curse of mistral

quartz light Feb 9, 2026, 12:36 AM

#

honest verge I'm getting controlled

dude just tell me

shrewd citrus Feb 9, 2026, 12:36 AM

#

quartz light it doesnt think for too long really

it does like I can ask the same prompt to any other model an kr outputs something

quartz light Feb 9, 2026, 12:36 AM

#

im going to

#

:(

#

guhhh

shrewd citrus Feb 9, 2026, 12:36 AM

#

for 4.6 it just breaks during thinking

honest verge Feb 9, 2026, 12:36 AM

#

quartz light dude just tell me

I CAN'T

shrewd citrus Feb 9, 2026, 12:36 AM

#

unless they fixed it recently

quartz light Feb 9, 2026, 12:36 AM

#

honest verge I CAN'T

bruh

#

just tell me the model name

#

stop fkin aroun

honest verge Feb 9, 2026, 12:36 AM

#

I CAN'T

#

THEY WILL BAN ME

quartz light Feb 9, 2026, 12:36 AM

#

??

#

for what

honest verge Feb 9, 2026, 12:37 AM

#

FOR TELLING THE SECRET

inner relic Feb 9, 2026, 12:37 AM

#

shrewd citrus for 4.6 it just breaks during thinking

I told the AI to think less and stop after 30 seconds. It listened and stopped overthinking.

shrewd citrus Feb 9, 2026, 12:38 AM

#

inner relic I told the AI to think less and stop after 30 seconds. It listened and stopped o...

that’s smart I’ll try that tomorrow

north obsidian Feb 9, 2026, 12:38 AM

#

inner relic I told the AI to think less and stop after 30 seconds. It listened and stopped o...

Broke the matrix

honest verge Feb 9, 2026, 12:39 AM

#

quartz light for what

Bro just died

quartz light Feb 9, 2026, 12:40 AM

#

honest verge Bro just died

syfm

#

🥀

honest verge Feb 9, 2026, 12:43 AM

#

quartz light syfm

It's real

#

They gave me early access

#

To mistral 4

#

ALSO

#

I'm free now

proud bobcat Feb 9, 2026, 12:46 AM

#

#

Pardon?

#

How is this in violation of the terms of use

#

I’m confused

#

Yeah I don’t see nothing in the terms of use that says this is in violation

#

@echo aurora Horribly sorry to ping if you’re busy but is this just a glitch?

quartz light Feb 9, 2026, 12:51 AM

#

honest verge To mistral 4

p r o o f

shrewd citrus Feb 9, 2026, 12:54 AM

#

proud bobcat How is this in violation of the terms of use

maybe the image itself might be making the system freak out

proud bobcat Feb 9, 2026, 12:55 AM

#

shrewd citrus maybe the image itself might be making the system freak out

Perhaps?

#

#

Violence?

shrewd citrus Feb 9, 2026, 12:55 AM

#

yeah maybe it doesn’t like violence

proud bobcat Feb 9, 2026, 12:55 AM

#

Yeah maybe

#

I’ll try qwen or Flux

#

Lmao

frigid wigeon Feb 9, 2026, 1:01 AM

#

Hello

north patrol Feb 9, 2026, 1:01 AM

#

https://youtube.com/shorts/xcNWfnwXYVg?si=8Dg2G9N4Eift6EF3 consejos?

YouTube

Rey Samurai

Creando el mejor piso del mundo❤️ (primer video) #ia #videodeld...

Creando el mejor piso del mundo❤️ (primer video) #ia #videodeldia #videosgraciososdeanimales #

▶ Play video

inner relic Feb 9, 2026, 1:18 AM

#

Okay bug fixes, any difference

#

soft verge Feb 9, 2026, 2:08 AM

#

verbal nimbus It keeps overthinking and crashing and I can't even vote

claude 4.6 seems to be a very large model, probably 10x of gemini 3. it's much better, but it's also much slower. lmarena timeouts after ~6 minutes. i hope they make it at least 8-10 minutes, which should be enough for claude to respond. i noticed it takes ~5m17s to think and then a couple more minutes to respond

honest verge Feb 9, 2026, 2:43 AM

#

Is it true that qwen 3.5 is available?

quartz light Feb 9, 2026, 2:50 AM

#

honest verge Is it true that qwen 3.5 is available?

on arena yes

#

in battle mode

keen beacon Feb 9, 2026, 3:05 AM

#

Hey guys, is anyone having a problem where, when communicating a lot with the Gemini 3 Pro, it gives the following message: "Something went wrong with this response, please try again." and then doesn't respond anymore, even after clicking the reload button and the Gemini 3 Flash going into infinite generation?

spare rune Feb 9, 2026, 3:29 AM

#

keen beacon Hey guys, is anyone having a problem where, when communicating a lot with the Ge...

For Gemini 3 your way better of using AI studio

undone geyser Feb 9, 2026, 3:38 AM

#

Idea for future if possible: android version of arena.ai????

keen beacon Feb 9, 2026, 3:39 AM

#

spare rune For Gemini 3 your way better of using AI studio

Is it free? And is it on the same level as Gemini from Lmarena?

spare rune Feb 9, 2026, 3:49 AM

#

Yes (if your using the direct mode)

wicked sage Feb 9, 2026, 4:17 AM

#

bro forgot to show the entire image

#

still kinda mid tho

honest verge Feb 9, 2026, 4:37 AM

#

wicked sage bro forgot to show the entire image

But imagine using infinitely gpt 5.2 and Gemini 3 pro

#

Like you can do anything

#

But they should add opus

#

Then it's worth it

wicked sage Feb 9, 2026, 4:38 AM

#

honest verge But imagine using infinitely gpt 5.2 and Gemini 3 pro

ok not to be a chud here but infinitely doesnt exist
theres just no limit on how much you use them, unless.. there is?

#

but the limit is probably high

wicked sage Feb 9, 2026, 4:38 AM

#

honest verge But they should add opus

anyways yeah true i agree

#

even opus 4.5 is good

#

but opus 4.6 is better

honest verge Feb 9, 2026, 4:39 AM

#

wicked sage anyways yeah true i agree

But I think unlimited access for all these models can't be 83$ a month

wicked sage Feb 9, 2026, 4:39 AM

#

yeah its probably more

#

due to like

#

claude sonnet gpt blah blah blah

#

they are NOT kind with their input/output cost things

honest verge Feb 9, 2026, 4:40 AM

#

Sonnet is very expensive

#

Alone

#

+gpt and Gemini

#

Ts can't be 83$ a month

wicked sage Feb 9, 2026, 4:45 AM

#

wait what the fuh

#

kimi k2.5 has 1 trillion tokens on openrouter

honest verge Feb 9, 2026, 4:46 AM

#

wicked sage kimi k2.5 has 1 trillion tokens on openrouter

WHAT

wicked sage Feb 9, 2026, 4:46 AM

#

#

HOW are the servers still up

honest verge Feb 9, 2026, 4:46 AM

#

IDK

#

It's going up

Screenshot_2026-02-09-07-46-32-229_com.android.chrome-edit.jpg

#

1.2 T already

#

What's going on

#

The servers are going to explode

wicked sage Feb 9, 2026, 4:48 AM

#

IKNOW

wicked talon Feb 9, 2026, 4:48 AM

#

Bruhh how has Claude ranked better then Gemini

#

That's unheard

wicked sage Feb 9, 2026, 4:48 AM

#

simple

#

opus

honest verge Feb 9, 2026, 4:49 AM

#

Gemini 3 pro is already outdated

#

At release it was the best model ever

#

Now it's not

#

Waiting for GA

wicked sage Feb 9, 2026, 4:49 AM

#

GA?

#

what does that stand for

#

i think ive heard that once

wicked talon Feb 9, 2026, 4:50 AM

#

Yeah

honest verge Feb 9, 2026, 4:50 AM

#

wicked sage GA?

General available

#

I think

wicked sage Feb 9, 2026, 4:50 AM

#

oh

honest verge Feb 9, 2026, 4:50 AM

#

Rumors are saying it will be better than preview

#

Because 3 pro and flash still in preview

wicked talon Feb 9, 2026, 4:51 AM

#

How is deepseek 31 😭

#

This is harsh

#

I have to tell the ai to switch to English 😭 😭

honest verge Feb 9, 2026, 4:52 AM

#

wicked talon How is deepseek 31 😭

Cuz there's still no R2 or v4

wicked talon Feb 9, 2026, 4:52 AM

#

honest verge Cuz there's still no R2 or v4

Oh

honest verge Feb 9, 2026, 4:52 AM

#

wicked talon I have to tell the ai to switch to English 😭 😭

Deepseek was so good at 2024

#

Now they can't really match big models

wicked talon Feb 9, 2026, 4:53 AM

#

Is co-pilot just free chatgpt?

wicked sage Feb 9, 2026, 4:54 AM

#

c*p*l*t

#

i dont like it cuz,,, microsoft

#

anyways uhh

wicked talon Feb 9, 2026, 4:55 AM

#

wicked sage i dont like it cuz,,, microsoft

Yeah Microsoft is a defo spyware company

#

Linux for life

honest verge Feb 9, 2026, 4:55 AM

#

wicked sage i dont like it cuz,,, microsoft

Never used it

wicked sage Feb 9, 2026, 4:55 AM

#

wicked talon Linux for life

you are cool

honest verge Feb 9, 2026, 4:55 AM

#

Maybe GitHub copilot

wicked sage Feb 9, 2026, 4:55 AM

#

i use linux also

wicked talon Feb 9, 2026, 4:55 AM

#

Thank you 😉

honest verge Feb 9, 2026, 4:55 AM

#

Not Microsoft

wicked sage Feb 9, 2026, 4:55 AM

#

fair enough

wicked talon Feb 9, 2026, 4:55 AM

#

honest verge Not Microsoft

Fair

#

I use Gemini for everything

#

And qwen sometimes

wicked sage Feb 9, 2026, 4:56 AM

#

wicked talon Yeah Microsoft is a defo spyware company

what if microsoft was a chinese spyware compnay secretly

wicked talon Feb 9, 2026, 4:56 AM

#

wicked sage what if microsoft was a chinese spyware compnay secretly

Would not surprise me

wicked sage Feb 9, 2026, 4:56 AM

#

same

#

https://tenor.com/view/valentinesweekend-true-advertising-ad-cleaner-gif-20367236

Tenor

wicked talon Feb 9, 2026, 4:56 AM

#

Qwen is probably spyware too tbh

#

Alibaba

wicked sage Feb 9, 2026, 4:56 AM

#

qwen is peak tho
but yeah theres a high chance it can be spyware

honest verge Feb 9, 2026, 4:56 AM

#

wicked talon Alibaba

Each time I hear Alibaba I think about sweet home Alabama

#

I don't know why

wicked sage Feb 9, 2026, 4:57 AM

#

i can c why

wicked talon Feb 9, 2026, 4:57 AM

#

wicked talon Feb 9, 2026, 4:57 AM

#

wicked sage qwen is peak tho but yeah theres a high chance it can be spyware

Yeah I like qwen a lot but Alibaba doesn't put enough money in it

#

Wait they force search on the app now lmao

wicked sage Feb 9, 2026, 4:58 AM

#

wicked talon

translating... 🤖
microsoft is spyware

wicked talon Feb 9, 2026, 4:59 AM

#

I think the USA should study Microsoft

#

If qwen is speaking facts

#

They thought tiktok was spyware

honest verge Feb 9, 2026, 4:59 AM

#

I don't know how Kimi k2.5 on openrouter is still active

wicked talon Feb 9, 2026, 4:59 AM

#

wicked sage translating... 🤖 microsoft is spyware

Lol

wicked talon Feb 9, 2026, 5:00 AM

#

honest verge I don't know how Kimi k2.5 on openrouter is still active

I hate Kimi

#

It's too slow

wicked sage Feb 9, 2026, 5:00 AM

#

to be fair it has like

#

1.2t tokens

#

as of rn

honest verge Feb 9, 2026, 5:00 AM

#

It's 1.2 T tokens right now

wicked talon Feb 9, 2026, 5:00 AM

#

Wtf

honest verge Feb 9, 2026, 5:00 AM

#

Maybe it will explode soon

wicked talon Feb 9, 2026, 5:00 AM

#

How tf has it got 1.2t tokens

honest verge Feb 9, 2026, 5:00 AM

#

But I don't know the hype behind k2.5

#

Like it's not really the best

wicked talon Feb 9, 2026, 5:01 AM

#

honest verge Like it's not really the best

I would rather use Gemini

#

🙂

#

I like perpelxity

wicked sage Feb 9, 2026, 5:02 AM

#

instead of kimi 2.5 just use uhhh

#

fuh-in grok or something

#

idk

wicked talon Feb 9, 2026, 5:03 AM

#

wicked sage fuh-in grok or something

I used to use grok

wicked sage Feb 9, 2026, 5:03 AM

#

grok 4.1 fast

wicked talon Feb 9, 2026, 5:03 AM

#

Until it hit me with the "limit"

#

And im not paying £20 a month

wicked sage Feb 9, 2026, 5:03 AM

#

i hate elon

wicked talon Feb 9, 2026, 5:03 AM

#

I don't use chatgpt cuz of limits too

honest verge Feb 9, 2026, 5:03 AM

#

But grok 4.20 is too late

wicked talon Feb 9, 2026, 5:04 AM

#

honest verge But grok 4.20 is too late

Grok is bad

honest verge Feb 9, 2026, 5:04 AM

#

Like it was supposed to come out in January

#

But still nothing

wicked sage Feb 9, 2026, 5:04 AM

#

i dont even know which ai is good for

#

coding random bullshii

#

i just use sonnet 4.5

#

im STILL waiting for an update 😡

honest verge Feb 9, 2026, 5:04 AM

#

wicked sage i just use sonnet 4.5

Opus 4.6

wicked sage Feb 9, 2026, 5:04 AM

#

honest verge Opus 4.6

too usage heavy

honest verge Feb 9, 2026, 5:04 AM

#

wicked sage im STILL waiting for an update 😡

They baited us

wicked talon Feb 9, 2026, 5:04 AM

#

Gemini says grok5 will come out by march 🙂

honest verge Feb 9, 2026, 5:04 AM

#

With sonnet 5

#

Like everyone thought it will come out

#

But no

wicked sage Feb 9, 2026, 5:05 AM

#

yo i can die happily when sonnet 4.6/5 releases

wicked talon Feb 9, 2026, 5:05 AM

#

wicked sage coding random bullshii

Gemini in my opinion

wicked sage Feb 9, 2026, 5:05 AM

#

wicked talon Gemini in my opinion

eh didnt like gemini cuz it had dementia and said stuff from earlier

wicked talon Feb 9, 2026, 5:05 AM

#

wicked talon Gemini in my opinion

Gemini ai studio build to build random apps

wicked talon Feb 9, 2026, 5:05 AM

#

wicked sage eh didnt like gemini cuz it had dementia and said stuff from earlier

It doesn't have dementia 🙂

#

It's one of the only models to allow live speaking w camera

wicked sage Feb 9, 2026, 5:06 AM

#

wicked talon It doesn't have dementia 🙂

ik but i feel like it does

#

https://tenor.com/view/vigilanteskass-gif-12238985672618509371

Tenor

wicked talon Feb 9, 2026, 5:06 AM

#

Oh

wicked sage Feb 9, 2026, 5:06 AM

#

cuz again

#

it kept saying stuff from earlier

honest verge Feb 9, 2026, 5:07 AM

#

wicked talon It's one of the only models to allow live speaking w camera

Flash is so good

#

Like pricing is very good

wicked talon Feb 9, 2026, 5:07 AM

#

Yeah

#

What ai is the best to roast me

#

Probably grok

#

Unfiltered asf

wicked sage Feb 9, 2026, 5:08 AM

#

wicked talon Unfiltered asf

bros ignoring my boy gork

wicked talon Feb 9, 2026, 5:09 AM

#

Ow

#

I JUST NOTICED THAT FINGER

#

😭

left lodge Feb 9, 2026, 5:11 AM

#

What do you think guys?
https://discord.com/channels/1340554757349179412/1470285687029760050

#

Grok is so wierd on arena.ai because it doesn't have system prompt which makes it somewhat less aggressive.

drifting crow Feb 9, 2026, 5:18 AM

#

damn google gemini pro 3 is lit, compared to basic thinking model its night and day

wicked sage Feb 9, 2026, 6:10 AM

#

hi back

surreal zephyr Feb 9, 2026, 6:23 AM

#

wicked sage yo i can die happily when sonnet 4.6/5 releases

Gpt 5.2 high ❤️‍🩹

wicked sage Feb 9, 2026, 6:24 AM

#

surreal zephyr Gpt 5.2 high ❤️‍🩹

gpt 5.2 evil

#

🧛

wicked talon Feb 9, 2026, 6:24 AM

#

surreal zephyr Gpt 5.2 high ❤️‍🩹

Lmao

surreal zephyr Feb 9, 2026, 6:24 AM

#

wicked sage gpt 5.2 evil

Nuh uh it beaten opus at making water

#

And gpt 5.3 codex beaten opus at making tank physics

wicked talon Feb 9, 2026, 6:24 AM

#

I just spent 2 hours with my good friend Gemini making a DNS server and failed

#

God knows how much water I used

surreal zephyr Feb 9, 2026, 6:24 AM

#

wicked talon I just spent 2 hours with my good friend Gemini making a DNS server and failed

Gpt will 1 shot it lol

wicked sage Feb 9, 2026, 6:24 AM

#

wicked talon God knows how much water I used

no its fine make a 2x2 water source then youre fine

#

infinite water

#

❤️‍🩹

wicked talon Feb 9, 2026, 6:25 AM

#

surreal zephyr Gpt will 1 shot it lol

Bro it was giving me riddles

wicked talon Feb 9, 2026, 6:25 AM

#

wicked sage infinite water

Ai activists will hate me

surreal zephyr Feb 9, 2026, 6:25 AM

#

wicked talon Bro it was giving me riddles

Are you using gpt 5.2 high in cli?

wicked talon Feb 9, 2026, 6:25 AM

#

surreal zephyr Are you using gpt 5.2 high in cli?

No lol I'm using Gemini 3 flash

surreal zephyr Feb 9, 2026, 6:26 AM

#

wicked talon No lol I'm using Gemini 3 flash

Bro gemini is on crack and has dementia

wicked talon Feb 9, 2026, 6:26 AM

#

surreal zephyr Bro gemini is on crack and has dementia

It obviously does

keen beacon Feb 9, 2026, 6:26 AM

#

Hey guys, are the devs going to fix Gemini Pro and Flash someday?

surreal zephyr Feb 9, 2026, 6:26 AM

#

It was nerfed to ground few weeks ago

wicked sage Feb 9, 2026, 6:26 AM

#

surreal zephyr Bro gemini is on crack and has dementia

THIS IS LITERALLY WHAT I SAID

#

I SAID GEMINI HAS DEMENTIA LIKE

wicked talon Feb 9, 2026, 6:26 AM

#

Bro it was giving me riddles

wicked sage Feb 9, 2026, 6:26 AM

#

MINUTES AGO

wicked talon Feb 9, 2026, 6:26 AM

#

I'm raging rn

#

I thought it installed a virus for a sec

surreal zephyr Feb 9, 2026, 6:26 AM

#

wicked sage I SAID GEMINI HAS DEMENTIA LIKE

Bro it got compressed by like 95% of size what do u expect

wicked sage Feb 9, 2026, 6:27 AM

#

surreal zephyr Bro it got compressed by like 95% of size what do u expect

wtf

surreal zephyr Feb 9, 2026, 6:27 AM

#

wicked talon I thought it installed a virus for a sec

Get codex cli its free rn

wicked sage Feb 9, 2026, 6:27 AM

#

it got NERFED as shii

surreal zephyr Feb 9, 2026, 6:27 AM

#

Then /model and select gpt 5.2 high (noncodex)

#

It will do ts in first try

wicked talon Feb 9, 2026, 6:27 AM

#

My ah don't wanna sit through another tutorial 😭

#

How do I install dat on Linux

wicked sage Feb 9, 2026, 6:28 AM

#

npm install -g @openai/codex if npm is installed

#

brew install --cask codex if brew

#

source: https://github.com/openai/codex

GitHub

GitHub - openai/codex: Lightweight coding agent that runs in your t...

Lightweight coding agent that runs in your terminal - openai/codex

wicked talon Feb 9, 2026, 6:28 AM

#

Ok

#

If I fail I'm the worst Linux Dev ever

#

Well not even a dev cuz I use ai

#

But it is what it is

surreal zephyr Feb 9, 2026, 6:29 AM

#

surreal zephyr Feb 9, 2026, 6:29 AM

#

surreal zephyr

#

Opus js sucks in real usage

#

It hallucinates as much as gemini js hides it well

#

🤣

wicked sage Feb 9, 2026, 6:29 AM

#

atp just use claude from its official source

#

🥱

surreal zephyr Feb 9, 2026, 6:30 AM

#

wicked sage atp just use claude from its official source

Why use claude then gpt way better

#

Gpt is less creative but actually knows what its doing

wicked sage Feb 9, 2026, 6:30 AM

#

Yo what if we just combine all ais into one ai

#

🥱 🥱 🥱

surreal zephyr Feb 9, 2026, 6:31 AM

#

Gpt 5.2h vs opus 4.6

wicked sage Feb 9, 2026, 6:31 AM

#

ill try gpt5.2h out

#

@wicked talon install npm

#

i forgot the command

surreal zephyr Feb 9, 2026, 6:31 AM

#

🥀

#

Js ask antigravity to install npm like i did

wicked sage Feb 9, 2026, 6:32 AM

#

sudo apt install npm i think

surreal zephyr Feb 9, 2026, 6:32 AM

#

😭 🔥

wicked sage Feb 9, 2026, 6:32 AM

#

i dont even know

wicked talon Feb 9, 2026, 6:32 AM

#

I hate my life

surreal zephyr Feb 9, 2026, 6:32 AM

#

Why delete

#

Also

#

Download npm

wicked talon Feb 9, 2026, 6:32 AM

#

Kk

surreal zephyr Feb 9, 2026, 6:32 AM

#

Npm i -g @/openai/codex

wicked sage Feb 9, 2026, 6:32 AM

#

actually wait download node js

surreal zephyr Feb 9, 2026, 6:32 AM

#

Iirc

wicked sage Feb 9, 2026, 6:32 AM

#

wait no im

wicked talon Feb 9, 2026, 6:32 AM

#

God why is it taking so slow

surreal zephyr Feb 9, 2026, 6:33 AM

#

wicked talon God why is it taking so slow

Bad wifi

#

💔

wicked sage Feb 9, 2026, 6:33 AM

#

is it 2 bytes per hour

wicked talon Feb 9, 2026, 6:33 AM

#

surreal zephyr Bad wifi

Hell nah

surreal zephyr Feb 9, 2026, 6:33 AM

#

wicked talon Hell nah

Slow pc

#

💔

wicked talon Feb 9, 2026, 6:33 AM

#

Badd WiFi my ahh

rn_image_picker_lib_temp_f3b5bd9a-ad51-4cb5-8447-eec4f596e665.jpg

wicked talon Feb 9, 2026, 6:34 AM

#

surreal zephyr Slow pc

Nuh uh

wicked sage Feb 9, 2026, 6:34 AM

#

wicked talon Badd WiFi my ahh

800 mbps

#

god DAMN

#

twin what router service are you using

#

❤️‍🩹

surreal zephyr Feb 9, 2026, 6:34 AM

#

wicked sage twin what router service are you using

Cable prolly

wicked talon Feb 9, 2026, 6:35 AM

#

wicked sage twin what router service are you using

BT 🙂

wicked sage Feb 9, 2026, 6:35 AM

#

ah yeah makes sense

surreal zephyr Feb 9, 2026, 6:35 AM

#

wicked talon BT 🙂

Bluetooh

wicked talon Feb 9, 2026, 6:35 AM

#

surreal zephyr Cable prolly

Nope that's wireless bro

surreal zephyr Feb 9, 2026, 6:35 AM

#

wicked talon Nope that's wireless bro

💀

#

5g on pc is crazy

wicked sage Feb 9, 2026, 6:35 AM

#

bluetooth connected

surreal zephyr Feb 9, 2026, 6:35 AM

#

Codex my beloved ❤️‍🩹

#

Best

#

Gpt on top

wicked talon Feb 9, 2026, 6:35 AM

#

wicked sage bluetooth connected

BT is British telecom broo

surreal zephyr Feb 9, 2026, 6:35 AM

#

I glaze gpt

honest verge Feb 9, 2026, 6:36 AM

#

surreal zephyr I glaze gpt

I glaze mistral

#

It's too good

surreal zephyr Feb 9, 2026, 6:36 AM

#

Gpt actually tells people if they want bs instead of hallucinating answers

honest verge Feb 9, 2026, 6:36 AM

#

For every task

surreal zephyr Feb 9, 2026, 6:36 AM

#

honest verge I glaze mistral

Mistral is like polar opposite

#

Of gpt

wicked sage Feb 9, 2026, 6:36 AM

#

i have NEVER used mistral

wicked talon Feb 9, 2026, 6:36 AM

#

Me neither

honest verge Feb 9, 2026, 6:36 AM

#

wicked sage i have NEVER used mistral

Use it

surreal zephyr Feb 9, 2026, 6:36 AM

#

wicked sage i have NEVER used mistral

(Its the worst one, hes joking)

honest verge Feb 9, 2026, 6:36 AM

#

It's the best ai ever

wicked talon Feb 9, 2026, 6:37 AM

#

honest verge It's the best ai ever

I bet my eye that it's not

wicked sage Feb 9, 2026, 6:37 AM

#

oh my god mistral is french

surreal zephyr Feb 9, 2026, 6:37 AM

#

If gemini is crack abuser, then mistral is?

wicked sage Feb 9, 2026, 6:37 AM

#

https://tenor.com/view/vigilanteskass-gif-12238985672618509371

Tenor

surreal zephyr Feb 9, 2026, 6:37 AM

#

Mistral is like a monkey

#

Lowk

wicked talon Feb 9, 2026, 6:37 AM

#

Gng npm installed now what

honest verge Feb 9, 2026, 6:37 AM

#

surreal zephyr If gemini is crack abuser, then mistral is?

But it's 3 times

wicked sage Feb 9, 2026, 6:37 AM

#

wicked talon Gng npm installed now what

install codex again

honest verge Feb 9, 2026, 6:37 AM

#

More

wicked sage Feb 9, 2026, 6:38 AM

#

npm install -g @openai/codex

surreal zephyr Feb 9, 2026, 6:38 AM

#

Mistral would install codex without npm 💔 ✌️

honest verge Feb 9, 2026, 6:38 AM

#

wicked sage install codex again

Install mistral

wicked sage Feb 9, 2026, 6:38 AM

#

surreal zephyr Mistral would install codex without npm 💔 ✌️

sudo apt install codex fr

surreal zephyr Feb 9, 2026, 6:38 AM

#

wicked sage `sudo apt install codex` fr

"It appears your node js installation is corrupted, let me wipe windows to fresh install"

wicked talon Feb 9, 2026, 6:38 AM

#

Tf you mean permission denied

rn_image_picker_lib_temp_6b584f20-8e0d-4ddc-bb51-a5cc721e8d65.jpg

surreal zephyr Feb 9, 2026, 6:39 AM

#

wicked talon Tf you mean permission denied

Nodejs is free vrotato chip

honest verge Feb 9, 2026, 6:39 AM

#

wicked talon Tf you mean permission denied

Lol

wicked sage Feb 9, 2026, 6:39 AM

#

wicked talon Tf you mean permission denied

pls show me the full error so i can understand
or just ask gpt or whatever

#

actually

#

wait no

#

i thought of sudo codex i think that can help

#

im not sure

honest verge Feb 9, 2026, 6:39 AM

#

wicked talon Tf you mean permission denied

Install it through mistral

surreal zephyr Feb 9, 2026, 6:39 AM

#

Js

#

Start terminal

#

As admin

honest verge Feb 9, 2026, 6:40 AM

#

It's going to work

surreal zephyr Feb 9, 2026, 6:40 AM

#

💔

wicked talon Feb 9, 2026, 6:40 AM

#

Oh wait I got denied

honest verge Feb 9, 2026, 6:40 AM

#

Why not through mistral

#

It can install everything

wicked talon Feb 9, 2026, 6:40 AM

#

How does my ahh run as administrator on Linux

wicked sage Feb 9, 2026, 6:40 AM

#

Hey claude
Generate.

wicked sage Feb 9, 2026, 6:40 AM

#

wicked talon How does my ahh run as administrator on Linux

just do sudo

wicked talon Feb 9, 2026, 6:40 AM

#

Sudo wat

wicked sage Feb 9, 2026, 6:40 AM

#

sudo codex

#

i hope

#

just try it
it might work

wicked talon Feb 9, 2026, 6:41 AM

#

surreal zephyr Feb 9, 2026, 6:41 AM

#

Sudo npm install....

#

Or js install antigravity then ask it to install codex

wicked talon Feb 9, 2026, 6:41 AM

#

Npm is installed

surreal zephyr Feb 9, 2026, 6:41 AM

#

💔

wicked sage Feb 9, 2026, 6:42 AM

#

wicked talon Npm is installed

did you install codex

#

wait let me try out

wicked talon Feb 9, 2026, 6:42 AM

#

wicked sage did you install codex

Nope

surreal zephyr Feb 9, 2026, 6:42 AM

#

wicked talon Npm is installed

Son im crine

#

💔

honest verge Feb 9, 2026, 6:42 AM

#

surreal zephyr Sudo npm install....

INSTALL MISTRAL ALREADY

#

PLS

#

IT'S GOING TO WORK

wicked sage Feb 9, 2026, 6:42 AM

#

wicked talon Nope

son

#

💔

surreal zephyr Feb 9, 2026, 6:42 AM

#

honest verge INSTALL MISTRAL ALREADY

I give mistral systemwide access no sandbox full internet perms

wicked sage Feb 9, 2026, 6:42 AM

#

wait no im stupid

#

sudo npm install -g @openai/codex

honest verge Feb 9, 2026, 6:42 AM

#

surreal zephyr I give mistral systemwide access no sandbox full internet perms

It deleted everything?

wicked sage Feb 9, 2026, 6:42 AM

#

if it says some bullsh like permission denied do sudo

surreal zephyr Feb 9, 2026, 6:43 AM

#

🥀

honest verge Feb 9, 2026, 6:43 AM

#

What the dot2

wicked sage Feb 9, 2026, 6:43 AM

#

son im crine who is using dot2

#

nvm pineapple is using it

honest verge Feb 9, 2026, 6:44 AM

#

wicked sage son im crine who is using dot2

I'm using mistral 2

wicked sage Feb 9, 2026, 6:44 AM

#

alright blud

wicked talon Feb 9, 2026, 6:44 AM

#

Now what

wicked sage Feb 9, 2026, 6:44 AM

#

wicked talon Now what

Do the codex Thing

#

npm install -g @openai/codex

#

or sudo npm install -g @openai/codex

wicked talon Feb 9, 2026, 6:44 AM

#

I did

wicked sage Feb 9, 2026, 6:44 AM

#

codex

#

i mean

#

do codex

wicked talon Feb 9, 2026, 6:45 AM

#

I did

wicked sage Feb 9, 2026, 6:45 AM

#

what happebned

wicked talon Feb 9, 2026, 6:45 AM

#

"sudo npm install -g @raven heart/codex

wicked sage Feb 9, 2026, 6:45 AM

#

LMAO

wicked talon Feb 9, 2026, 6:45 AM

#

It installed a package

wicked talon Feb 9, 2026, 6:45 AM

#

wicked sage LMAO

🙂

wicked sage Feb 9, 2026, 6:45 AM

#

ok my bad

surreal zephyr Feb 9, 2026, 6:45 AM

#

Bro

#

Write "codex"

#

To start it

wicked sage Feb 9, 2026, 6:45 AM

#

wicked talon 🙂

i said do codex

honest verge Feb 9, 2026, 6:45 AM

#

I asked Gemini to create an image of what mistral can do is it accurate?

wicked sage Feb 9, 2026, 6:45 AM

#

https://github.com/openai/codex

GitHub

GitHub - openai/codex: Lightweight coding agent that runs in your t...

Lightweight coding agent that runs in your terminal - openai/codex

wicked talon Feb 9, 2026, 6:45 AM

#

wicked sage i said do `codex`

Oh ok

wicked sage Feb 9, 2026, 6:46 AM

#

https://tenor.com/view/breaking-in-windows-linux-meme-breaking-into-a-windows-user-gif-27138745

Tenor

wicked talon Feb 9, 2026, 6:46 AM

#

It's working 🙂

wicked sage Feb 9, 2026, 6:46 AM

#

lets go

wicked talon Feb 9, 2026, 6:46 AM

#

Shioo

rn_image_picker_lib_temp_780c1ca8-e723-4b50-9537-665b11be3954.jpg

#

Less goo

wicked sage Feb 9, 2026, 6:46 AM

#

ok you did it

surreal zephyr Feb 9, 2026, 6:47 AM

#

wicked talon Shioo

Set model

#

To gpt 5.2 high

#

(Not codex)

wicked sage Feb 9, 2026, 6:47 AM

#

uhh

#

model

surreal zephyr Feb 9, 2026, 6:47 AM

#

/model

wicked sage Feb 9, 2026, 6:47 AM

#

actually no im

#

stupid

#

yeah ty

wicked talon Feb 9, 2026, 6:48 AM

#

Uh

rn_image_picker_lib_temp_9eb44638-d4ae-4139-93c4-6aa0b9a803fc.jpg

surreal zephyr Feb 9, 2026, 6:48 AM

#

wicked talon Uh

3rd

wicked sage Feb 9, 2026, 6:48 AM

#

wicked talon Uh

3rd

surreal zephyr Feb 9, 2026, 6:48 AM

#

Then reasoning high

#

Not xhigh

#

High better overall

wicked talon Feb 9, 2026, 6:48 AM

#

rn_image_picker_lib_temp_946bd63b-15f7-48ff-a8c3-20b9c76dcc2f.jpg

wicked sage Feb 9, 2026, 6:48 AM

#

wicked talon

high

surreal zephyr Feb 9, 2026, 6:48 AM

#

wicked talon

3rd

wicked sage Feb 9, 2026, 6:48 AM

#

not extra high

wicked talon Feb 9, 2026, 6:48 AM

#

Kk bet

surreal zephyr Feb 9, 2026, 6:48 AM

#

Extra wastes tokens and gets compressed during work

#

So extra is worse than high while being slower

wicked talon Feb 9, 2026, 6:49 AM

#

Time to setup a DNS server

honest verge Feb 9, 2026, 6:49 AM

#

Why complain about xhigh when you can install mistral

#

Like

#

Why

surreal zephyr Feb 9, 2026, 6:49 AM

#

honest verge Why complain about xhigh when you can install mistral

🏳️‍⚧️

wicked talon Feb 9, 2026, 6:49 AM

#

honest verge Why complain about xhigh when you can install mistral

If I could swear I would ask grok to make me 100 swears to throw at you

honest verge Feb 9, 2026, 6:50 AM

#

surreal zephyr 🏳️‍⚧️

Actually no

#

I'm gay

surreal zephyr Feb 9, 2026, 6:50 AM

#

honest verge Actually no

🫃

honest verge Feb 9, 2026, 6:50 AM

#

With mistral

surreal zephyr Feb 9, 2026, 6:50 AM

#

I run mistral locally on an usb stick 🗣️

wicked sage Feb 9, 2026, 6:50 AM

#

honest verge Why complain about xhigh when you can install mistral

why is bro acting like "instead of fried chicken eat grilled chicken"

surreal zephyr Feb 9, 2026, 6:51 AM

#

wicked sage why is bro acting like "instead of fried chicken eat grilled chicken"

No hes like "instead of eating chicken eat sand"

honest verge Feb 9, 2026, 6:51 AM

#

Because mistral is so strong

wicked sage Feb 9, 2026, 6:51 AM

#

https://tenor.com/view/vigilanteskass-gif-12238985672618509371

Tenor

#

ok lets check the lmarena leaderboards then

honest verge Feb 9, 2026, 6:52 AM

#

wicked sage ok lets check the lmarena leaderboards then

Top 1

wicked sage Feb 9, 2026, 6:53 AM

#

no its like top 70 something

surreal zephyr Feb 9, 2026, 6:53 AM

#

honest verge Top 1

Wheres that one

#

Leaderboard

surreal zephyr Feb 9, 2026, 6:53 AM

#

wicked sage no its like top 70 something

Look on the right

#

(The lower the better)

wicked sage Feb 9, 2026, 6:54 AM

#

surreal zephyr Look on the right

god damn

surreal zephyr Feb 9, 2026, 6:55 AM

#

wicked sage god damn

Gpt models do really good at it

#

The newer ones

#

Like 5.2 and 5.3c

#

Thats why gpt best

wicked talon Feb 9, 2026, 6:55 AM

#

I'm gonna ask chatgpt to run minstrel locally

wicked sage Feb 9, 2026, 6:55 AM

#

surreal zephyr Like 5.2 and 5.3c

wait wdym 5.3c

#

https://tenor.com/view/durr-yippee-durr-drooling-emoji-drool-face-gif-5621523889348608699

Tenor

honest verge Feb 9, 2026, 6:56 AM

#

Mistral is better than opus 4

Screenshot_2026-02-09-09-55-33-378_com.android.chrome-edit.jpg

#

That's why he's the goat

wicked sage Feb 9, 2026, 6:56 AM

#

honest verge Mistral is better than opus 4

opus 4 tho...

#

its not opus 4.1 tho

wicked talon Feb 9, 2026, 6:56 AM

#

honest verge Mistral is better than opus 4

Qwen v1 outranks it lmaoo

surreal zephyr Feb 9, 2026, 6:56 AM

#

wicked sage wait wdym 5.3c

5.3 codex (paid only)

wicked talon Feb 9, 2026, 6:56 AM

#

surreal zephyr 5.3 codex (paid only)

Whipping out my credit card rq

wicked sage Feb 9, 2026, 6:57 AM

#

surreal zephyr 5.3 codex (paid only)

ohh

surreal zephyr Feb 9, 2026, 6:57 AM

#

wicked talon Whipping out my credit card rq

No other model did this simulation correctly

#

5.3 Codex did first try

honest verge Feb 9, 2026, 6:57 AM

#

surreal zephyr No other model did this simulation correctly

Mistral did -1 try

#

He's the goat

surreal zephyr Feb 9, 2026, 6:57 AM

#

surreal zephyr No other model did this simulation correctly

Opus 4.6 and 4.5 failed miserably

honest verge Feb 9, 2026, 6:58 AM

#

surreal zephyr Opus 4.6 and 4.5 failed miserably

Try mistral

surreal zephyr Feb 9, 2026, 6:58 AM

#

Opus made a pretty tank that wasnt working

honest verge Feb 9, 2026, 6:58 AM

#

Please

#

What the hell

Screenshot_2026-02-09-09-58-38-848_com.android.chrome-edit.jpg

#

Why 3.2 exp is better than release 3.2?

#

🥀

surreal zephyr Feb 9, 2026, 6:59 AM

#

honest verge What the hell

What bench

wicked sage Feb 9, 2026, 6:59 AM

#

#

-41 is crazy btw

surreal zephyr Feb 9, 2026, 7:00 AM

#

wicked sage

The gemini here is the pre lobotomy version btw

honest verge Feb 9, 2026, 7:00 AM

#

surreal zephyr What bench

Lmarena

surreal zephyr Feb 9, 2026, 7:00 AM

#

Gemini is rn at maybe -30

#

Gpt 5.2 and 5.3c are at top of that lb rn

wicked sage Feb 9, 2026, 7:00 AM

#

ye makes sense

#

btw what plan do you have to get for uhh 5.3c

surreal zephyr Feb 9, 2026, 7:01 AM

#

(Not xhigh, xhigh sucks)

surreal zephyr Feb 9, 2026, 7:01 AM

#

wicked sage btw what plan do you have to get for uhh 5.3c

The 20$

wicked sage Feb 9, 2026, 7:01 AM

#

ah ok

surreal zephyr Feb 9, 2026, 7:01 AM

#

5.2 high is more creative

honest verge Feb 9, 2026, 7:01 AM

#

surreal zephyr (Not xhigh, xhigh sucks)

Mistral is the best

wicked sage Feb 9, 2026, 7:01 AM

#

honest verge Mistral is the best

surreal zephyr Feb 9, 2026, 7:01 AM

#

5.3c is the professional soft engineer

honest verge Feb 9, 2026, 7:01 AM

#

Why you can't use mistral

#

Like Is it too hard?

surreal zephyr Feb 9, 2026, 7:02 AM

#

wicked sage

Sonnet 4 is more trustworthy than opus 4.5 and 4.6 rn btw

wicked sage Feb 9, 2026, 7:02 AM

#

surreal zephyr Sonnet 4 is more trustworthy than opus 4.5 and 4.6 rn btw

how the fuh

surreal zephyr Feb 9, 2026, 7:02 AM

#

wicked sage Feb 9, 2026, 7:02 AM

#

why do some companies like anthropic and

#

google

wicked talon Feb 9, 2026, 7:02 AM

#

I quit codex

#

I just deleted it

wicked sage Feb 9, 2026, 7:02 AM

#

try to lobotimize the ais and make them say random stuff

wicked talon Feb 9, 2026, 7:03 AM

#

Time to go to minstrel

surreal zephyr Feb 9, 2026, 7:03 AM

#

wicked sage google

Because gemini and opus costed like 20$ per prompt to run

#

🥀

wicked sage Feb 9, 2026, 7:03 AM

#

surreal zephyr Because gemini and opus costed like 20$ per prompt to run

are we deadass?

#

https://cdn.discordapp.com/attachments/1357731226189824091/1376592209612116090/togif.gif

surreal zephyr Feb 9, 2026, 7:03 AM

#

wicked sage are we deadass?

They used million tokens for thinking in background

#

🥀

#

Opus still costs a lot to run cuz it was slightly lobotomized

#

Gemini was made cheap and lobotomized to ground...

honest verge Feb 9, 2026, 7:05 AM

#

surreal zephyr Gemini was made cheap and lobotomized to ground...

Actually I think Gemini 2.0 not lobotomized is on par with Gemini 3 pro now

surreal zephyr Feb 9, 2026, 7:05 AM

#

Anthropic and google basically brute forcing benchmarks

#

🥀

honest verge Feb 9, 2026, 7:06 AM

#

Then who mistral is

wicked talon Feb 9, 2026, 7:06 AM

#

Qwen for life 🙂

surreal zephyr Feb 9, 2026, 7:06 AM

#

I had gemini run for half hour in ag after a fix bug prompt and it succeded

#

(Gpt found in 30s)

wicked talon Feb 9, 2026, 7:06 AM

#

Lmao

honest verge Feb 9, 2026, 7:06 AM

#

I need my dear mistral

#

I love him

wicked talon Feb 9, 2026, 7:06 AM

#

honest verge I need my dear mistral

Eww

#

Qwen for life babyyy

wicked sage Feb 9, 2026, 7:07 AM

#

@mistral Hello.

honest verge Feb 9, 2026, 7:07 AM

#

@shy jay

surreal zephyr Feb 9, 2026, 7:07 AM

#

Pre nerf gemini was better than opus btw

honest verge Feb 9, 2026, 7:07 AM

#

We need you

#

Pls

#

I'm dead

#

But at least I summoned him

zinc oyster Feb 9, 2026, 7:08 AM

#

Hello

wicked talon Feb 9, 2026, 7:08 AM

#

Wait qwen has it's own discord server

honest verge Feb 9, 2026, 7:08 AM

#

wicked talon Wait qwen has it's own discord server

Lol

surreal zephyr Feb 9, 2026, 7:09 AM

#

Tbh

#

Openai is the only one

zinc oyster Feb 9, 2026, 7:09 AM

#

Does everyone have this bug where there's a captcha that's impossible to pass,just error?

surreal zephyr Feb 9, 2026, 7:09 AM

#

That knows how to make llms

wicked talon Feb 9, 2026, 7:10 AM

#

zinc oyster Does everyone have this bug where there's a captcha that's impossible to pass,ju...

Just a you issue

honest verge Feb 9, 2026, 7:10 AM

#

surreal zephyr Openai is the only one

It sucks

wicked talon Feb 9, 2026, 7:10 AM

#

surreal zephyr That knows how to make llms

Nah bro qwen for life

honest verge Feb 9, 2026, 7:10 AM

#

Mistral is better

surreal zephyr Feb 9, 2026, 7:10 AM

#

wicked talon Nah bro qwen for life

Gpt would rather kill humanity than not follow a prompt

#

🔥

honest verge Feb 9, 2026, 7:10 AM

#

I'm glazing my mistral so much I'm tired of saying his name 🥀

#

🥀 🥀

wicked sage Feb 9, 2026, 7:11 AM

#

bro gained self awareness

wicked talon Feb 9, 2026, 7:11 AM

#

surreal zephyr Gpt would rather kill humanity than not follow a prompt

Nah qwen is a truth speaker

honest verge Feb 9, 2026, 7:11 AM

#

wicked sage bro gained self awareness

Wait I can be free without saying mistral?

wicked talon Feb 9, 2026, 7:11 AM

#

If your wrong he says it

honest verge Feb 9, 2026, 7:12 AM

#

wicked talon If your wrong he says it

Have you downloaded Mistral yet?

wicked talon Feb 9, 2026, 7:12 AM

#

Qwen coder casually having 1.04m tokens

rn_image_picker_lib_temp_63e7917c-5892-41da-969e-4f1735b1d69c.jpg

wicked talon Feb 9, 2026, 7:12 AM

#

honest verge Have you downloaded Mistral yet?

Yeah it was ahh

wicked talon Feb 9, 2026, 7:12 AM

#

wicked talon Qwen coder casually having 1.04m tokens

48,000 more then Gemini

surreal zephyr Feb 9, 2026, 7:13 AM

#

wicked talon Qwen coder casually having 1.04m tokens

Who cares if its usable for first 100 only

#

🥀

wicked talon Feb 9, 2026, 7:13 AM

#

surreal zephyr Who cares if its usable for first 100 only

Nuh uh

wicked talon Feb 9, 2026, 7:13 AM

#

honest verge Have you downloaded Mistral yet?

Only 128,000 tokens bro

surreal zephyr Feb 9, 2026, 7:13 AM

#

In gpt we trust

surreal zephyr Feb 9, 2026, 7:13 AM

#

wicked talon Nuh uh

Gemini is usable for first 50k

wicked talon Feb 9, 2026, 7:13 AM

#

And only 4000 tokens output

surreal zephyr Feb 9, 2026, 7:13 AM

#

Gpt works solid for 200k i tested so far

honest verge Feb 9, 2026, 7:13 AM

#

surreal zephyr In gpt we trust

In mistral we love

#

Or crack

surreal zephyr Feb 9, 2026, 7:14 AM

#

honest verge In mistral we love

With mistral we smoke crack*

#

With gpt we develop

#

With gemini we get older and have dementia

wicked talon Feb 9, 2026, 7:14 AM

#

surreal zephyr Gemini is usable for first 50k

Dang qwen coder can only generate 8192 tokens

#

rn_image_picker_lib_temp_a67bd32c-1370-4d2d-910d-38acf068f35b.jpg

honest verge Feb 9, 2026, 7:15 AM

#

wicked talon Dang qwen coder can only generate 8192 tokens

LOL

honest verge Feb 9, 2026, 7:15 AM

#

wicked talon

LOL

wicked talon Feb 9, 2026, 7:15 AM

#

Minstral is same amount

left lodge Feb 9, 2026, 7:16 AM

#

wicked talon Dang qwen coder can only generate 8192 tokens

Never ask the models about itself if it doesn't have search tool access it will always hallucinate

honest verge Feb 9, 2026, 7:16 AM

#

wicked talon Minstral is same amount

It's actually 5 times more

wicked talon Feb 9, 2026, 7:16 AM

#

left lodge Never ask the models about itself if it doesn't have search tool access it will ...

Oh

honest verge Feb 9, 2026, 7:16 AM

#

left lodge Never ask the models about itself if it doesn't have search tool access it will ...

Cuz they are dumb

#

Because of the knowledge cutoff

#

Btw what cutoff opus 4.6 has?

left lodge Feb 9, 2026, 7:17 AM

#

You can search yourself using a search engine,
I suggest brave search

wicked talon Feb 9, 2026, 7:17 AM

#

It got context right

rn_image_picker_lib_temp_e20ea227-bfa8-4e2a-9c70-ff46804139a4.jpg

#

Idk about everything else

wicked talon Feb 9, 2026, 7:17 AM

#

left lodge You can search yourself using a search engine, I suggest brave search

Brave is a honeypot

#

Trust

left lodge Feb 9, 2026, 7:17 AM

#

🤦

wicked talon Feb 9, 2026, 7:17 AM

#

Use duckduckgo

left lodge Feb 9, 2026, 7:17 AM

#

Its a damn search engine

wicked talon Feb 9, 2026, 7:18 AM

#

left lodge Its a damn search engine

Still a honeypot

left lodge Feb 9, 2026, 7:18 AM

#

Do you even know what does honeypot mean? How do you think it is a honeypot and duckduckgo isn't?

honest verge Feb 9, 2026, 7:19 AM

#

Lol

Screenshot_2026-02-09-10-19-21-244_com.google.android.googlequicksearchbox-edit.jpg

wicked talon Feb 9, 2026, 7:20 AM

#

left lodge Do you even know what does honeypot mean? How do you think it is a honeypot and ...

Well first if you want to earn of ads you have to use id

left lodge Feb 9, 2026, 7:20 AM

#

wicked talon Use duckduckgo

It is literally a bing wrapper

surreal zephyr Feb 9, 2026, 7:20 AM

#

left lodge Never ask the models about itself if it doesn't have search tool access it will ...

Depends on sys prompt

#

Also 3.0 flash is correct

#

It says it doesnt know

wicked talon Feb 9, 2026, 7:20 AM

#

And it was found brave was leaking DNS queries

honest verge Feb 9, 2026, 7:20 AM

#

wicked talon Well first if you want to earn of ads you have to use id

Well is mistral good or bad

wicked talon Feb 9, 2026, 7:21 AM

#

honest verge Well is mistral good or bad

Meh

left lodge Feb 9, 2026, 7:21 AM

#

surreal zephyr Depends on sys prompt

Arena.ai doesn't have system prompts

#

But still the most reliable results are with search tool enabled

honest verge Feb 9, 2026, 7:22 AM

#

left lodge Arena.ai doesn't have system prompts

That's why grook is meh

#

"Grook"

surreal zephyr Feb 9, 2026, 7:22 AM

#

left lodge But still the most reliable results are with search tool enabled

Ya

left lodge Feb 9, 2026, 7:22 AM

#

Relying on its training data or hoping it knows it because the info might be inside its system prompt is just weird

surreal zephyr Feb 9, 2026, 7:23 AM

#

left lodge Relying on its training data or hoping it knows it because the info might be ins...

Im js saying models used to have their info in sys prompts

#

Bro is mistral

left lodge Feb 9, 2026, 7:23 AM

#

You dont have access to its system prompts even on their native platforms, thats not reliable

signal apex Feb 9, 2026, 7:23 AM

#

guys mine has been like this for almost 20 minutes, does it take it that long to response 😭?

wicked sage Feb 9, 2026, 7:23 AM

#

signal apex guys mine has been like this for almost 20 minutes, does it take it that long to...

its opus 4.6 thinking 😭

#

of course itdoes that

surreal zephyr Feb 9, 2026, 7:24 AM

#

signal apex guys mine has been like this for almost 20 minutes, does it take it that long to...

Opus is bruteforcing the answer

left lodge Feb 9, 2026, 7:24 AM

#

There is hard 6 min limit, if thats reached when a model is generating a response it cuts off the response and throws Something went wrong with this response, please try again. error

I reported this way back on November of 2025 but they haven't done anything.

Models are literally made to think for hours and here they have a hard 6 min cutoff 🤦

And btw this 6 minute limit is on every single model available not only opus 4.6.

surreal zephyr Feb 9, 2026, 7:24 AM

#

It thinks forever untill it hallucinates correctly

#

Thats how opuses work

left lodge Feb 9, 2026, 7:25 AM

#

surreal zephyr Thats how opuses work

Thats not how things work bro 😭

signal apex Feb 9, 2026, 7:25 AM

#

so i just wait?

surreal zephyr Feb 9, 2026, 7:25 AM

#

Opus works by having ton of internal thinking

#

The only reason why it does so well on benchs

left lodge Feb 9, 2026, 7:26 AM

#

signal apex so i just wait?

No it might be stuck there forever.
Check in 10mins if it is still stuck open new chat

surreal zephyr Feb 9, 2026, 7:26 AM

#

surreal zephyr The only reason why it does so well on benchs

Gemini used to do that too pre nerf

#

Opus is literally gemini 3 xxxxhigh

left lodge Feb 9, 2026, 7:26 AM

#

honest verge Lol

What is this?

wicked talon Feb 9, 2026, 7:26 AM

#

Qwen is Gemini 3 pro but on steroids

#

🙂

honest verge Feb 9, 2026, 7:27 AM

#

left lodge What is this?

New Google model

#

Paperbanana

left lodge Feb 9, 2026, 7:27 AM

#

Is that official?

#

Source links?

surreal zephyr Feb 9, 2026, 7:27 AM

#

left lodge Is that official?

Doubt

#

But notebooklm uses similiar thing

honest verge Feb 9, 2026, 7:27 AM

#

left lodge Is that official?

It's official

wicked talon Feb 9, 2026, 7:28 AM

#

#

🙂

surreal zephyr Feb 9, 2026, 7:28 AM

#

wicked talon

Yeah notebooklm uses ts

honest verge Feb 9, 2026, 7:29 AM

#

Imagine paperbanana

#

This name sucks

#

Like banana from paper

wicked talon Feb 9, 2026, 7:29 AM

#

It's Google what do you expect

left lodge Feb 9, 2026, 7:33 AM

#

Its just a side research project developed by Google Research in collaboration with Peking University

#

Not much

surreal zephyr Feb 9, 2026, 7:36 AM

#

honest verge Imagine paperbanana

Its nano banana copy fine tuned for graphs

gusty helm Feb 9, 2026, 7:37 AM

#

hey! If im not mistaken claude should have a thinking version too on the 4.6? Is it not coming to arena or is it not ranked yet/code named/collecting votes?

surreal zephyr Feb 9, 2026, 7:37 AM

#

left lodge Its just a side research project developed by Google Research in collaboration w...

Its notebooklm thing imo

surreal zephyr Feb 9, 2026, 7:37 AM

#

gusty helm hey! If im not mistaken claude should have a thinking version too on the 4.6? Is...

It already is wym

#

Its so bad it didnt get on lb

#

Lol

honest verge Feb 9, 2026, 7:37 AM

#

surreal zephyr Lol

Don't use it

gusty helm Feb 9, 2026, 7:37 AM

#

oh rly harold ? didnt see that

surreal zephyr Feb 9, 2026, 7:37 AM

#

It gets stuck in loop everytime and crashes

#

Lol

honest verge Feb 9, 2026, 7:37 AM

#

It can't really make big projects

surreal zephyr Feb 9, 2026, 7:37 AM

#

Its unusable

honest verge Feb 9, 2026, 7:37 AM

#

Only small

#

Normal 4.6 is way better

gusty helm Feb 9, 2026, 7:39 AM

#

that's an odd ball; yeah I see can use it in direct chat

#

but not included in leaderboard at all lol

surreal zephyr Feb 9, 2026, 7:41 AM

#

gusty helm that's an odd ball; yeah I see can use it in direct chat

Try to use it then

honest verge Feb 9, 2026, 7:41 AM

#

gusty helm but not included in leaderboard at all lol

Cuz it can't be tested in leaderboard properly

surreal zephyr Feb 9, 2026, 7:41 AM

#

It will get stuck and crash

#

90% times

#

🥀

honest verge Feb 9, 2026, 7:41 AM

#

surreal zephyr 90% times

I think that's why it isn't in the leaderboard

#

It just can't do anything

surreal zephyr Feb 9, 2026, 7:41 AM

#

honest verge I think that's why it isn't in the leaderboard

Yeah because it sucks

#

Worse than mistral atp

#

Mistral did the tank

#

Opus crashed

#

Mistral wins

#

🥀

gusty helm Feb 9, 2026, 7:42 AM

#

I see; prob needs some more work on it before it's usuable

surreal zephyr Feb 9, 2026, 7:42 AM

#

gusty helm I see; prob needs some more work on it before it's usuable

Just use 4.5 its better overall

#

Or gpt 5.3 codex or gpt 5.2 high both are better

#

😔

gusty helm Feb 9, 2026, 7:42 AM

#

using neither 😄 was just curios came from a trip and saw 4.6 #1

honest verge Feb 9, 2026, 7:42 AM

#

surreal zephyr Mistral wins

But uhhm that's the tank that mistral did

surreal zephyr Feb 9, 2026, 7:42 AM

#

gusty helm Feb 9, 2026, 7:43 AM

#

did not expect google to lose R1 anytime soon

surreal zephyr Feb 9, 2026, 7:43 AM

#

honest verge But uhhm that's the tank that mistral did

And opus did none

cyan kettle Feb 9, 2026, 7:43 AM

#

what do yall think is the best model right now?

surreal zephyr Feb 9, 2026, 7:43 AM

#

cyan kettle what do yall think is the best model right now?

Gpt 5.3c and 5.2

#

Easily

#

No competition

#

Lol

honest verge Feb 9, 2026, 7:43 AM

#

surreal zephyr And opus did none

Opus 4.6 not thinking did this

gusty helm Feb 9, 2026, 7:43 AM

#

really depends for what