#general

1 messages · Page 253 of 1

verbal nimbus
honest verge
verbal nimbus
#

Maybe it needs some parameter tweaking

honest verge
#

WHAT IS THIS

surreal zephyr
#

5.3 codex nailed it btw

surreal zephyr
verbal nimbus
north obsidian
honest verge
surreal zephyr
#

W openai for gatekeeping

north obsidian
surreal zephyr
north obsidian
#

Oh nice

honest verge
#

Also sucks for deleting 4o

#

4o is Still the best

surreal zephyr
quartz light
#

opus 4.6

honest verge
#

Not bad

surreal zephyr
#

Still here for a week

quartz light
surreal zephyr
verbal nimbus
#

Opus 4.6 is overthinking

#

It's been thinking for ages

quartz light
#

arena?

verbal nimbus
quartz light
#

itll time out

verbal nimbus
#

It should count as a fail if it times out

surreal zephyr
verbal nimbus
#

It can't just overthink and time out whenever the problem is too hard, then get the vote forefeited

surreal zephyr
#

Opus be like gpt 5.2 xxxxxhigh

honest verge
#

Peak

surreal zephyr
honest verge
#

best ai model ever?

verbal nimbus
surreal zephyr
#

An llm who thinks all the time

#

Has nothing to think about except thoughts

#

🥀

verbal nimbus
#

Tokens: 💸

surreal zephyr
#

I got 1m context window

#

Im gona use the 1m context window

#

🤣

#

60$ per prompt btw 😭 ✌️

verbal nimbus
#

Is this the summary or the actual thinking

honest verge
#

ITS IMPOSSIBLE

verbal nimbus
#

Because it says "I did X" and "I did Y" but I don't see it

surreal zephyr
surreal zephyr
honest verge
surreal zephyr
verbal nimbus
surreal zephyr
#

Opus has A LOT of internal thinking

verbal nimbus
#

But this one looks like the actual thinking 🤔

surreal zephyr
#

Like more than all other models

honest verge
surreal zephyr
verbal nimbus
#

Like is this a summary or is it actually hallucinating

surreal zephyr
#

It generates gazzilions of tokens under the hood

#

Thats why it costs so much

#

(The summaries overall sound like bs and derailed from reality for all models btw)

verbal nimbus
#

Still thinking...

surreal zephyr
#

Like summary can say "im using this" and the model uses that

honest verge
scarlet spire
#

Well then. thinkies That's quite the turnstile twitch

honest verge
#

Like I'm trying to create snake game with mistral

verbal nimbus
#

Wait what have you been doing this entire time then, Opus 🤔

honest verge
#

It can't do this

quartz light
scarlet spire
honest verge
#

It's mistral

#

It's the best I could do with mistral large 3

#

It can't do anything more

quartz light
verbal nimbus
#

It said it was ready like a gazillion times 🤣

honest verge
#

I thought opus 4.6 doesn't hallucinates

verbal nimbus
#

😑

quartz light
#

its not really accurate

#

its just summaries

verbal nimbus
honest verge
surreal zephyr
#

Codex 5.3 nailed it in 1 prompt

verbal nimbus
surreal zephyr
#

¯_(ツ)_/¯

#

4 prompts to fix all bugs it took

surreal zephyr
verbal nimbus
verbal nimbus
# surreal zephyr 5.2 thinking

Oh it has tools so doesn't really count. This test is to see if it can notice that the tool is missing instead of pretending to use it.

surreal zephyr
#

Funny how 5.2 is the only of sonnet and opus that did it correct

surreal zephyr
#

It ran script

#

And it said it doesnt have a tool for it

#

So it didnt hallucinate

verbal nimbus
surreal zephyr
#

Ya

#

Imma glaze gpt ngl

verbal nimbus
#

GPT doesn't hallucinate on that prompt, only Claude 4.5 and up

surreal zephyr
#

What does gpt fail at that claude doesnt?

north obsidian
verbal nimbus
verbal nimbus
north obsidian
#

The hallucinations

surreal zephyr
#

💀

#

😭

verbal nimbus
surreal zephyr
#

Gpt did it better

verbal nimbus
surreal zephyr
honest verge
surreal zephyr
#

In gpt i trust

surreal zephyr
verbal nimbus
topaz skiff
surreal zephyr
#

He ran a script

topaz skiff
#

show me

surreal zephyr
#

In app

surreal zephyr
topaz skiff
#

okay that makes sense, but this is function calling

#

so no

surreal zephyr
#

Brotato chip

#

💔

#

Without functions and with functions both, said that they dont have dice tools

topaz skiff
#

because this is awfuly simple task

surreal zephyr
#

The one with function ALSO ran a script

surreal zephyr
#

🥀

topaz skiff
#

Okay enough of AI slop for today, i don't want to hear much more at least in few months

honest verge
verbal nimbus
honest verge
surreal zephyr
#

Gemini is worse it js crashes

honest verge
#

So it's the same model

surreal zephyr
#

Iirc opus 4.6 runs on google tpus

#

xD

#

@verbal nimbus

surreal zephyr
#

I wanna check

verbal nimbus
#

Gemini 3 Pro fails on this one too

surreal zephyr
#

🔥 🔥 🔥

verbal nimbus
#

Just weaker models, newer Claude models and Gemini models

surreal zephyr
honest verge
surreal zephyr
#

@verbal nimbus 3.0 pro in app pased 💀🤔

verbal nimbus
verbal nimbus
surreal zephyr
verbal nimbus
#

I only came up with that prompt because I forgot to attach files a few times with G2.5 and it never told me

surreal zephyr
honest verge
surreal zephyr
#

But also g2.5 is is way worse than g3

honest verge
#

And opus is their grandfather

surreal zephyr
verbal nimbus
honest verge
#

WHAT

verbal nimbus
surreal zephyr
#

😭

verbal nimbus
honest verge
surreal zephyr
#

Even more

#

🥀

honest verge
#

Still can't understand why there's no flash lite Gemini in app or website

verbal nimbus
honest verge
#

Like it's very fast and cheap

surreal zephyr
verbal nimbus
surreal zephyr
#

But i need chat history

verbal nimbus
#

But if you thumb up/down with activity off it still sends your last 24 hours of chats, voice recordings and attachments

surreal zephyr
#

Al hail chatgpt

honest verge
surreal zephyr
#

Mistral is the most human ai

#

🥀

#

If i ever buy a clanka

#

I buy two

verbal nimbus
surreal zephyr
#

Mistral for crack

#

And gpt for trust

#

Geniuely

honest verge
surreal zephyr
#

Gpt the most trustworthy model

#

Lol

verbal nimbus
quartz light
#

devstral is tuff

honest verge
#

Opus 4.6 tank

verbal nimbus
honest verge
verbal nimbus
#

Was checking if it hallucinates any tools.

honest verge
#

Gemini 3 pro is made by anthropic

#

I'm sure

quartz light
verbal nimbus
#

Seems like a good test

shrewd citrus
#

ai should be able to

#

as long as you give it context

#

can’t just say fix it

tiny dove
#

The image uploader isn't working

quartz light
verbal nimbus
quartz light
verbal nimbus
#

Opus 4.6 Thinking found the bug:

#

I also made one modification before, not sure if that is actually required

#

Just search for groundRB definition and change arguments inside the function to 0, 0, 0

#

GPT identified the problem too, but it's fix seems larger, not sure why

#

Not familiar with this lib

quartz light
#

4.6 thinking worked

verbal nimbus
quartz light
#

i only got it to ever work a single time

verbal nimbus
#

I changed the CG function too, not sure if that's needed

quartz light
verbal nimbus
quartz light
#

its my favourite

verbal nimbus
fierce cove
#

Error report: Over the past two days, I've tested Claude-opus-4-6-thinking several times and have encountered errors multiple times

verbal nimbus
verbal nimbus
shrewd citrus
#

i haven’t seen another model think for so long

#

so it might be a bug like I don’t actually think Claude 4.6 is really meant to think for THAT long

verbal nimbus
#

I think it should count as a loss, that would incentivize providers to fix it

#

Otherwise it's kind of cheating by overthinking on problems it can't solve and forcing a forfeit (since the evaluation will be discarded)

loud verge
honest verge
#

Please 4.6 thinking 32k

#

I need this

quartz light
#

CHECK THIS OUT

honest verge
#

But looks good

quartz light
#

while moving

honest verge
#

Lol

inner relic
shrewd citrus
#

is there an api plugin

quartz light
#

wdym

honest verge
#

What if you make a team of mistral + opus 4.6

#

For coding

shrewd citrus
#

i mean on rooblox there’s actual ai npcs with communication

quartz light
#

why are yall talking about it

#

is there a new model

#

or what

inner relic
honest verge
quartz light
inner relic
honest verge
#

It's on par with Gemini 3 pro

quartz light
inner relic
#

I understand now

quartz light
#

is there a new model

honest verge
#

Today

quartz light
#

what

#

which

inner relic
quartz light
honest verge
quartz light
#

model name

#

model id

shrewd citrus
inner relic
quartz light
#

???

shrewd citrus
honest verge
quartz light
quartz light
honest verge
#

They will ban me

quartz light
#

???

quartz light
honest verge
#

I'm getting controlled

#

The curse of mistral

quartz light
shrewd citrus
quartz light
#

im going to

#

:(

#

guhhh

shrewd citrus
#

for 4.6 it just breaks during thinking

honest verge
shrewd citrus
#

unless they fixed it recently

quartz light
#

just tell me the model name

#

stop fkin aroun

honest verge
#

I CAN'T

#

THEY WILL BAN ME

quartz light
#

??

#

for what

honest verge
#

FOR TELLING THE SECRET

inner relic
shrewd citrus
honest verge
quartz light
#

🥀

honest verge
#

They gave me early access

#

To mistral 4

#

ALSO

#

I'm free now

proud bobcat
#

Pardon?

#

How is this in violation of the terms of use

#

I’m confused

#

Yeah I don’t see nothing in the terms of use that says this is in violation

#

@echo aurora Horribly sorry to ping if you’re busy but is this just a glitch?

quartz light
shrewd citrus
proud bobcat
#

Violence?

shrewd citrus
#

yeah maybe it doesn’t like violence

proud bobcat
#

Yeah maybe

#

I’ll try qwen or Flux

frigid wigeon
#

Hello

north patrol
soft verge
# verbal nimbus It keeps overthinking and crashing and I can't even vote

claude 4.6 seems to be a very large model, probably 10x of gemini 3. it's much better, but it's also much slower. lmarena timeouts after ~6 minutes. i hope they make it at least 8-10 minutes, which should be enough for claude to respond. i noticed it takes ~5m17s to think and then a couple more minutes to respond

honest verge
#

Is it true that qwen 3.5 is available?

quartz light
#

in battle mode

keen beacon
#

Hey guys, is anyone having a problem where, when communicating a lot with the Gemini 3 Pro, it gives the following message: "Something went wrong with this response, please try again." and then doesn't respond anymore, even after clicking the reload button and the Gemini 3 Flash going into infinite generation?

spare rune
undone geyser
#

Idea for future if possible: android version of arena.ai????

keen beacon
spare rune
#

Yes (if your using the direct mode)

wicked sage
#

bro forgot to show the entire image

#

still kinda mid tho

honest verge
#

Like you can do anything

#

But they should add opus

#

Then it's worth it

wicked sage
#

but the limit is probably high

wicked sage
#

even opus 4.5 is good

#

but opus 4.6 is better

honest verge
wicked sage
#

yeah its probably more

#

due to like

#

claude sonnet gpt blah blah blah

#

they are NOT kind with their input/output cost things

honest verge
#

Sonnet is very expensive

#

Alone

#

+gpt and Gemini

#

Ts can't be 83$ a month

wicked sage
#

wait what the fuh

#

kimi k2.5 has 1 trillion tokens on openrouter

honest verge
wicked sage
#

HOW are the servers still up

honest verge
#

IDK

#

It's going up

#

1.2 T already

#

What's going on

#

The servers are going to explode

wicked sage
#

IKNOW

wicked talon
#

Bruhh how has Claude ranked better then Gemini

#

That's unheard

wicked sage
#

simple

#

opus

honest verge
#

Gemini 3 pro is already outdated

#

At release it was the best model ever

#

Now it's not

#

Waiting for GA

wicked sage
#

GA?

#

what does that stand for

#

i think ive heard that once

wicked talon
#

Yeah

honest verge
#

I think

wicked sage
#

oh

honest verge
#

Rumors are saying it will be better than preview

#

Because 3 pro and flash still in preview

wicked talon
#

How is deepseek 31 😭

#

This is harsh

#

I have to tell the ai to switch to English 😭 😭

honest verge
wicked talon
honest verge
#

Now they can't really match big models

wicked talon
#

Is co-pilot just free chatgpt?

wicked sage
#

c*p*l*t

#

i dont like it cuz,,, microsoft

#

anyways uhh

wicked talon
#

Linux for life

honest verge
wicked sage
honest verge
#

Maybe GitHub copilot

wicked sage
#

i use linux also

wicked talon
#

Thank you 😉

honest verge
#

Not Microsoft

wicked sage
#

fair enough

wicked talon
#

I use Gemini for everything

#

And qwen sometimes

wicked sage
wicked talon
wicked talon
#

Qwen is probably spyware too tbh

#

Alibaba

wicked sage
#

qwen is peak tho
but yeah theres a high chance it can be spyware

honest verge
#

I don't know why

wicked sage
#

i can c why

wicked talon
wicked talon
#

Wait they force search on the app now lmao

wicked sage
wicked talon
#

I think the USA should study Microsoft

#

If qwen is speaking facts

#

They thought tiktok was spyware

honest verge
#

I don't know how Kimi k2.5 on openrouter is still active

wicked talon
wicked talon
#

It's too slow

wicked sage
#

to be fair it has like

#

1.2t tokens

#

as of rn

honest verge
#

It's 1.2 T tokens right now

wicked talon
#

Wtf

honest verge
#

Maybe it will explode soon

wicked talon
#

How tf has it got 1.2t tokens

honest verge
#

But I don't know the hype behind k2.5

#

Like it's not really the best

wicked talon
#

🙂

#

I like perpelxity

wicked sage
#

instead of kimi 2.5 just use uhhh

#

fuh-in grok or something

#

idk

wicked talon
wicked sage
#

grok 4.1 fast

wicked talon
#

Until it hit me with the "limit"

#

And im not paying £20 a month

wicked sage
#

i hate elon

wicked talon
#

I don't use chatgpt cuz of limits too

honest verge
#

But grok 4.20 is too late

wicked talon
honest verge
#

Like it was supposed to come out in January

#

But still nothing

wicked sage
#

i dont even know which ai is good for

#

coding random bullshii

#

i just use sonnet 4.5

#

im STILL waiting for an update 😡

honest verge
wicked sage
honest verge
wicked talon
#

Gemini says grok5 will come out by march 🙂

honest verge
#

With sonnet 5

#

Like everyone thought it will come out

#

But no

wicked sage
#

yo i can die happily when sonnet 4.6/5 releases

wicked talon
wicked sage
wicked talon
wicked talon
#

It's one of the only models to allow live speaking w camera

wicked talon
#

Oh

wicked sage
#

cuz again

#

it kept saying stuff from earlier

honest verge
#

Like pricing is very good

wicked talon
#

Yeah

#

What ai is the best to roast me

#

Probably grok

#

Unfiltered asf

wicked sage
wicked talon
#

I JUST NOTICED THAT FINGER

#

😭

left lodge
#

Grok is so wierd on arena.ai because it doesn't have system prompt which makes it somewhat less aggressive.

drifting crow
#

damn google gemini pro 3 is lit, compared to basic thinking model its night and day

wicked sage
#

hi back

surreal zephyr
wicked sage
#

🧛

wicked talon
surreal zephyr
#

And gpt 5.3 codex beaten opus at making tank physics

wicked talon
#

I just spent 2 hours with my good friend Gemini making a DNS server and failed

#

God knows how much water I used

wicked sage
#

infinite water

#

❤️‍🩹

wicked talon
wicked talon
surreal zephyr
wicked talon
surreal zephyr
wicked talon
keen beacon
#

Hey guys, are the devs going to fix Gemini Pro and Flash someday?

surreal zephyr
#

It was nerfed to ground few weeks ago

wicked sage
#

I SAID GEMINI HAS DEMENTIA LIKE

wicked talon
#

Bro it was giving me riddles

wicked sage
#

MINUTES AGO

wicked talon
#

I'm raging rn

#

I thought it installed a virus for a sec

surreal zephyr
surreal zephyr
wicked sage
#

it got NERFED as shii

surreal zephyr
#

Then /model and select gpt 5.2 high (noncodex)

#

It will do ts in first try

wicked talon
#

My ah don't wanna sit through another tutorial 😭

#

How do I install dat on Linux

wicked sage
#

npm install -g @openai/codex if npm is installed

#

brew install --cask codex if brew

wicked talon
#

Ok

#

If I fail I'm the worst Linux Dev ever

#

Well not even a dev cuz I use ai

#

But it is what it is

surreal zephyr
surreal zephyr
#

Opus js sucks in real usage

#

It hallucinates as much as gemini js hides it well

#

🤣

wicked sage
#

atp just use claude from its official source

#

🥱

surreal zephyr
#

Gpt is less creative but actually knows what its doing

wicked sage
#

Yo what if we just combine all ais into one ai

#

🥱 🥱 🥱

surreal zephyr
#

Gpt 5.2h vs opus 4.6

wicked sage
#

ill try gpt5.2h out

#

@wicked talon install npm

#

i forgot the command

surreal zephyr
#

🥀

#

Js ask antigravity to install npm like i did

wicked sage
#

sudo apt install npm i think

surreal zephyr
#

😭 🔥

wicked sage
#

i dont even know

wicked talon
#

I hate my life

surreal zephyr
#

Why delete

#

Also

#

Download npm

wicked talon
#

Kk

surreal zephyr
#

Npm i -g @/openai/codex

wicked sage
#

actually wait download node js

surreal zephyr
#

Iirc

wicked sage
#

wait no im

wicked talon
#

God why is it taking so slow

surreal zephyr
#

💔

wicked sage
#

is it 2 bytes per hour

wicked talon
surreal zephyr
#

💔

wicked talon
#

Badd WiFi my ahh

wicked talon
wicked sage
#

god DAMN

#

twin what router service are you using

#

❤️‍🩹

surreal zephyr
wicked talon
wicked sage
#

ah yeah makes sense

surreal zephyr
wicked talon
surreal zephyr
#

5g on pc is crazy

wicked sage
#

bluetooth connected

surreal zephyr
#

Codex my beloved ❤️‍🩹

#

Best

#

Gpt on top

wicked talon
surreal zephyr
#

I glaze gpt

honest verge
#

It's too good

surreal zephyr
#

Gpt actually tells people if they want bs instead of hallucinating answers

honest verge
#

For every task

surreal zephyr
#

Of gpt

wicked sage
#

i have NEVER used mistral

wicked talon
#

Me neither

honest verge
surreal zephyr
honest verge
#

It's the best ai ever

wicked talon
wicked sage
#

oh my god mistral is french

surreal zephyr
#

If gemini is crack abuser, then mistral is?

surreal zephyr
#

Mistral is like a monkey

#

Lowk

wicked talon
#

Gng npm installed now what

honest verge
wicked sage
honest verge
#

More

wicked sage
#

npm install -g @openai/codex

surreal zephyr
#

Mistral would install codex without npm 💔 ✌️

honest verge
wicked sage
surreal zephyr
wicked talon
#

Tf you mean permission denied

surreal zephyr
honest verge
wicked sage
#

actually

#

wait no

#

i thought of sudo codex i think that can help

#

im not sure

honest verge
surreal zephyr
#

Js

#

Start terminal

#

As admin

honest verge
#

It's going to work

surreal zephyr
#

💔

wicked talon
#

Oh wait I got denied

honest verge
#

Why not through mistral

#

It can install everything

wicked talon
#

How does my ahh run as administrator on Linux

wicked sage
#

Hey claude
Generate.

wicked sage
wicked talon
#

Sudo wat

wicked sage
#

sudo codex

#

i hope

#

just try it
it might work

wicked talon
surreal zephyr
#

Sudo npm install....

#

Or js install antigravity then ask it to install codex

wicked talon
#

Npm is installed

surreal zephyr
#

💔

wicked sage
#

wait let me try out

wicked talon
surreal zephyr
#

💔

honest verge
#

PLS

#

IT'S GOING TO WORK

wicked sage
#

💔

surreal zephyr
wicked sage
#

wait no im stupid

#

sudo npm install -g @openai/codex

honest verge
wicked sage
#

if it says some bullsh like permission denied do sudo

surreal zephyr
#

🥀

honest verge
#

What thedot2

wicked sage
#

son im crine who is using dot2

#

nvm pineapple is using it

honest verge
wicked sage
#

alright blud

wicked talon
#

Now what

wicked sage
#

npm install -g @openai/codex

#

or sudo npm install -g @openai/codex

wicked talon
#

I did

wicked sage
#

codex

#

i mean

#

do codex

wicked talon
#

I did

wicked sage
#

what happebned

wicked talon
#

"sudo npm install -g @raven heart/codex

wicked sage
#

LMAO

wicked talon
#

It installed a package

wicked talon
wicked sage
#

ok my bad

surreal zephyr
#

Bro

#

Write "codex"

#

To start it

wicked sage
honest verge
#

I asked Gemini to create an image of what mistral can do is it accurate?

wicked sage
wicked talon
wicked talon
#

It's working 🙂

wicked sage
#

lets go

wicked talon
#

Less goo

wicked sage
#

ok you did it

surreal zephyr
#

To gpt 5.2 high

#

(Not codex)

wicked sage
#

uhh

#

model

surreal zephyr
#

/model

wicked sage
#

actually no im

#

stupid

#

yeah ty

wicked talon
surreal zephyr
wicked sage
surreal zephyr
#

Then reasoning high

#

Not xhigh

#

High better overall

wicked talon
wicked sage
surreal zephyr
wicked sage
#

not extra high

wicked talon
#

Kk bet

surreal zephyr
#

Extra wastes tokens and gets compressed during work

#

So extra is worse than high while being slower

wicked talon
#

Time to setup a DNS server

honest verge
#

Why complain about xhigh when you can install mistral

#

Like

#

Why

surreal zephyr
wicked talon
honest verge
#

I'm gay

surreal zephyr
honest verge
#

With mistral

surreal zephyr
#

I run mistral locally on an usb stick 🗣️

wicked sage
surreal zephyr
honest verge
#

Because mistral is so strong

wicked sage
#

ok lets check the lmarena leaderboards then

honest verge
wicked sage
#

no its like top 70 something

surreal zephyr
#

Leaderboard

surreal zephyr
#

(The lower the better)

wicked sage
surreal zephyr
#

The newer ones

#

Like 5.2 and 5.3c

#

Thats why gpt best

wicked talon
#

I'm gonna ask chatgpt to run minstrel locally

honest verge
#

Mistral is better than opus 4

#

That's why he's the goat

wicked sage
#

its not opus 4.1 tho

wicked talon
surreal zephyr
wicked talon
wicked sage
surreal zephyr
#

5.3 Codex did first try

honest verge
#

He's the goat

surreal zephyr
honest verge
surreal zephyr
#

Opus made a pretty tank that wasnt working

honest verge
#

Please

#

What the hell

#

Why 3.2 exp is better than release 3.2?

#

🥀

surreal zephyr
wicked sage
#

-41 is crazy btw

surreal zephyr
honest verge
surreal zephyr
#

Gemini is rn at maybe -30

#

Gpt 5.2 and 5.3c are at top of that lb rn

wicked sage
#

ye makes sense

#

btw what plan do you have to get for uhh 5.3c

surreal zephyr
#

(Not xhigh, xhigh sucks)

surreal zephyr
wicked sage
#

ah ok

surreal zephyr
#

5.2 high is more creative

honest verge
wicked sage
surreal zephyr
#

5.3c is the professional soft engineer

honest verge
#

Why you can't use mistral

#

Like Is it too hard?

surreal zephyr
# wicked sage

Sonnet 4 is more trustworthy than opus 4.5 and 4.6 rn btw

surreal zephyr
wicked sage
#

why do some companies like anthropic and

#

google

wicked talon
#

I quit codex

#

I just deleted it

wicked sage
#

try to lobotimize the ais and make them say random stuff

wicked talon
#

Time to go to minstrel

surreal zephyr
#

🥀

surreal zephyr
#

🥀

#

Opus still costs a lot to run cuz it was slightly lobotomized

#

Gemini was made cheap and lobotomized to ground...

honest verge
surreal zephyr
#

Anthropic and google basically brute forcing benchmarks

#

🥀

honest verge
#

Then who mistral is

wicked talon
#

Qwen for life 🙂

surreal zephyr
#

I had gemini run for half hour in ag after a fix bug prompt and it succeded

#

(Gpt found in 30s)

wicked talon
#

Lmao

honest verge
#

I need my dear mistral

#

I love him

wicked talon
#

Qwen for life babyyy

wicked sage
#

@mistral Hello.

honest verge
#

@shy jay

surreal zephyr
#

Pre nerf gemini was better than opus btw

honest verge
#

We need you

#

Pls

#

I'm dead

#

But at least I summoned him

zinc oyster
#

Hello

wicked talon
#

Wait qwen has it's own discord server

honest verge
surreal zephyr
#

Tbh

#

Openai is the only one

zinc oyster
#

Does everyone have this bug where there's a captcha that's impossible to pass,just error?

surreal zephyr
#

That knows how to make llms

honest verge
wicked talon
honest verge
#

Mistral is better

surreal zephyr
#

🔥

honest verge
#

I'm glazing my mistral so much I'm tired of saying his name 🥀

#

🥀 🥀

wicked sage
#

bro gained self awareness

wicked talon
honest verge
wicked talon
#

If your wrong he says it

honest verge
wicked talon
#

Qwen coder casually having 1.04m tokens

wicked talon
wicked talon
surreal zephyr
#

🥀

wicked talon
wicked talon
surreal zephyr
#

In gpt we trust

surreal zephyr
wicked talon
#

And only 4000 tokens output

surreal zephyr
#

Gpt works solid for 200k i tested so far

honest verge
#

Or crack

surreal zephyr
#

With gpt we develop

#

With gemini we get older and have dementia

wicked talon
honest verge
wicked talon
#

Minstral is same amount

left lodge
honest verge
honest verge
#

Because of the knowledge cutoff

#

Btw what cutoff opus 4.6 has?

left lodge
#

You can search yourself using a search engine,
I suggest brave search

wicked talon
#

It got context right

#

Idk about everything else

wicked talon
#

Trust

left lodge
#

🤦

wicked talon
#

Use duckduckgo

left lodge
#

Its a damn search engine

wicked talon
left lodge
#

Do you even know what does honeypot mean? How do you think it is a honeypot and duckduckgo isn't?

honest verge
wicked talon
left lodge
surreal zephyr
#

Also 3.0 flash is correct

#

It says it doesnt know

wicked talon
#

And it was found brave was leaking DNS queries

honest verge
wicked talon
left lodge
#

But still the most reliable results are with search tool enabled

honest verge
#

"Grook"

left lodge
#

Relying on its training data or hoping it knows it because the info might be inside its system prompt is just weird

surreal zephyr
#

Bro is mistral

left lodge
#

You dont have access to its system prompts even on their native platforms, thats not reliable

signal apex
#

guys mine has been like this for almost 20 minutes, does it take it that long to response 😭?

wicked sage
#

of course itdoes that

surreal zephyr
left lodge
#

There is hard 6 min limit, if thats reached when a model is generating a response it cuts off the response and throws Something went wrong with this response, please try again. error

I reported this way back on November of 2025 but they haven't done anything.

Models are literally made to think for hours and here they have a hard 6 min cutoff 🤦

And btw this 6 minute limit is on every single model available not only opus 4.6.

surreal zephyr
#

It thinks forever untill it hallucinates correctly

#

Thats how opuses work

left lodge
signal apex
#

so i just wait?

surreal zephyr
#

Opus works by having ton of internal thinking

#

The only reason why it does so well on benchs

left lodge
surreal zephyr
#

Opus is literally gemini 3 xxxxhigh

left lodge
wicked talon
#

Qwen is Gemini 3 pro but on steroids

#

🙂

honest verge
#

Paperbanana

left lodge
#

Is that official?

#

Source links?

surreal zephyr
#

But notebooklm uses similiar thing

honest verge
wicked talon
#

🙂

surreal zephyr
honest verge
#

Imagine paperbanana

#

This name sucks

#

Like banana from paper

wicked talon
#

It's Google what do you expect

left lodge
#

Its just a side research project developed by Google Research in collaboration with Peking University

#

Not much

surreal zephyr
gusty helm
#

hey! If im not mistaken claude should have a thinking version too on the 4.6? Is it not coming to arena or is it not ranked yet/code named/collecting votes?

surreal zephyr
#

Its so bad it didnt get on lb

#

Lol

honest verge
gusty helm
#

oh rly harold ? didnt see that

surreal zephyr
#

It gets stuck in loop everytime and crashes

#

Lol

honest verge
#

It can't really make big projects

surreal zephyr
#

Its unusable

honest verge
#

Only small

#

Normal 4.6 is way better

gusty helm
#

that's an odd ball; yeah I see can use it in direct chat

#

but not included in leaderboard at all lol

surreal zephyr
honest verge
surreal zephyr
#

It will get stuck and crash

#

90% times

#

🥀

honest verge
#

It just can't do anything

surreal zephyr
#

Worse than mistral atp

#

Mistral did the tank

#

Opus crashed

#

Mistral wins

#

🥀

gusty helm
#

I see; prob needs some more work on it before it's usuable

surreal zephyr
#

Or gpt 5.3 codex or gpt 5.2 high both are better

#

😔

gusty helm
#

using neither 😄 was just curios came from a trip and saw 4.6 #1

honest verge
surreal zephyr
gusty helm
#

did not expect google to lose R1 anytime soon

surreal zephyr
cyan kettle
#

what do yall think is the best model right now?

surreal zephyr
#

Easily

#

No competition

#

Lol

honest verge
gusty helm
#

really depends for what