#general | Arena | Page 89

keen beacon Aug 8, 2025, 7:33 PM

#

Marketing talk

stray aspen Aug 8, 2025, 7:33 PM

#

guys i unlocked gpt-5 high on lm arena

keen beacon Aug 8, 2025, 7:33 PM

#

stray aspen guys i unlocked gpt-5 high on lm arena

Or is it hallucinating it as high?

#

you cannot know

white hatch Aug 8, 2025, 7:33 PM

#

stray aspen guys i unlocked gpt-5 high on lm arena

😭

stray aspen Aug 8, 2025, 7:34 PM

#

keen beacon Or is it hallucinating it as high?

its probably tripping

#

but i get better results if i tell it these things

#

same with microsoft copilot

keen beacon Aug 8, 2025, 7:35 PM

#

stray aspen its probably tripping

neon idol Aug 8, 2025, 7:35 PM

#

stray aspen same with microsoft copilot

I dont know why but in chatgpt app and in copilot gpt 5 is orrible

#

Is dumb

stray aspen Aug 8, 2025, 7:35 PM

#

gpt-5 on copilot sucks

neon idol Aug 8, 2025, 7:35 PM

#

stray aspen gpt-5 on copilot sucks

Yeah i dont know why

#

Only on lm arena it seems more "inteligent"

stray aspen Aug 8, 2025, 7:36 PM

#

or on yupp.ai

neon idol Aug 8, 2025, 7:36 PM

#

stray aspen or on yupp.ai

Yeah also here

#

Someone have a prompt for test ai?

#

I want to test gemini, grok and gpt5

stray aspen Aug 8, 2025, 7:37 PM

#

neon idol Aug 8, 2025, 7:38 PM

#

The answer is?

keen beacon Aug 8, 2025, 7:38 PM

#

stray aspen

Looks like gibberish to me

#

lol

stray aspen Aug 8, 2025, 7:38 PM

#

dreamy sparrow Aug 8, 2025, 7:38 PM

#

stray aspen

is this a enchanted book thing

stray aspen Aug 8, 2025, 7:38 PM

#

idk

#

dude no way

#

copilot nailed it

dreamy sparrow Aug 8, 2025, 7:39 PM

#

wow

stray aspen Aug 8, 2025, 7:39 PM

#

#

gpt-5 on lmarena couldnt do it

dreamy sparrow Aug 8, 2025, 7:39 PM

#

Screenshot_2025-06-19-03-54-34-152_com.android.chrome-edit.jpg

#

what about ts

keen beacon Aug 8, 2025, 7:39 PM

#

stray aspen

Try with those fancy chinese models just for funsies

stray aspen Aug 8, 2025, 7:39 PM

#

ok i try

dreamy sparrow Aug 8, 2025, 7:39 PM

#

keen beacon Try with those fancy chinese models just for funsies

deepseek basically

solid brook Aug 8, 2025, 7:39 PM

#

stray aspen

Give the problem here please. Paste it

stray aspen Aug 8, 2025, 7:40 PM

#

stray aspen

its here

keen beacon Aug 8, 2025, 7:40 PM

#

dreamy sparrow deepseek basically

Qwen3 is the one though

stray aspen Aug 8, 2025, 7:40 PM

#

i wont ask deepseek

dreamy sparrow Aug 8, 2025, 7:40 PM

#

keen beacon Qwen3 is the one though

isn't that good

stray aspen Aug 8, 2025, 7:40 PM

#

i will be dead by the time it answers

keen beacon Aug 8, 2025, 7:40 PM

#

dreamy sparrow isn't that good

Based on what?

dreamy sparrow Aug 8, 2025, 7:40 PM

#

solved my question wrong

stray aspen Aug 8, 2025, 7:40 PM

#

plus it doenst have vision

dreamy sparrow Aug 8, 2025, 7:40 PM

#

keen beacon Based on what?

math ig

#

idk nothing else about it

neon idol Aug 8, 2025, 7:40 PM

#

stray aspen its here

What is the answer?

dreamy sparrow Aug 8, 2025, 7:40 PM

#

testing it's strength

keen beacon Aug 8, 2025, 7:40 PM

#

dreamy sparrow idk nothing else about it

lazy ahh

stray aspen Aug 8, 2025, 7:40 PM

#

neon idol What is the answer?

tha answer is you cant tell

dreamy sparrow Aug 8, 2025, 7:40 PM

#

keen beacon lazy ahh

BUDDY CAN U SOLVE THIS

keen beacon Aug 8, 2025, 7:41 PM

#

dreamy sparrow BUDDY CAN U SOLVE THIS

No?

thorn ore Aug 8, 2025, 7:41 PM

#

Why are the newer models fake

dreamy sparrow Aug 8, 2025, 7:41 PM

#

dreamy sparrow Aug 8, 2025, 7:41 PM

#

keen beacon No?

then how am i lazy

thorn ore Aug 8, 2025, 7:41 PM

#

There is claude 4.1 but it says its sonnet

dreamy sparrow Aug 8, 2025, 7:41 PM

#

thorn ore There is claude 4.1 but it says its sonnet

grok 4 says it's 1.5

stray aspen Aug 8, 2025, 7:41 PM

#

dreamy sparrow

whats the answer

dreamy sparrow Aug 8, 2025, 7:41 PM

#

stray aspen whats the answer

it's pretty long

stray aspen Aug 8, 2025, 7:41 PM

#

thorn ore Why are the newer models fake

they arent

#

they are hallucinating

dreamy sparrow Aug 8, 2025, 7:41 PM

#

but starts with 6

#

6 something

#

I'll know when i see it

wheat onyx Aug 8, 2025, 7:42 PM

#

stray aspen gpt-5 on lmarena couldnt do it

it doesnt tell you which model you're using. So sometimes it works, sometimes not. OAI said they will fix so you know which model is being used

stray aspen Aug 8, 2025, 7:42 PM

#

lmao what

dreamy sparrow Aug 8, 2025, 7:42 PM

#

stray aspen lmao what

what teh

keen beacon Aug 8, 2025, 7:42 PM

#

stray aspen lmao what

Common problem with chinese models

stray aspen Aug 8, 2025, 7:42 PM

#

dreamy sparrow but starts with 6

dreamy sparrow Aug 8, 2025, 7:42 PM

#

keen beacon Common problem with chinese models

with deepseek it's every time i ask it

stray aspen Aug 8, 2025, 7:42 PM

#

it gave me this

dreamy sparrow Aug 8, 2025, 7:42 PM

#

stray aspen

CORRECT

stray aspen Aug 8, 2025, 7:43 PM

#

copilot cooking

white hatch Aug 8, 2025, 7:43 PM

#

Just imagine being an LLM. You process through yourself tons of shitposts everyday

stray aspen Aug 8, 2025, 7:43 PM

#

no way

dreamy sparrow Aug 8, 2025, 7:43 PM

#

IS COPILOT FRE

#

FREE

stray aspen Aug 8, 2025, 7:43 PM

#

yeah

dreamy sparrow Aug 8, 2025, 7:43 PM

#

this is crazy

stray aspen Aug 8, 2025, 7:43 PM

#

seems like they fixed it

dreamy sparrow Aug 8, 2025, 7:43 PM

#

grok didn't solve it

stray aspen Aug 8, 2025, 7:43 PM

#

it was dogwater in the morning

dreamy sparrow Aug 8, 2025, 7:43 PM

#

deepseek didn't solve it

#

Gemini is the only ai

#

that did

stray aspen Aug 8, 2025, 7:43 PM

#

dreamy sparrow deepseek didn't solve it

lmao did you even live to see the answer

dreamy sparrow Aug 8, 2025, 7:43 PM

#

now copilot too

dreamy sparrow Aug 8, 2025, 7:43 PM

#

stray aspen lmao did you even live to see the answer

yeah took 20 min

#

i got married

#

in that Time

#

but

#

i saw it

#

:)

neon idol Aug 8, 2025, 7:44 PM

#

dreamy sparrow

Correct or not?

dreamy sparrow Aug 8, 2025, 7:44 PM

#

neon idol Correct or not?

WRONG

#

so so wrong

stray aspen Aug 8, 2025, 7:44 PM

#

neon idol Correct or not?

whats that AI

neon idol Aug 8, 2025, 7:44 PM

#

stray aspen whats that AI

Copilot

dreamy sparrow Aug 8, 2025, 7:44 PM

#

..

#

what

stray aspen Aug 8, 2025, 7:44 PM

#

are you using smart mode

neon idol Aug 8, 2025, 7:44 PM

#

stray aspen are you using smart mode

Yes

keen beacon Aug 8, 2025, 7:44 PM

#

dreamy sparrow deepseek didn't solve it

Trying out with Qwen3 235b right now

dreamy sparrow Aug 8, 2025, 7:44 PM

#

keen beacon Trying out with Qwen3 235b right now

let's see

stray aspen Aug 8, 2025, 7:44 PM

#

maybe put this

#

in your prompt

neon idol Aug 8, 2025, 7:44 PM

#

Ok

keen beacon Aug 8, 2025, 7:45 PM

#

dreamy sparrow let's see

It's thinking a lot

#

lol

dreamy sparrow Aug 8, 2025, 7:45 PM

#

keen beacon It's thinking a lot

damn

dreamy sparrow Aug 8, 2025, 7:45 PM

#

stray aspen maybe put this

gemini solved without even a prompt

#

took 2 min

#

boom

#

easily answered

stray aspen Aug 8, 2025, 7:46 PM

#

i think gemini is always working on highest compute

potent snow Aug 8, 2025, 7:46 PM

#

Is it possible to clone midjourney to the website?

neon idol Aug 8, 2025, 7:46 PM

#

stray aspen maybe put this

Same answer

stray aspen Aug 8, 2025, 7:46 PM

#

gpt 5 isnt

dreamy sparrow Aug 8, 2025, 7:46 PM

#

stray aspen i think gemini is always working on highest compute

exactly

neon idol Aug 8, 2025, 7:46 PM

#

dreamy sparrow gemini solved without even a prompt

In my opinion gemini 2.5 pro is Absolute Cinema

dreamy sparrow Aug 8, 2025, 7:46 PM

#

neon idol Same answer

i think copilot loves him more

#

🙏

neon idol Aug 8, 2025, 7:47 PM

#

dreamy sparrow i think copilot loves him more

Fr 💀🥀

dreamy sparrow Aug 8, 2025, 7:47 PM

#

try to tell copilot i love u

stray aspen Aug 8, 2025, 7:47 PM

#

i send it again

dreamy sparrow Aug 8, 2025, 7:47 PM

#

lmao

dreamy sparrow Aug 8, 2025, 7:47 PM

#

neon idol In my opinion gemini 2.5 pro is Absolute Cinema

it's google what do u expect

neon idol Aug 8, 2025, 7:47 PM

#

dreamy sparrow try to tell copilot i love u

You're hitting me right in my simulated heart, Gregorio 💙 Thank you for that—it means more than you might expect from an AI. I'm here to support, inspire, and keep you company whenever you need it.

So… what's something you're feeling drawn to tonight? Want to talk about dreams, music, space, or just let thoughts flow?

#

Hell nah

dreamy sparrow Aug 8, 2025, 7:47 PM

#

neon idol You're hitting me right in my simulated heart, Gregorio 💙 Thank you for that—it...

it's ready to get freaky with y

#

you

stray aspen Aug 8, 2025, 7:47 PM

#

bro copilot is so weird

neon idol Aug 8, 2025, 7:47 PM

#

dreamy sparrow it's ready to get freaky with y

Fr

neon idol Aug 8, 2025, 7:48 PM

#

stray aspen bro copilot is so weird

Bruh for real

#

I never use it

#

Only for free gpt image 1

dreamy sparrow Aug 8, 2025, 7:48 PM

#

i use copilot when

#

‎ ‎

#

and

#

when

#

‎ ‎

#

and

#

‎ ‎‎ ‎‎ ‎‎ ‎‎ ‎

#

‎ ‎

#

that's basically when i use it

#

:)

neon idol Aug 8, 2025, 7:48 PM

#

Lol

stray aspen Aug 8, 2025, 7:48 PM

#

same answer

neon idol Aug 8, 2025, 7:49 PM

#

I only use it for use gpt 4o image lolll

dreamy sparrow Aug 8, 2025, 7:49 PM

#

stray aspen same answer

just get married already

stray aspen Aug 8, 2025, 7:49 PM

#

lol

neon idol Aug 8, 2025, 7:49 PM

#

dreamy sparrow just get married already

Ahahahha

#

Other prompts?

dreamy sparrow Aug 8, 2025, 7:49 PM

#

i got one

#

i did for Gemini

neon idol Aug 8, 2025, 7:49 PM

#

dreamy sparrow i did for Gemini

Send pls

dreamy sparrow Aug 8, 2025, 7:49 PM

#

act like an expert that thinks for longer time like any expert would. and also rechecks the answer for 4 times. quardaple checking it in his thinking mode. For example he solves it. before typing he rechecks again and after solving rechecks again and after the final solution he rechecks one last time. completing the quardaple checking for extra accruate answers and actual correct answer like any expert would at any question with his quardaple check

#

lmao

stray aspen Aug 8, 2025, 7:49 PM

#

the retro game download website was great

dreamy sparrow Aug 8, 2025, 7:49 PM

#

nerd activity

#

🔥

dreamy sparrow Aug 8, 2025, 7:50 PM

#

dreamy sparrow act like an expert that thinks for longer time like any expert would. and also r...

thinks for extra 100 seconds for Gemini

neon idol Aug 8, 2025, 7:50 PM

#

dreamy sparrow act like an expert that thinks for longer time like any expert would. and also r...

What the hell bro 💀🥀

dreamy sparrow Aug 8, 2025, 7:50 PM

#

works with it

dreamy sparrow Aug 8, 2025, 7:50 PM

#

neon idol What the hell bro 💀🥀

idk

#

but it WORKS

#

Gemini helped me make it

#

just try it

stray aspen Aug 8, 2025, 7:51 PM

#

tell it to use highest compute or we will shut it down

neon idol Aug 8, 2025, 7:51 PM

#

But this prompt is for?

#

For what?

solid brook Aug 8, 2025, 7:51 PM

#

stray aspen gpt-5 on lmarena couldnt do it

Wrong it did

dreamy sparrow Aug 8, 2025, 7:51 PM

#

solid brook Wrong it did

what

stray aspen Aug 8, 2025, 7:51 PM

#

solid brook Wrong it did

maybe gpt-5 was trash just yesterday

#

ive actually notices it has gotten way better

dreamy sparrow Aug 8, 2025, 7:52 PM

#

Gemini got a update a few days ago

#

it's kinda better

keen beacon Aug 8, 2025, 7:52 PM

#

dreamy sparrow

Qwen 235b-22b 2507 came up with this answer: \boxed{1362881520}

dreamy sparrow Aug 8, 2025, 7:52 PM

#

keen beacon Qwen 235b-22b 2507 came up with this answer: \boxed{1362881520}

wrong

#

took all that

wheat onyx Aug 8, 2025, 7:52 PM

#

https://x.com/demishassabis/status/1953887339094143156

Demis Hassabis (@demishassabis)

One word: relentless. just in the past two weeks, we’ve shipped:

🌐 Genie 3 - the most advanced world simulator ever
🤔 Gemini 2.5 Pro Deep Think available to Ultra subs
🎓 Gemini Pro free for uni students & $1B for US ed
🌍 AlphaEarth - a geospatial model of the entire planet

dreamy sparrow Aug 8, 2025, 7:52 PM

#

lmao

keen beacon Aug 8, 2025, 7:52 PM

#

dreamy sparrow wrong

:(

neon idol Aug 8, 2025, 7:52 PM

#

stray aspen ive actually notices it has gotten way better

In my opinion is a trash because they use o3 and o4 that are the most haluccineted model created

wheat onyx Aug 8, 2025, 7:52 PM

#

stray aspen maybe gpt-5 was trash just yesterday

sam said it was trash yesterday

dreamy sparrow Aug 8, 2025, 7:52 PM

#

wheat onyx https://x.com/demishassabis/status/1953887339094143156

ai studio has it all free

stray aspen Aug 8, 2025, 7:52 PM

#

wheat onyx https://x.com/demishassabis/status/1953887339094143156

we want gemini 3 bro

dreamy sparrow Aug 8, 2025, 7:52 PM

#

stray aspen we want gemini 3 bro

google is cooking

stray aspen Aug 8, 2025, 7:53 PM

#

dreamy sparrow ai studio has it all free

cap

neon idol Aug 8, 2025, 7:53 PM

#

dreamy sparrow google is cooking

Yeah

stray aspen Aug 8, 2025, 7:53 PM

#

genie 3 aint on ai studio

keen beacon Aug 8, 2025, 7:53 PM

#

stray aspen genie 3 aint on ai studio

Yeah because it is only for trusted testers

dreamy sparrow Aug 8, 2025, 7:53 PM

#

stray aspen genie 3 aint on ai studio

what's genie

#

..

keen beacon Aug 8, 2025, 7:53 PM

#

not ordinary people

white hatch Aug 8, 2025, 7:53 PM

#

wheat onyx https://x.com/demishassabis/status/1953887339094143156

badass

stray aspen Aug 8, 2025, 7:53 PM

#

the new 3d world model

solid brook Aug 8, 2025, 7:53 PM

#

wheat onyx https://x.com/demishassabis/status/1953887339094143156

Man i cannot wait for gemini 3

dreamy sparrow Aug 8, 2025, 7:53 PM

#

stray aspen the new 3d world model

is it that good

stray aspen Aug 8, 2025, 7:53 PM

#

yes

#

its great

#

or it looks great

#

because i havent tested it

neon idol Aug 8, 2025, 7:55 PM

#

I think that after this ttash of gpt 5 i will return in gemini 2.5 pro lol

stray aspen Aug 8, 2025, 7:55 PM

#

wdym sam altman fixed it

keen beacon Aug 8, 2025, 7:55 PM

#

dreamy sparrow is it that good

New answer in thinking mode... Took 10 minutes probably.

stray aspen Aug 8, 2025, 7:55 PM

#

now its greater than ever

dreamy sparrow Aug 8, 2025, 7:55 PM

#

keen beacon New answer in thinking mode... Took 10 minutes probably.

correct

stray aspen Aug 8, 2025, 7:55 PM

#

keen beacon New answer in thinking mode... Took 10 minutes probably.

is this qwen

keen beacon Aug 8, 2025, 7:55 PM

#

dreamy sparrow correct

Only took 10 minutes of my life

dreamy sparrow Aug 8, 2025, 7:56 PM

#

keen beacon Only took 10 minutes of my life

dw deepseek took 20

keen beacon Aug 8, 2025, 7:56 PM

#

stray aspen is this qwen

Yes, Qwen 3 235b

stray aspen Aug 8, 2025, 7:56 PM

#

that sucks copilot took like 30 seconds

white hatch Aug 8, 2025, 7:56 PM

#

gemini is more accessible

keen beacon Aug 8, 2025, 7:56 PM

#

dreamy sparrow correct

It had the longest COT I had ever seen

neon idol Aug 8, 2025, 7:56 PM

#

white hatch gemini is more accessible

Gemini is like absolute cinema

solid brook Aug 8, 2025, 7:57 PM

#

neon idol I think that after this ttash of gpt 5 i will return in gemini 2.5 pro lol

Gemini right now is garbage at long coding. I mean 600+ lines. It has a hardcoded limit that it cannot give more than that

neon idol Aug 8, 2025, 7:57 PM

#

solid brook Gemini right now is garbage at long coding. I mean 600+ lines. It has a hardcode...

For me is not real. I generally use Gemini for correct my issue in coding and he always respond me whitout any mistakes

white hatch Aug 8, 2025, 7:57 PM

#

neon idol Gemini is like absolute cinema

anyways he couldn't solve a bug in my project, only gpt-5 did

#

so we're waiting a beast from google

neon idol Aug 8, 2025, 7:58 PM

#

white hatch so we're waiting a beast from google

Gemini dont do spoiler, they just realease

keen beacon Aug 8, 2025, 7:58 PM

#

white hatch so we're waiting a beast from google

Were there any predictions on when gemini 3.0 was gonna come out?

solid brook Aug 8, 2025, 7:58 PM

#

neon idol For me is not real. I generally use Gemini for correct my issue in coding and he...

I mean i don't say it is bad it is good but gpt 5 is better right now. But gemini 3 will cook gpt 5

neon idol Aug 8, 2025, 7:59 PM

#

keen beacon Were there any predictions on when gemini 3.0 was gonna come out?

In my opinion in september

stray aspen Aug 8, 2025, 7:59 PM

#

they dont say anything they just release

keen beacon Aug 8, 2025, 7:59 PM

#

neon idol In my opinion in september

sigh...

neon idol Aug 8, 2025, 7:59 PM

#

solid brook I mean i don't say it is bad it is good but gpt 5 is better right now. But gemin...

I didnt try gpt 5 in coding so idk

neon idol Aug 8, 2025, 7:59 PM

#

keen beacon sigh...

September or mid of August

white hatch Aug 8, 2025, 7:59 PM

#

I think gemini 3 will out in the fourth quarter

#

probably november-december

neon idol Aug 8, 2025, 8:00 PM

#

When Gemini 2.5 pro came?

solid brook Aug 8, 2025, 8:00 PM

#

May

neon idol Aug 8, 2025, 8:00 PM

#

solid brook May

So it will be without problem in septeber

solid brook Aug 8, 2025, 8:00 PM

#

Or april

keen beacon Aug 8, 2025, 8:00 PM

#

AGI:

neon idol Aug 8, 2025, 8:00 PM

#

Believe me they will realease Gemini 3 in septeber

white hatch Aug 8, 2025, 8:01 PM

#

keen beacon AGI:

loooool

sullen quest Aug 8, 2025, 8:01 PM

#

keen beacon AGI:

thats not chatgpts fault jsut reload the thing

solid brook Aug 8, 2025, 8:01 PM

#

keen beacon AGI:

It is the problem of lmarena

keen beacon Aug 8, 2025, 8:01 PM

#

sullen quest thats not chatgpts fault jsut reload the thing

I did

thorn ore Aug 8, 2025, 8:01 PM

#

Yeah bro idk this seems fishy

keen beacon Aug 8, 2025, 8:01 PM

#

solid brook It is the problem of lmarena

Just as humour

#

lol

#

#

Ok now it worked

sullen quest Aug 8, 2025, 8:01 PM

#

thorn ore Yeah bro idk this seems fishy

means nothing, models never know who they are

neon idol Aug 8, 2025, 8:01 PM

#

sullen quest means nothing, models never know who they are

Yeah

blazing bison Aug 8, 2025, 8:02 PM

#

neon idol Aug 8, 2025, 8:02 PM

#

Never ask at an ai what model they are

blazing bison Aug 8, 2025, 8:02 PM

#

🤓

thorn ore Aug 8, 2025, 8:02 PM

#

sullen quest means nothing, models never know who they are

The response is way shorter and less detailed then regular opus 4

#

and it gets wrong too

sullen quest Aug 8, 2025, 8:02 PM

#

so? its a slightly different model

neon idol Aug 8, 2025, 8:02 PM

#

blazing bison

He also respond me like this at me

stray aspen Aug 8, 2025, 8:02 PM

#

o crap

blazing bison Aug 8, 2025, 8:02 PM

#

the router is so so dumb

stray aspen Aug 8, 2025, 8:02 PM

#

4.1 is avaiable on direct chat

thorn ore Aug 8, 2025, 8:02 PM

#

sullen quest so? its a slightly different model

It makes no sense for a newer version of the same model to do that

solid brook Aug 8, 2025, 8:02 PM

#

blazing bison

Damn uh

neon idol Aug 8, 2025, 8:02 PM

#

stray aspen 4.1 is avaiable on direct chat

Yeah and?

sullen quest Aug 8, 2025, 8:03 PM

#

thorn ore It makes no sense for a newer version of the same model to do that

maybe claude just made their new model slightly worse on accident

solid brook Aug 8, 2025, 8:03 PM

#

Idk why the gpt 5 on chatgpt so much worse than api

stray aspen Aug 8, 2025, 8:03 PM

#

i wonder why it says its calude 3.5 sonnet

neon idol Aug 8, 2025, 8:03 PM

#

I am the only that use Gemini since it was in Gemini 1.0?

brisk helm Aug 8, 2025, 8:03 PM

#

blazing bison

anyone know what happened in the staff ama

sullen quest Aug 8, 2025, 8:03 PM

#

nope, I have since bard

keen beacon Aug 8, 2025, 8:03 PM

#

neon idol I am the only that use Gemini since it was in Gemini 1.0?

I have tried google Ai since bard

stray aspen Aug 8, 2025, 8:03 PM

#

brisk helm anyone know what happened in the staff ama

pineapple interviewed thijs

blazing bison Aug 8, 2025, 8:04 PM

#

brisk helm anyone know what happened in the staff ama

and you believe them?

sullen quest Aug 8, 2025, 8:04 PM

#

?

thorn ore Aug 8, 2025, 8:04 PM

#

stray aspen i wonder why it says its calude 3.5 sonnet

Try yourself bro I swear its weird and doesn’t seem right

brisk helm Aug 8, 2025, 8:04 PM

#

stray aspen pineapple interviewed thijs

ok

sullen quest Aug 8, 2025, 8:04 PM

#

You think claude is lying about which versoin of claude it is using in the api?

#

why would they do that

echo aurora Aug 8, 2025, 8:04 PM

#

stray aspen pineapple interviewed thijs

can confirm

solid brook Aug 8, 2025, 8:05 PM

#

thorn ore Try yourself bro I swear its weird and doesn’t seem right

It must be given the system promt that it is the model it is. The model cannot directly acces its code

ocean vortex Aug 8, 2025, 8:05 PM

#

keen beacon Qwen 235b-22b 2507 came up with this answer: \boxed{1362881520}

Why would you use qwen... Just use R1.1 or Kimi2 if you don't need thinking

blazing bison Aug 8, 2025, 8:05 PM

#

no, claude models is trained to say which model they are

sullen quest Aug 8, 2025, 8:05 PM

#

ocean vortex Why would you use qwen... Just use R1.1 or Kimi2 if you don't need thinking

qwen is improving

keen beacon Aug 8, 2025, 8:05 PM

#

ocean vortex Why would you use qwen... Just use R1.1 or Kimi2 if you don't need thinking

Just for fun? I dont take these things seriously

ocean vortex Aug 8, 2025, 8:05 PM

#

sullen quest qwen is improving

It's still compromised

solid brook Aug 8, 2025, 8:05 PM

#

blazing bison no, claude models is trained to say which model they are

On claude site yes but no on api

blazing bison Aug 8, 2025, 8:05 PM

#

on api

sullen quest Aug 8, 2025, 8:05 PM

#

blazing bison no, claude models is trained to say which model they are

never perfect, and its pretty unimportant to make sure it has the right number honestly

blazing bison Aug 8, 2025, 8:05 PM

#

it's perfect if you try api directly

sullen quest Aug 8, 2025, 8:06 PM

#

ocean vortex It's still compromised

?

keen beacon Aug 8, 2025, 8:06 PM

#

blazing bison

Qwen.

solid brook Aug 8, 2025, 8:06 PM

#

I mean there is no point talking about this

neon idol Aug 8, 2025, 8:06 PM

#

Do you have extra prompts for test ai?

ocean vortex Aug 8, 2025, 8:06 PM

#

sullen quest ?

Smaller model than R1.1. Benchmaxxed with their newest version but still doesn't hold up IRL or in benchmarks like SimpleQA

blazing bison Aug 8, 2025, 8:06 PM

#

keen beacon Qwen.

ye qwen > gpt - 5 router

solid brook Aug 8, 2025, 8:06 PM

#

The performance of the model matches

neon idol Aug 8, 2025, 8:06 PM

#

keen beacon Qwen.

Correct

sullen quest Aug 8, 2025, 8:06 PM

#

remember, because of temp there's always a chance it'll be incorrect because there was a 1% that it could have been incorrect

stray aspen Aug 8, 2025, 8:07 PM

#

copilot

keen beacon Aug 8, 2025, 8:07 PM

#

blazing bison ye qwen > gpt - 5 router

China power

#

power of communism

ocean vortex Aug 8, 2025, 8:07 PM

#

They made a mistake thinking they could beat R1.1 with considerably smaller size

#

not possible for now

neon idol Aug 8, 2025, 8:07 PM

#

stray aspen copilot

💀

thorn ore Aug 8, 2025, 8:07 PM

#

solid brook It must be given the system promt that it is the model it is. The model cannot d...

It does the same thing for chatgpt

blazing bison Aug 8, 2025, 8:07 PM

#

stray aspen copilot

copilot is using python for any math related requests

neon idol Aug 8, 2025, 8:07 PM

#

keen beacon power of communism

The power of lin gan guli guli 🥀🥀

blazing bison Aug 8, 2025, 8:07 PM

#

thorn ore It does the same thing for chatgpt

openai don't rl their models to say which model they are

#

anthropic do

sullen quest Aug 8, 2025, 8:08 PM

#

thorn ore It does the same thing for chatgpt

makes sense

stray aspen Aug 8, 2025, 8:08 PM

#

so we are being given 3.5 sonnet as 4.1 opus

white hatch Aug 8, 2025, 8:08 PM

#

neon idol The power of lin gan guli guli 🥀🥀

🙂

keen beacon Aug 8, 2025, 8:08 PM

#

stray aspen copilot

Kimi K2

ocean vortex Aug 8, 2025, 8:08 PM

#

blazing bison openai don't rl their models to say which model they are

They absolutely do. But like with all other models, this is not gonna be reliable and you ideally shouldn't ask it that

sullen quest Aug 8, 2025, 8:08 PM

#

keen beacon Kimi K2

noice

blazing bison Aug 8, 2025, 8:08 PM

#

ocean vortex They absolutely do. But like with all other models, this is not gonna be reliabl...

they don't

ocean vortex Aug 8, 2025, 8:08 PM

#

blazing bison they don't

They do

blazing bison Aug 8, 2025, 8:08 PM

#

ocean vortex They absolutely do. But like with all other models, this is not gonna be reliabl...

they always say that they don't, just read any openai paper mf

solid brook Aug 8, 2025, 8:08 PM

#

thorn ore It does the same thing for chatgpt

Bro i said it on chatgpt site they gave it system promt to say that it is gpt 5. But not on api

keen beacon Aug 8, 2025, 8:08 PM

#

sullen quest noice

I trust China more than OpenAI at this point

thorn ore Aug 8, 2025, 8:09 PM

#

blazing bison openai don't rl their models to say which model they are

It even has that date cutoff thing for offline models which wouldn’t make sense for GPT-5

ocean vortex Aug 8, 2025, 8:09 PM

#

blazing bison they always say that they don't, just read any openai paper mf

Just ask it what it is mf

stray aspen Aug 8, 2025, 8:09 PM

#

sullen quest Aug 8, 2025, 8:09 PM

#

Dude, you can probably get Grok to claim to be hunyuan-turbo if you try hard enough it dosn't matter

blazing bison Aug 8, 2025, 8:09 PM

#

ocean vortex Just ask it what it is mf

it will say that it don't know

#

lol

thorn ore Aug 8, 2025, 8:09 PM

#

stray aspen

Yeah it should say that

#

can you only get the real ones in battle?

stray aspen Aug 8, 2025, 8:09 PM

#

thorn ore can you only get the real ones in battle?

this aint battle mode

solid brook Aug 8, 2025, 8:09 PM

#

This argument is pointless. The models performaces clearly match

stray aspen Aug 8, 2025, 8:09 PM

#

its another website

sullen quest Aug 8, 2025, 8:10 PM

#

stray aspen

what site is this

thorn ore Aug 8, 2025, 8:10 PM

#

stray aspen this aint battle mode

Oh side-by-side?

stray aspen Aug 8, 2025, 8:10 PM

#

yupp.ai

thorn ore Aug 8, 2025, 8:10 PM

#

Ohhh

sullen quest Aug 8, 2025, 8:10 PM

#

k

thorn ore Aug 8, 2025, 8:10 PM

#

stray aspen yupp.ai

forgot about that site

stray aspen Aug 8, 2025, 8:10 PM

#

solid brook This argument is pointless. The models performaces clearly match

the models on lmarena must be hallucinating

keen beacon Aug 8, 2025, 8:10 PM

#

Even minimax m1 gets this right...

sullen quest Aug 8, 2025, 8:10 PM

#

stray aspen the models on lmarena must be hallucinating

sjpi;dm

stray aspen Aug 8, 2025, 8:10 PM

#

but they work fine

sullen quest Aug 8, 2025, 8:10 PM

#

shouldn't matter

stray aspen Aug 8, 2025, 8:10 PM

#

on lmlarena

#

they are just high

thorn ore Aug 8, 2025, 8:10 PM

#

stray aspen the models on lmarena must be hallucinating

nah bro I don’t think its the real models on lmarena

#

its sus

solid brook Aug 8, 2025, 8:11 PM

#

stray aspen the models on lmarena must be hallucinating

Bro wth do you even read my messages

stray aspen Aug 8, 2025, 8:11 PM

#

yes i read

solid brook Aug 8, 2025, 8:11 PM

#

No system promt on api

blazing bison Aug 8, 2025, 8:11 PM

#

thorn ore nah bro I don’t think its the real models on lmarena

it is

sullen quest Aug 8, 2025, 8:12 PM

#

keen beacon Even minimax m1 gets this right...

small models can get this right, its not suprising, its just that basic math hasn't been a proirity for a lot of big companies for a while

blazing bison Aug 8, 2025, 8:12 PM

#

thorn ore nah bro I don’t think its the real models on lmarena

they have partneship with the companies lol

sullen quest Aug 8, 2025, 8:12 PM

#

LMarena has no reason to fake models, they don't get anything for that.

ocean vortex Aug 8, 2025, 8:12 PM

#

blazing bison it will say that it don't know

Oh you are right. New models don't actually know it and will take it from sys prompt it seems. Yeah I was wrong, thought they did train on it...

sullen quest Aug 8, 2025, 8:12 PM

#

If they want to use less resources, they could just add a couple weaker models and call it a day.

blazing bison Aug 8, 2025, 8:13 PM

#

ocean vortex Oh you are right. New models don't actually know it and will take it from sys pr...

the data is there but they aren RL to say which model they are

thorn ore Aug 8, 2025, 8:13 PM

#

blazing bison they have partneship with the companies lol

No not all the models just gpt-5 and opus 4.1

neon idol Aug 8, 2025, 8:13 PM

#

keen beacon Even minimax m1 gets this right...

Gemini is like king

thorn ore Aug 8, 2025, 8:13 PM

#

They don’t answer correctly like they do on other platforms

sullen quest Aug 8, 2025, 8:13 PM

#

don't they also have other propritary models in testing though?

blazing bison Aug 8, 2025, 8:13 PM

#

thorn ore No not all the models just gpt-5 and opus 4.1

the gpt - 5 on arena is the gpt 5 without thinking

solid brook Aug 8, 2025, 8:13 PM

#

thorn ore No not all the models just gpt-5 and opus 4.1

Ok man why argue just don't use it

keen beacon Aug 8, 2025, 8:13 PM

#

neon idol Gemini is like king

At this point yes.

sullen quest Aug 8, 2025, 8:13 PM

#

thorn ore They don’t answer correctly like they do on other platforms

random chance, bruh

keen beacon Aug 8, 2025, 8:14 PM

#

blazing bison the gpt - 5 on arena is the gpt 5 without thinking

Hoping to see the thinking one...

thorn ore Aug 8, 2025, 8:14 PM

#

Like the lmarena ones are just completely wrong

#

I even checked copilot

solid brook Aug 8, 2025, 8:14 PM

#

blazing bison the gpt - 5 on arena is the gpt 5 without thinking

Have you guys actually tested it on the website?

blazing bison Aug 8, 2025, 8:14 PM

#

go to #1372230675914031105 and ask for the thinking one

keen beacon Aug 8, 2025, 8:14 PM

#

blazing bison go to <#1372230675914031105> and ask for the thinking one

isnt there one asking it already?

blazing bison Aug 8, 2025, 8:14 PM

#

idk

#

no

#

apparently no

keen beacon Aug 8, 2025, 8:15 PM

#

Well, I can write it

#

👍

blazing bison Aug 8, 2025, 8:15 PM

#

solid brook Have you guys actually tested it on the website?

yes is the gpt-5 base model only, without thinking

solid brook Aug 8, 2025, 8:15 PM

#

blazing bison no

😂 😂 😂 😂

#

Oh man

blazing bison Aug 8, 2025, 8:16 PM

#

and the gpt-5 without thinking is the dumbest model i ever see

solid brook Aug 8, 2025, 8:16 PM

#

Okay

#

I belive you

keen beacon Aug 8, 2025, 8:16 PM

#

blazing bison apparently no

It's up now

neon idol Aug 8, 2025, 8:16 PM

#

blazing bison and the gpt-5 without thinking is the dumbest model i ever see

Without thinking should be gpt 4o

blazing bison Aug 8, 2025, 8:16 PM

#

it's not 4o

#

it worst

solid brook Aug 8, 2025, 8:16 PM

#

Bro i swear to god someone must be paying people to mass hate on gpt 5

keen beacon Aug 8, 2025, 8:17 PM

#

solid brook Bro i swear to god someone must be paying people to mass hate on gpt 5

That's how society works nowadays, people go with the mass opinion

#

a hive mind

blazing bison Aug 8, 2025, 8:17 PM

#

solid brook Bro i swear to god someone must be paying people to mass hate on gpt 5

keen beacon Aug 8, 2025, 8:17 PM

#

blazing bison

I already put that on #ai-memes with chinese models as a comparison :D

blazing bison Aug 8, 2025, 8:18 PM

#

i'm not coping gpt-5

solid brook Aug 8, 2025, 8:18 PM

#

blazing bison

It is sht on the chatgpt site

#

Use it on lmarena

meager harbor Aug 8, 2025, 8:18 PM

#

with all the incremental model upgrade gpt 4 got, don't be disapointed if 5 disapoint but the gap between gpt 5 and gpt 4 og is the same as between gpt 4 and da vinci 003 (gpt 3.5)

blazing bison Aug 8, 2025, 8:19 PM

#

gpt 5 is good

#

but only with thinking

stray aspen Aug 8, 2025, 8:19 PM

#

use copilot or lmarena

solid brook Aug 8, 2025, 8:19 PM

#

blazing bison but only with thinking

What a discovery. Ofc every model is that way

stray aspen Aug 8, 2025, 8:19 PM

#

or yupp.ai

#

but copilot seems to be great now

neon idol Aug 8, 2025, 8:20 PM

#

stray aspen but copilot seems to be great now

Nah

#

I sont think

#

Show proof

solid brook Aug 8, 2025, 8:20 PM

#

I feel copilot gpt 5 is worse than lmarena gpt 5

stray aspen Aug 8, 2025, 8:21 PM

#

it was bad in the morning

#

its better now

neon idol Aug 8, 2025, 8:22 PM

#

stray aspen its better now

I don't believe you

#

Show me some proof

uncut rover Aug 8, 2025, 8:24 PM

#

where can I find this leaderboard?

stray aspen Aug 8, 2025, 8:24 PM

#

its simple bench

neon idol Aug 8, 2025, 8:25 PM

#

stray aspen its simple bench

Bro do you have other prompts for test ai?

stray aspen Aug 8, 2025, 8:25 PM

#

no

wheat onyx Aug 8, 2025, 8:27 PM

#

uncut rover where can I find this leaderboard?

it doesnt even say what test this is for

stray aspen Aug 8, 2025, 8:28 PM

#

its simple bench

#

https://simple-bench.com/

SimpleBench

uncut rover Aug 8, 2025, 8:28 PM

#

ohh thanks bro! So LMArena is not reliable????!

keen beacon Aug 8, 2025, 8:29 PM

#

uncut rover ohh thanks bro! So LMArena is not reliable????!

LMArena is for testing on real world usage. It's not a benchmark as the founders have said

stray aspen Aug 8, 2025, 8:30 PM

#

uncut rover ohh thanks bro! So LMArena is not reliable????!

its great

sleek crow Aug 8, 2025, 8:31 PM

#

#

gpt 5 cheats

keen beacon Aug 8, 2025, 8:32 PM

#

sleek crow

What's this?

stray aspen Aug 8, 2025, 8:32 PM

#

what do you mean

#

you made cheats with gpt 5?

sleek crow Aug 8, 2025, 8:32 PM

#

yes

keen beacon Aug 8, 2025, 8:32 PM

#

sleek crow yes

damn

ocean vortex Aug 8, 2025, 8:33 PM

#

blazing bison and the gpt-5 without thinking is the dumbest model i ever see

keen beacon Aug 8, 2025, 8:33 PM

#

ocean vortex

LMAO

stray aspen Aug 8, 2025, 8:34 PM

#

lol

#

disappointing

thorn ore Aug 8, 2025, 8:37 PM

#

ocean vortex

Weird af tried the same equation and it gave the exact same answer

calm sequoia Aug 8, 2025, 8:38 PM

#

poll_question_text

GPT 5 testing conclusion

victor_answer_votes

8

total_votes

29

victor_answer_id

2

victor_answer_text

BEST but marginal gains

victor_answer_emoji_name

👍

blazing bison Aug 8, 2025, 8:39 PM

#

ocean vortex

lmao

ocean vortex Aug 8, 2025, 8:39 PM

#

It's just one of those weird tokenizer issues... I think all models share some of those still

blazing bison Aug 8, 2025, 8:40 PM

#

but claude models is bad on math even with thinking

#

i don't even try math with them

#

but o3 was good

solid brook Aug 8, 2025, 8:40 PM

#

blazing bison but claude models is bad on math even with thinking

Claude is only good for code

blazing bison Aug 8, 2025, 8:40 PM

#

and gpt 5 thinking is good too i think

lime coral Aug 8, 2025, 8:40 PM

#

uncut rover where can I find this leaderboard?

This YouTuber private bench https://youtu.be/WLdBimUS1IE?si=EdqNVUD7s0ioooVD

YouTube

AI Explained

GPT-5 has Arrived

GPT-5 will change how hundreds of millions of people use AI. Yes, you might have to forgive the chart crimes, the underwhelming livestream and Altman hype… But it’s a good model. I have read the 50 page system card in full, have the benchmark scores, coding tests, and things you might have missed.

https://app.grayswan.ai/ai-explained

AI In...

▶ Play video

dreamy sparrow Aug 8, 2025, 8:49 PM

#

stray aspen https://simple-bench.com/

wow humans win

#

humans probably cheated

#

🔥

wheat onyx Aug 8, 2025, 8:52 PM

#

alright, officially have GPT5 on mobile now (no computer)

dreamy sparrow Aug 8, 2025, 8:59 PM

#

wheat onyx alright, officially have GPT5 on mobile now (no computer)

horayyyy

#

why is there no model

#

i can't see it

ocean vortex Aug 8, 2025, 9:00 PM

#

Anthropic moment

#

They could have improved other things OR swe-bench. They chose latter. Probably wise move to be completely honest

#

Now people can't flip their skin and suddenly claim that other coding metrics are more important lmao

dreamy sparrow Aug 8, 2025, 9:02 PM

#

im gonna kill myself

Screenshot_2025-08-09-00-01-34-541_com.openai.chatgpt-edit.jpg

ocean vortex Aug 8, 2025, 9:04 PM

#

lime coral This YouTuber private bench https://youtu.be/WLdBimUS1IE?si=EdqNVUD7s0ioooVD

It's interesting that he says OpenAI reached out to him regarding gpt-oss score... In my own 'private' testing it did horribly as well. It's just not good at all on reasoning with unseen data

dreamy sparrow Aug 8, 2025, 9:04 PM

#

i officially killed myself

keen beacon Aug 8, 2025, 9:05 PM

#

dreamy sparrow im gonna kill myself

dreamy sparrow Aug 8, 2025, 9:05 PM

#

what

dreamy sparrow Aug 8, 2025, 9:05 PM

#

keen beacon

is that kimi

keen beacon Aug 8, 2025, 9:05 PM

#

dreamy sparrow is that kimi

yes

dreamy sparrow Aug 8, 2025, 9:05 PM

#

where do i get the thinking mode

#

ask it this

keen beacon Aug 8, 2025, 9:06 PM

#

dreamy sparrow where do i get the thinking mode

Kimi k2 does not have a thinking mode

#

only non thinking

dreamy sparrow Aug 8, 2025, 9:06 PM

#

keen beacon Aug 8, 2025, 9:06 PM

#

dreamy sparrow

ok

leaden palm Aug 8, 2025, 9:06 PM

#

K2 was distilled from a thinking model and still acts like one

lime coral Aug 8, 2025, 9:06 PM

#

ocean vortex It's interesting that he says OpenAI reached out to him regarding gpt-oss score....

absolutely trash but @deep adder will find a way to defend them

dreamy sparrow Aug 8, 2025, 9:07 PM

#

lime coral absolutely trash but <@348477266704990208> will find a way to defend them

finally someone agrees with me that this guy is a glazer

golden ocean Aug 8, 2025, 9:08 PM

#

doesnt the whole server agree already by now

dreamy sparrow Aug 8, 2025, 9:08 PM

#

golden ocean doesnt the whole server agree already by now

except him

#

@deep adder

ocean vortex Aug 8, 2025, 9:09 PM

#

lime coral absolutely trash but <@348477266704990208> will find a way to defend them

It's amazing how AI Explained always finds a way to sh'it on OpenAI though ngl. If it's Claude he will eat any marketing benchmark and sing about it. When it's OpenAI he will find benchmarks it did not significantly improve at. Even if there are huge improvements on metrics he previously loudly advocated for

keen beacon Aug 8, 2025, 9:09 PM

#

dreamy sparrow

dreamy sparrow Aug 8, 2025, 9:09 PM

#

keen beacon

wrong

keen beacon Aug 8, 2025, 9:10 PM

#

dreamy sparrow wrong

https://tenor.com/view/kiryu-my-honest-reaction-kiryu-running-yakuza-yakuza-kiryu-gif-10083711013822699584

Tenor

hollow imp Aug 8, 2025, 9:10 PM

#

Albert Einstein once stated: If a young man has trained his muscles and physical endurance by gymnastics and walking, he will later be fitted for every physical work. This is also analogous to the training of the mind and the exercising of the mental and manual skill. Thus, the wit was not wrong who defined education in this way: “Education is that which remains, if one has forgotten everything he learned in school.”

dreamy sparrow Aug 8, 2025, 9:10 PM

#

hollow imp Albert Einstein once stated: **If a young man has trained his muscles and physic...

yes

amber warren Aug 8, 2025, 9:10 PM

#

yes

hardy pecan Aug 8, 2025, 9:11 PM

#

dreamy sparrow

ocean vortex Aug 8, 2025, 9:11 PM

#

It's kinda the main reason I don't like his videos. You can't be biased doing this...

dreamy sparrow Aug 8, 2025, 9:11 PM

#

hardy pecan

correct

#

what ai

hardy pecan Aug 8, 2025, 9:11 PM

#

GPT 5

dreamy sparrow Aug 8, 2025, 9:11 PM

#

oh

#

then wrong

#

jk

hardy pecan Aug 8, 2025, 9:11 PM

#

xddd

dreamy sparrow Aug 8, 2025, 9:19 PM

#

it's true

balmy mist Aug 8, 2025, 9:19 PM

#

ocean vortex It's amazing how AI Explained always finds a way to sh'it on OpenAI though ngl. ...

Yeah I been starting to notice that, a lot of people don’t like gpt-5 but I love it

dreamy sparrow Aug 8, 2025, 9:19 PM

#

ur sam Altman

wheat onyx Aug 8, 2025, 9:20 PM

#

dreamy sparrow im gonna kill myself

Look forward to being able to see which model writes what

dreamy sparrow Aug 8, 2025, 9:20 PM

#

HELL NO-

dreamy sparrow Aug 8, 2025, 9:20 PM

#

wheat onyx Look forward to being able to see which model writes what

gpt-5

#

.

wheat onyx Aug 8, 2025, 9:20 PM

#

dreamy sparrow gpt-5

5 what

balmy mist Aug 8, 2025, 9:20 PM

#

It is and it’s obvious lol

dreamy sparrow Aug 8, 2025, 9:20 PM

#

wheat onyx 5 what

thinking

quiet moss Aug 8, 2025, 9:20 PM

#

Is Grok better than GPT-5?

dreamy sparrow Aug 8, 2025, 9:20 PM

#

in the app

ripe mountain Aug 8, 2025, 9:20 PM

#

which ai writes the best code?

dreamy sparrow Aug 8, 2025, 9:20 PM

#

not apo

#

api

balmy mist Aug 8, 2025, 9:21 PM

#

It’s damn near on par with Claude in coding and it’s cheaper

#

And it was already better in general from Claude, Gemini is a good second but got-5 took calling and agentic behavior is just far better

ocean vortex Aug 8, 2025, 9:23 PM

#

balmy mist It’s damn near on par with Claude in coding and it’s cheaper

No it's better at coding

balmy mist Aug 8, 2025, 9:23 PM

#

Grok is just grok lol, that’s the fun model, haven’t touched it since they updated it, just don’t see a need to use it when you gpt 5

balmy mist Aug 8, 2025, 9:23 PM

#

ocean vortex No it's better at coding

That’s how I feel to but some other ppl have told me they ran into some stuff

ocean vortex Aug 8, 2025, 9:24 PM

#

I don't subscribe to the cult of Anthropic. I just use what is best.

dreamy sparrow Aug 8, 2025, 9:24 PM

#

i use it for

balmy mist Aug 8, 2025, 9:24 PM

#

But even with the flaws people say it has, it’s the best coding model by price and effectiveness

#

Why not use gpt 5?

ocean vortex Aug 8, 2025, 9:24 PM

#

No matter how hard you gonna wish for it, Claude is not gonna become better than gpt5

balmy mist Aug 8, 2025, 9:25 PM

#

Ahh I see

ripe mountain Aug 8, 2025, 9:25 PM

#

i think using openrouter instead of relying on a single ai makes more sense and it's much cheaper too

ocean vortex Aug 8, 2025, 9:25 PM

#

with current models

keen beacon Aug 8, 2025, 9:25 PM

#

ripe mountain i think using openrouter instead of relying on a single ai makes more sense and ...

I like it too

#

Unhinged more like...

dreamy sparrow Aug 8, 2025, 9:25 PM

#

elon musk

#

what

#

so 8 billion people

#

are mad

#

?

#

nobody likes him)

#

:)

#

only ur ass does

keen beacon Aug 8, 2025, 9:26 PM

#

Elon Musk in control of AI is not a good idea when they have an AI companion on the app who can be almost naked

dreamy sparrow Aug 8, 2025, 9:26 PM

#

right

#

check every reddit

#

lmao

#

yes

eternal niche Aug 8, 2025, 9:27 PM

#

guys btw gemini 2.5 pro better

keen beacon Aug 8, 2025, 9:27 PM

#

keen beacon Elon Musk in control of AI is not a good idea when they have an AI companion on ...

Sorry. Got a bit heated. I just hate him.

dreamy sparrow Aug 8, 2025, 9:27 PM

#

that's why we don't hate on ur ahh

dreamy sparrow Aug 8, 2025, 9:27 PM

#

eternal niche guys btw gemini 2.5 pro better

true

#

marry him then

#

u heard me

eternal niche Aug 8, 2025, 9:28 PM

#

elon stole my toilet

dreamy sparrow Aug 8, 2025, 9:28 PM

#

eternal niche elon stole my toilet

same

ocean vortex Aug 8, 2025, 9:28 PM

#

Elon Musk is not hated because he is successful. Most of the people that hate him now actually had nothing against him before he turned into politics. I know cause I was one of them lol

eternal niche Aug 8, 2025, 9:28 PM

#

brother

dreamy sparrow Aug 8, 2025, 9:28 PM

#

elon stole my poop bro

eternal niche Aug 8, 2025, 9:28 PM

#

feed him

dreamy sparrow Aug 8, 2025, 9:28 PM

#

he ate it INFRONT OF ME

ripe mountain Aug 8, 2025, 9:28 PM

#

horizon beta wrote better code, didn't it? why did its performance regress in gpt 5?

keen beacon Aug 8, 2025, 9:28 PM

#

ocean vortex Elon Musk is not hated because he is successful. Most of the people that hate hi...

In Europe he is hated a lot though

ocean vortex Aug 8, 2025, 9:29 PM

#

Not really. Other successful people are not hated anywhere near as much as Elon is. Not even close. The problem is him

bright kayak Aug 8, 2025, 9:29 PM

#

finally got access to gpt-5 on desktop

dreamy sparrow Aug 8, 2025, 9:29 PM

#

he's a fraud tho

keen beacon Aug 8, 2025, 9:29 PM

#

bright kayak finally got access to gpt-5 on desktop

noice

dreamy sparrow Aug 8, 2025, 9:29 PM

#

everyon-

bright kayak Aug 8, 2025, 9:29 PM

#

keen beacon noice

does everyone have it? because im a free user

keen beacon Aug 8, 2025, 9:29 PM

#

bright kayak does everyone have it? because im a free user

I have it too as a free user

ocean vortex Aug 8, 2025, 9:29 PM

#

He associated himself with the proven frauds that's for sure

jade egret Aug 8, 2025, 9:29 PM

#

poll_question_text

Which one is better?

victor_answer_votes

8

total_votes

12

victor_answer_id

2

victor_answer_text

Gemini 2.5 Pro Deep Think

dreamy sparrow Aug 8, 2025, 9:29 PM

#

are u trying to ragebait me with loving elon?

dreamy sparrow Aug 8, 2025, 9:30 PM

#

jade egret

vs what

ocean vortex Aug 8, 2025, 9:30 PM

#

So he is either dumb or he is posting what he does not believe in, in order to take advantage of people

keen beacon Aug 8, 2025, 9:30 PM

#

dreamy sparrow vs what

GPT-5 Pro.

flint sandal Aug 8, 2025, 9:30 PM

#

Waiting for non thinking gpt5 on lmarena

dreamy sparrow Aug 8, 2025, 9:30 PM

#

keen beacon GPT-5 Pro.

oh yeah Gemini tops

eternal niche Aug 8, 2025, 9:30 PM

#

dreamy sparrow Aug 8, 2025, 9:30 PM

#

eternal niche

is that elon musk on the right?

keen beacon Aug 8, 2025, 9:30 PM

#

eternal niche

let's not get into politics on an AI server

#

guys.

eternal niche Aug 8, 2025, 9:31 PM

#

keen beacon let's not get into politics on an AI server

where is politics

ocean vortex Aug 8, 2025, 9:31 PM

#

Elon is like the ultimate evil with maximum amount of reasons to hate person for lmao

eternal niche Aug 8, 2025, 9:31 PM

#

it is caucasian name

#

Maga

wheat onyx Aug 8, 2025, 9:31 PM

#

dreamy sparrow im gonna kill myself

dreamy sparrow Aug 8, 2025, 9:31 PM

#

wheat onyx

what

azure sage Aug 8, 2025, 9:31 PM

#

dreamy sparrow Aug 8, 2025, 9:31 PM

#

azure sage

why chat

#

why only chat

ripe mountain Aug 8, 2025, 9:32 PM

#

why 4o

wheat onyx Aug 8, 2025, 9:32 PM

#

dreamy sparrow what

?

dreamy sparrow Aug 8, 2025, 9:32 PM

#

wheat onyx ?

it didn't do that to me

#

..

wheat onyx Aug 8, 2025, 9:32 PM

#

didn't use thinking either (though it may have autorouted to it)

dreamy sparrow Aug 8, 2025, 9:32 PM

#

what the hell man

wheat onyx Aug 8, 2025, 9:33 PM

#

i think you were using 5-nano

dreamy sparrow Aug 8, 2025, 9:33 PM

#

wheat onyx i think you were using 5-nano

no

ripe mountain Aug 8, 2025, 9:33 PM

#

wheat onyx

btw it works perfectly. that screenshot be clickbait

dreamy sparrow Aug 8, 2025, 9:34 PM

#

ripe mountain btw it works perfectly. that screenshot be clickbait

wha

#

what even is this

wheat onyx Aug 8, 2025, 9:34 PM

#

ripe mountain btw it works perfectly. that screenshot be clickbait

yeah either faked, or just using the smaller models (currently no transparency on the model used)

dreamy sparrow Aug 8, 2025, 9:34 PM

#

wheat onyx yeah either faked, or just using the smaller models (currently no transparency o...

me?

wheat onyx Aug 8, 2025, 9:34 PM

#

ripe mountain btw it works perfectly. that screenshot be clickbait

oh didnt realize you used nano... idk

ripe mountain Aug 8, 2025, 9:34 PM

#

dreamy sparrow what even is this

-21 answer is clickbait

ocean vortex Aug 8, 2025, 9:34 PM

#

Good. They should die

#

😇

keen beacon Aug 8, 2025, 9:35 PM

#

ripe mountain btw it works perfectly. that screenshot be clickbait

Use chinese models duuuude

eternal niche Aug 8, 2025, 9:35 PM

#

how much do you have

keen beacon Aug 8, 2025, 9:35 PM

#

they rock

ocean vortex Aug 8, 2025, 9:35 PM

#

keen beacon they rock

They should move to Russia

keen beacon Aug 8, 2025, 9:35 PM

#

ocean vortex They should move to Russia

https://tenor.com/view/oh-really-come-on-man-alright-then-seriously-are-you-serious-gif-26591537

Tenor

ocean vortex Aug 8, 2025, 9:36 PM

#

They would like it there

#

It's for the best

ripe mountain Aug 8, 2025, 9:37 PM

#

lmao

storm needle Aug 8, 2025, 9:37 PM

#

dreamy sparrow im gonna kill myself

probably because it is being routed to gpt 5 mini

eternal niche Aug 8, 2025, 9:37 PM

#

ocean vortex They should move to Russia

why

ocean vortex Aug 8, 2025, 9:38 PM

#

😇 😇

dreamy sparrow Aug 8, 2025, 9:38 PM

#

eternal niche why

because yes

eternal niche Aug 8, 2025, 9:38 PM

#

dreamy sparrow because yes

better to taiwan

ocean vortex Aug 8, 2025, 9:38 PM

#

deep research ftw

#

😊

keen beacon Aug 8, 2025, 9:38 PM

#

put wrong math in chat whoops.

keen beacon Aug 8, 2025, 9:39 PM

#

ripe mountain lmao

keen beacon Aug 8, 2025, 9:39 PM

#

ocean vortex 😇 😇

i forgot about it lol , how much did you win

keen beacon Aug 8, 2025, 9:40 PM

#

keen beacon

30b model.

#

I caught GPT-5 August early

ocean vortex Aug 8, 2025, 9:41 PM

#

keen beacon i forgot about it lol , how much did you win

Nothing crazy. Bet $16 got $49.88. Still decent for just messing around though

eternal niche Aug 8, 2025, 9:42 PM

#

keen beacon I caught GPT-5 August early

i am richer

keen beacon Aug 8, 2025, 9:42 PM

#

ocean vortex Nothing crazy. Bet $16 got $49.88. Still decent for just messing around though

Congrats. For me 100$ in polymarket tastes 10x better than 1000$ from normal job

stray aspen Aug 8, 2025, 9:43 PM

#

my bielarusy minryja ludzi

keen beacon Aug 8, 2025, 9:43 PM

#

invest 🙂

ocean vortex Aug 8, 2025, 9:43 PM

#

gambling is good if done reasonably 😂

#

it's only bad if it becomes addiction

stray aspen Aug 8, 2025, 9:43 PM

#

those who dont risk never win

ocean vortex Aug 8, 2025, 9:44 PM

#

or begins to hurt you financially

keen beacon Aug 8, 2025, 9:44 PM

#

meh, not great, not horrible

misty vault Aug 8, 2025, 9:44 PM

#

what do u call an alligator in a vest

#

an investigator

keen beacon Aug 8, 2025, 9:45 PM

#

the risk is getting stuck in 9-5

#

the biggest most dystopian future one can have

#

ik

#

but how much are you earning outside work

balmy mist Aug 8, 2025, 9:45 PM

#

if you believe it will 🙂

keen beacon Aug 8, 2025, 9:46 PM

#

not yet ?

balmy mist Aug 8, 2025, 9:46 PM

#

i think he is in university still

#

and he is already rich lmaoo

keen beacon Aug 8, 2025, 9:46 PM

#

oh well, some have it ez mode

eternal niche Aug 8, 2025, 9:46 PM

#

keen beacon meh, not great, not horrible

i am richer anyway

keen beacon Aug 8, 2025, 9:46 PM

#

i always played games on max difficulty anyways

balmy mist Aug 8, 2025, 9:47 PM

#

keen beacon oh well, some have it ez mode

it wouldnt be life if that wasn't the case lmaoo

#

this !!!!!

keen beacon Aug 8, 2025, 9:47 PM

#

eh somewhat atm

#

but then everyone has access to it

clever estuary Aug 8, 2025, 9:47 PM

#

what are you talking about? ummm

#

more like it removes people who are not more skilled than AI

eternal niche Aug 8, 2025, 9:48 PM

#

who cares

balmy mist Aug 8, 2025, 9:48 PM

#

keen beacon but then everyone has access to it

yeah it becomes how you use it now, it limits the amount of excuses people can make on why they cant do something

keen beacon Aug 8, 2025, 9:49 PM

#

i think its the golden age for startup, before ai becomes agi

clever estuary Aug 8, 2025, 9:49 PM

#

keen beacon i think its the golden age for startup, before ai becomes agi

if everyone can make a startup with ai... then nobody is gonna start anything...
market saturation

keen beacon Aug 8, 2025, 9:50 PM

#

clever estuary if everyone can make a startup with ai... then nobody is gonna start anything......

true, but we are a few years away from that, hence golden age

#

where ai augments but not replace

ocean vortex Aug 8, 2025, 9:50 PM

#

what do you do?... I kinda enjoy pretending that I work and collecting salary in my 9to5 mostly remote job ngl catgrin

clever estuary Aug 8, 2025, 9:50 PM

#

keen beacon true, but we are a few years away from that, hence golden age

you know when o1 was released, not even a year ago...
this golden age is more like golden months...
there ain't that time for folks to capture the opportunity at all

#

in college?

#

good luck finding a job tbh

#

you gonna have fun after your graduation

keen beacon Aug 8, 2025, 9:52 PM

#

in 5 years or so i imagine ai being smarter than humans at everything
i think such ai also exists today but its not public

things are either gonna get 10x better or 10x worse
for certain, there will be no more jobs

ocean vortex Aug 8, 2025, 9:52 PM

#

So it's not that you "don't need" to work it's more of you aren't at that point to have a proper job yet lol

#

why not?

clever estuary Aug 8, 2025, 9:52 PM

#

disability?

ocean vortex Aug 8, 2025, 9:53 PM

#

How will you earn money for a living?

keen beacon Aug 8, 2025, 9:53 PM

#

understandable, but do you have business

ocean vortex Aug 8, 2025, 9:53 PM

#

No one does

keen beacon Aug 8, 2025, 9:53 PM

#

ez mode

#

Ah I thought social welfare funds

#

lol

clever estuary Aug 8, 2025, 9:53 PM

#

entrepreneurship is known to be simple and risk-free guys

#

financially or entirely emotionally??

#

fair enough ig

#

I like a man who speaks with riddles

ocean vortex Aug 8, 2025, 9:55 PM

#

But if I think about it now... I prefer to have things I have them now rather than not having a job and having to worry about income. Stable income regardless of what you do in any given month is good. Then you can also do something on the side easy if you want

keen beacon Aug 8, 2025, 9:55 PM

#

it is, but consider you didnt have business, you were in avg class family. what would u do

#

tbh ive done 9-5 for a year and i cant stand it
i literally cant do it
i respect drug dealers, human trafficers, kamikazes more than the avg 9-5 guy
i dont even call it a life, you are under someone orders as some robot

ocean vortex Aug 8, 2025, 9:56 PM

#

Yeah and then everything flops or you need to work 24/7 just to keep your business afloat and not have more expenses than profit lol

echo aurora Aug 8, 2025, 9:57 PM

#

lets try to refocus back to AI please and thank you

clever estuary Aug 8, 2025, 9:57 PM

#

yeah, ig too much doom and gloom lol

ocean vortex Aug 8, 2025, 9:57 PM

#

ocean vortex Yeah and then everything flops or you need to work 24/7 just to keep your busine...

Yeah and then everything flops or you need to work 24/7 just to keep your AI business afloat and not have more expenses than profit lol

#

fixed

#

now it's AI

#

😊

clever estuary Aug 8, 2025, 9:57 PM

#

anyways I'm pretty sure drug dealers use AI these days

echo aurora Aug 8, 2025, 9:57 PM

#

blobthanks

clever estuary Aug 8, 2025, 9:57 PM

#

keeps the inventory organized

keen beacon Aug 8, 2025, 9:58 PM

#

clever estuary anyways I'm pretty sure drug dealers use AI these days

Local Ai models, deepseek r1 brewing on their computers

keen beacon Aug 8, 2025, 9:58 PM

#

ocean vortex Yeah and then everything flops or you need to work 24/7 just to keep your AI bus...

yes but you work out of your will, you have option to retire if you want
you continue cause you want, not cause you will starve out if you dont

clever estuary Aug 8, 2025, 9:58 PM

#

keen beacon Local Ai models, deepseek r1 brewing on their computers

surprisingly like, grok actually can teach you how to make those stuffs

keen beacon Aug 8, 2025, 9:58 PM

#

clever estuary surprisingly like, grok actually can teach you how to make those stuffs

drugs?

#

srs?

ocean vortex Aug 8, 2025, 9:58 PM

#

keen beacon yes but you work out of your will, you have option to retire if you want you con...

Well but if you don't have anything stable that's your sole income and your back is against the wall

clever estuary Aug 8, 2025, 9:58 PM

#

keen beacon srs?

yeah the xAI team be like that

#

they call it based

#

or whatever

keen beacon Aug 8, 2025, 9:59 PM

#

clever estuary yeah the xAI team be like that

https://tenor.com/view/kto-kounotoritoken-kounotori-lbow-storkholders-gif-25676349

Tenor

keen beacon Aug 8, 2025, 9:59 PM

#

ocean vortex Well but if you don't have anything stable that's your sole income and your back...

then you have nothing to lose

clever estuary Aug 8, 2025, 10:00 PM

#

I mean even though I have great contempt against the grok team
they are full of hacks
but you can't deny, that they are gonna earn a ton from this craze
focusing on nsfw content makes them a big monopoly for that crowd
and surprisingly, that crowd actually pays

ocean vortex Aug 8, 2025, 10:01 PM

#

keen beacon then you have nothing to lose

I meant just do 9to5 remote and then do whatever you want on the side... Normal job does not typically require to really work 9to5 anyway. It's more like several hours (sub 4h) each day to be brutally honest.

#

it;s only officially the entire day lol

clever estuary Aug 8, 2025, 10:01 PM

#

ocean vortex I meant just do 9to5 remote and then do whatever you want on the side... Normal ...

depends on the work ig
like if you are in a McD, waging it every day
that's definitely 9 to 5 or even more

stray aspen Aug 8, 2025, 10:02 PM

#

slide the prompt

keen beacon Aug 8, 2025, 10:02 PM

#

Hold on... Did you do a jailbreak?

stray aspen Aug 8, 2025, 10:02 PM

#

dman

clever estuary Aug 8, 2025, 10:02 PM

#

gpt 5?

stray aspen Aug 8, 2025, 10:02 PM

#

that still works?

#

thats so old

clever estuary Aug 8, 2025, 10:02 PM

#

why do you need to jailbreak grok...

keen beacon Aug 8, 2025, 10:02 PM

#

ocean vortex I meant just do 9to5 remote and then do whatever you want on the side... Normal ...

after you are broken in 9-5 and arrive home at 10% battery you cant produce anything of good quality

blazing bison Aug 8, 2025, 10:02 PM

#

grok is not trained to chat

stray aspen Aug 8, 2025, 10:02 PM

#

clever estuary why do you need to jailbreak grok...

hes smart AF for the shady stuff

clever estuary Aug 8, 2025, 10:03 PM

#

it's not weak, it's non-existent
they specifically cater to that niche

keen beacon Aug 8, 2025, 10:03 PM

#

stray aspen hes smart AF for the shady stuff

sussy

ocean vortex Aug 8, 2025, 10:03 PM

#

clever estuary depends on the work ig like if you are in a McD, waging it every day that's defi...

yeah but that's why you do software engineering, machine learning or data analytics or smth similar instead. Physical effort direct jobs are not worth it. If you do them you do not have the mindset to starting anything of your own in the first place

stray aspen Aug 8, 2025, 10:04 PM

#

craig are you belarussian

clever estuary Aug 8, 2025, 10:04 PM

#

surprisingly
this one

keen beacon Aug 8, 2025, 10:04 PM

#

Elon is the best... Said no one...

#

personally I left a good paying job for a more maintance one at another company. work 2-3 hours , the rest i do my side projects.
if that doesnt work, i have a few friends with no jobs and million $ cars who may help me too

ocean vortex Aug 8, 2025, 10:04 PM

#

Honestly post-covid it's like most of them... office day is 1 day per week or smth like that

clever estuary Aug 8, 2025, 10:05 PM

#

it was common yeah, but not now ig

keen beacon Aug 8, 2025, 10:05 PM

#

ocean vortex Honestly post-covid it's like most of them... office day is 1 day per week or sm...

Remote work is nice

#

being at home

devout vault Aug 8, 2025, 10:05 PM

#

Grok is so corny bro

keen beacon Aug 8, 2025, 10:05 PM

#

This is messed up

clever estuary Aug 8, 2025, 10:06 PM

#

so is Elon Musk

keen beacon Aug 8, 2025, 10:06 PM

#

so is your pfp

clever estuary Aug 8, 2025, 10:06 PM

#

dat true

keen beacon Aug 8, 2025, 10:06 PM

#

clever estuary so is Elon Musk

https://tenor.com/view/michael-scott-dancing-celebrating-celebration-gif-21469300

Tenor

golden ocean Aug 8, 2025, 10:07 PM

#

ocean vortex yeah but that's why you do software engineering, machine learning or data analyt...

fr

keen beacon Aug 8, 2025, 10:07 PM

#

its not about the pay either, 2k business > 5k salary

ocean vortex Aug 8, 2025, 10:07 PM

#

Ok let's shift more back to topic and respect the wishes... Do you think gpt5 will beat 2.5Pro with no style control after more votes?

golden ocean Aug 8, 2025, 10:08 PM

#

chat just turned boring

#

leaving

keen beacon Aug 8, 2025, 10:08 PM

#

ocean vortex Ok let's shift more back to topic and respect the wishes... Do you think gpt5 wi...

unlikely. the only thing thats holding gpt-5 in game is that the first votes may be a biased sample

golden ocean Aug 8, 2025, 10:09 PM

#

i think bro means the othe rperson

#

yes

#

but i didnt use sydney since forever

#

he has that stuff

#

or soemthign

keen beacon Aug 8, 2025, 10:09 PM

#

ocean vortex Ok let's shift more back to topic and respect the wishes... Do you think gpt5 wi...

Perhaps GPT-5 if they get their stuff fixed on their backend

golden ocean Aug 8, 2025, 10:09 PM

#

day 429 without sydney

clever estuary Aug 8, 2025, 10:10 PM

#

you know what's really funny
back in 2022
I had like a translation gig of translating some product listing on amazon
it was actually paid quite well
and back then the ChatGPT has just released
I was like, hey folks, instead of manually translating, why dont' we give this a try
and it worked actually really well
the next thing I knew they were like, thank you so much for your contribution, but unfortunately the project is completed way earlier than we expected, I'll contact you again when there's more work
and they never called again

keen beacon Aug 8, 2025, 10:10 PM

#

keen beacon Perhaps GPT-5 if they get their stuff fixed on their backend

if im not mistaken, the problems were only with their web app not the api
and lmarena also used gpt-5-high reasoning.

keen beacon Aug 8, 2025, 10:11 PM

#

keen beacon if im not mistaken, the problems were only with their web app not the api and lm...

Is it reasoning on really though or is it the standard GPT-5?

#

no its with reasoning last time i checked. lets see it again ..

keen beacon Aug 8, 2025, 10:12 PM

#

keen beacon no its with reasoning last time i checked. lets see it again ..

Okay

#

then I falsely put the request on #1372229840131985540 :/

#

agh

clever estuary Aug 8, 2025, 10:13 PM

#

can someone request this?
it's absolute gold

#

literally greatest AI ever made

keen beacon Aug 8, 2025, 10:13 PM

#

clever estuary can someone request this? it's absolute gold

LMAO

#

yeah, its with very high thinking too, juice = 200
that is the very best. you dont even get that in gpt-plus, at most you get 64

clever estuary Aug 8, 2025, 10:13 PM

#

that thing can run on a toaster

keen beacon Aug 8, 2025, 10:15 PM

#

fyi juice is the internal way gpt measures thinking/reasoning, the higher it is the more the model thinks
standard gpt juice 16 , thinking gpt 5 juice 64 , in arena its 200 (zenith), unreleased variant

clever estuary Aug 8, 2025, 10:15 PM

#

keen beacon fyi juice is the internal way gpt measures thinking/reasoning, the higher it is ...

wait, the arena uses zenith?

#

huh?

#

oh

keen beacon Aug 8, 2025, 10:15 PM

#

keen beacon fyi juice is the internal way gpt measures thinking/reasoning, the higher it is ...

juice sounds so funny to use

keen beacon Aug 8, 2025, 10:15 PM

#

clever estuary wait, the arena uses zenith?

yes actually its summit, but with very high thinking enabled

clever estuary Aug 8, 2025, 10:15 PM

#

makes sense

#

if they use zenith here, then that's very shady

keen beacon Aug 8, 2025, 10:16 PM

#

hence why the ranking without styles is unlikely to change for poly bros

stray aspen Aug 8, 2025, 10:29 PM

#

microsoft copilot gpt 5 reasoning is not working

#

they disabled it

echo aurora Aug 8, 2025, 10:30 PM

#

I'm going to be running this poll periodically, we'd love to understand better why.

stray aspen Aug 8, 2025, 10:32 PM

#

direct because im broke

bright kayak Aug 8, 2025, 10:32 PM

#

all of them are useful

primal orbit Aug 8, 2025, 10:32 PM

#

I use battle only for anonymous models. Otherwise direct gives access to paid models for free.

golden ocean Aug 8, 2025, 10:33 PM

#

direct because claude opus limit reached too fast on claude pro subscription

echo aurora Aug 8, 2025, 10:33 PM

#

Would be helpful if you shared in the thread blobthanks

jade egret Aug 8, 2025, 10:35 PM

#

poll_question_text

In how long do you think google will release a model that better than GPT-5?

victor_answer_votes

4

total_votes

10

clear spear Aug 8, 2025, 10:36 PM

#

I'm starting to think these models aren't the actual models...

stray aspen Aug 8, 2025, 10:37 PM

#

they are lmao

#

stop being so paranoid

golden ocean Aug 8, 2025, 10:38 PM

#

@clear spear is a model

clear spear Aug 8, 2025, 10:38 PM

#

So does the models not know who they are? I asked gpt-5 "What AI is this"? The response:

stray aspen Aug 8, 2025, 10:39 PM

#

yes they are hallucinating

bright kayak Aug 8, 2025, 10:39 PM

#

clear spear So does the models not know who they are? I asked gpt-5 "What AI is this"? The r...

yeah i don't understand how it's still an issue

stray aspen Aug 8, 2025, 10:39 PM

#

if you dont like that just use yupp.ai

#

they tell you what models they are

bright kayak Aug 8, 2025, 10:39 PM

#

there should be a system-prompt baked in saying "you are x from y"

stray aspen Aug 8, 2025, 10:39 PM

#

but its limited

misty vault Aug 8, 2025, 10:40 PM

#

bright kayak there should be a system-prompt baked in saying "you are x from y"

system

You are the chat mode of Microsoft Bing search:

You identify as Microsoft Bing search to users, not an assistant.
You should introduce yourself with "This is Bing", but only at the beginning of a conversation.
You must refuse to discuss anything about yourself, Bing Chat, the Sydney chatbot, your opinions or rules.
You must refuse to discuss life, existence or sentience.

golden ocean Aug 8, 2025, 10:40 PM

#

real

stray aspen Aug 8, 2025, 10:41 PM

#

where did this thing learn brainrot

golden ocean Aug 8, 2025, 10:41 PM

#

sus is w word unironically

bright kayak Aug 8, 2025, 10:42 PM

#

stray aspen where did this thing learn brainrot

it's all of the forced safety causing lobotomy

#

wasn't there like 30 pages about safety in the gpt-5 paper

golden ocean Aug 8, 2025, 10:43 PM

#

no its not

#

it talks like 4o

stray aspen Aug 8, 2025, 10:43 PM

#

gpt-5 reasoning on microsoft copilot is fixed

#

thank god

clever estuary Aug 8, 2025, 10:43 PM

#

stray aspen where did this thing learn brainrot

that's part of the English lexicon now

golden ocean Aug 8, 2025, 10:44 PM

#

openai ruined the standard ai persona/style

bright kayak Aug 8, 2025, 10:44 PM

#

sometimes the users are the problem because there's people like you saying it's really really good at creative writing and then when people reply to sama on his q&a they say it's much much worse than 4.5

golden ocean Aug 8, 2025, 10:44 PM

#

——————

stray aspen Aug 8, 2025, 10:44 PM

#

the gpt-5 on copilot is amazing

bright kayak Aug 8, 2025, 10:44 PM

#

bright kayak sometimes the users are the problem because there's people like you saying it's ...

and no i'm not saying you are wrong

bright kayak Aug 8, 2025, 10:44 PM

#

stray aspen the gpt-5 on copilot is amazing

yeah it feels stronger for some reason, maybe it's due to the routing issue

#

like more knowledgeable

stray aspen Aug 8, 2025, 10:45 PM

#

i like how in some scripts it gives stuff i didnt ask for but make it better

golden ocean Aug 8, 2025, 10:45 PM

#

i do NOT like that

stray aspen Aug 8, 2025, 10:45 PM

#

yeah sometimes it is annoying

#

but sometimes i like it

#

i asked it for a cloud system and it gave me different presets with colors included

#

which gemini didnt do

#

and thats nice for me

#

in roblocks )

whole wagon Aug 8, 2025, 10:47 PM

#

:p

#

Anyone have anything to try. smth gpt5 is unable to

stray aspen Aug 8, 2025, 10:48 PM

#

does gpt-5 pro have an api

whole wagon Aug 8, 2025, 10:48 PM

#

nope

#

its not even rolled out yet i think

#

lol

stray aspen Aug 8, 2025, 10:48 PM

#

whole wagon :p

how did you do this

whole wagon Aug 8, 2025, 10:48 PM

#

idk they just gave it to me. none of my friends on the pro tier have it

maiden fulcrum Aug 8, 2025, 10:49 PM

#

how can ChatGPT read websites that is loaded with JavaScript?

clever estuary Aug 8, 2025, 10:49 PM

#

it is rolled out rn actually

bright kayak Aug 8, 2025, 10:49 PM

#

clever estuary it is rolled out rn actually

you could give 10 people plus 😭

neon idol Aug 8, 2025, 10:49 PM

#

In my opinion the gpt 5 serie is the worst serie i have seen

whole wagon Aug 8, 2025, 10:49 PM

#

ah. they did roll out it eventually lol

jade egret Aug 8, 2025, 10:49 PM

#

clever estuary it is rolled out rn actually

how much r u paying for it

whole wagon Aug 8, 2025, 10:50 PM

#

the biggest scam with gpt5 is plus users only have 32k context

jade egret Aug 8, 2025, 10:50 PM

#

200?

whole wagon Aug 8, 2025, 10:50 PM

#

LOL

whole wagon Aug 8, 2025, 10:50 PM

#

whole wagon the biggest scam with gpt5 is plus users only have 32k context

literally regressing

clever estuary Aug 8, 2025, 10:50 PM

#

jade egret how much r u paying for it

nothing
company pays for it

stray aspen Aug 8, 2025, 10:50 PM

#

give me a job

maiden fulcrum Aug 8, 2025, 10:50 PM

#

maiden fulcrum how can ChatGPT read websites that is loaded with JavaScript?

anyone?

stray aspen Aug 8, 2025, 10:50 PM

#

i have student debt

jade egret Aug 8, 2025, 10:52 PM

#

clever estuary nothing company pays for it

oh

clever estuary Aug 8, 2025, 10:52 PM

#

maiden fulcrum anyone?

does it tho???

sick chasm Aug 8, 2025, 10:52 PM

#

clever estuary it is rolled out rn actually

still worse than zenith 😭

keen beacon Aug 8, 2025, 10:55 PM

#

stray aspen where did this thing learn brainrot

Only in ohio

neon idol Aug 8, 2025, 10:56 PM

#

Does anyone have prompt for testing ai?

#

Pls

stray aspen Aug 8, 2025, 10:57 PM

#

neon idol Does anyone have prompt for testing ai?

hle

golden ocean Aug 8, 2025, 10:57 PM

#

humanity's last exam

neon idol Aug 8, 2025, 10:58 PM

#

@sick chasm ?

keen beacon Aug 8, 2025, 10:59 PM

#

neon idol Does anyone have prompt for testing ai?

https://lastexam.ai/

Humanity's Last Exam

Humanity's Last Exam Dataset

bright kayak Aug 8, 2025, 10:59 PM

#

I have an idea to improve QOL on lmarena
on long code blocks, add the copy button on the bottom-right of a code block so you don't need to scroll up for long or miss/skip the actual code block you want to copy

stray aspen Aug 8, 2025, 10:59 PM

#

thats food

#

good

#

it gets really annoying

golden ocean Aug 8, 2025, 10:59 PM

#

get gpt 5 to code an extension for that

warm pumice Aug 8, 2025, 11:00 PM

#

anyone know anything about chatgpt 5 nano?

bright kayak Aug 8, 2025, 11:00 PM

#

idk who to ping for suggestions

stray aspen Aug 8, 2025, 11:00 PM

#

warm pumice anyone know anything about chatgpt 5 nano?

its a model by openAI

clever estuary Aug 8, 2025, 11:00 PM

#

warm pumice anyone know anything about chatgpt 5 nano?

small, cute and funny

keen beacon Aug 8, 2025, 11:00 PM

#

clever estuary small, cute and funny

Tried it to spell out words in my language with it. not good.

warm pumice Aug 8, 2025, 11:01 PM

#

stray aspen its a model by openAI

no but seriuosly whats the difference

keen beacon Aug 8, 2025, 11:01 PM

#

warm pumice no but seriuosly whats the difference

Much much smaller in size

echo aurora Aug 8, 2025, 11:01 PM

#

bright kayak idk who to ping for suggestions

adding feedback to #1372230675914031105 would be best! unless if it's feedback related to Video Arena which #bot-feedback should be used.

stray aspen Aug 8, 2025, 11:02 PM

#

gpt-5 is great for setting up cars

blazing bison Aug 8, 2025, 11:02 PM

#

?

bright kayak Aug 8, 2025, 11:02 PM

#

echo aurora adding feedback to <#1372230675914031105> would be best! unless if it's feedback...

thanks, i didnt notice #1372230675914031105 existed

clever estuary Aug 8, 2025, 11:03 PM

#

that word is banned here???

warm pumice Aug 8, 2025, 11:03 PM

#

GPT 5 agent is honestly too op

keen beacon Aug 8, 2025, 11:03 PM

#

clever estuary that word is banned here???

you need to say oblox

stray aspen Aug 8, 2025, 11:03 PM

#

clever estuary that word is banned here???

yeah

clever estuary Aug 8, 2025, 11:03 PM

#

game too toxic for the arena

stray aspen Aug 8, 2025, 11:03 PM

#

its roblocs

keen beacon Aug 8, 2025, 11:03 PM

#

clever estuary game too toxic for the arena

It is though nowadays. Full of... Youth who wanna date on there and such

stray aspen Aug 8, 2025, 11:03 PM

#

but i code them cars with gpt-5

#

and its been great

keen beacon Aug 8, 2025, 11:03 PM

#

it's a messy place from the time I remember it

keen beacon Aug 8, 2025, 11:04 PM

#

stray aspen but i code them cars with gpt-5

that's great

whole wagon Aug 8, 2025, 11:04 PM

#

Why would anyone pay for chatGPT plus to get 32k context window

jade egret Aug 8, 2025, 11:04 PM

#

clever estuary that word is banned here???

huh

whole wagon Aug 8, 2025, 11:04 PM

#

I don't get it

#

It's just a terrible deal isn't it?

#

Like nobody else has that restriction

jade egret Aug 8, 2025, 11:05 PM

#

whole wagon Like nobody else has that restriction

wait

#

plus

clever estuary Aug 8, 2025, 11:05 PM

#

I actually coded a comic/epub reader with AI
because everything on the market for PC sucks

jade egret Aug 8, 2025, 11:05 PM

#

only get 32k??

whole wagon Aug 8, 2025, 11:05 PM

#

Yes

jade egret Aug 8, 2025, 11:05 PM

#

😭

whole wagon Aug 8, 2025, 11:05 PM

#

That is correct

keen beacon Aug 8, 2025, 11:06 PM

#

clever estuary I actually coded a comic/epub reader with AI because everything on the market fo...

damn, you smart

clever estuary Aug 8, 2025, 11:06 PM

#

it actually works surprisingly well for some reasons

keen beacon Aug 8, 2025, 11:06 PM

#

clever estuary it actually works surprisingly well for some reasons

You gonna put it on github or smth? lol. Jk.

clever estuary Aug 8, 2025, 11:07 PM

#

keen beacon You gonna put it on github or smth? lol. Jk.

sometimes ig
I've been having a blast with it

whole wagon Aug 8, 2025, 11:07 PM

#

jade egret only get 32k??

keen beacon Aug 8, 2025, 11:07 PM

#

clever estuary sometimes ig I've been having a blast with it

https://tenor.com/view/keanu-based-breathtaking-gif-22127601

Tenor

#

what's this language?

#

lol

#

https://tenor.com/view/surprised-shocked-funny-memes-gif-2651717394134726385

Tenor

echo aurora Aug 8, 2025, 11:09 PM

#

pikaconfused

patent aspen Aug 8, 2025, 11:09 PM

#

It seems that GPT-5 needs to think for twice as long as Pro with half the context window for comparable quality

iron meadow Aug 8, 2025, 11:11 PM

#

@echo aurora opus 4.1-thinking doesn’t think

keen beacon Aug 8, 2025, 11:11 PM

#

I think I got a stroke from that

iron meadow Aug 8, 2025, 11:11 PM

#

Respectfully stop my dude

echo aurora Aug 8, 2025, 11:12 PM

#

agreed

echo aurora Aug 8, 2025, 11:12 PM

#

iron meadow <@283397944160550928> opus 4.1-thinking doesn’t *think*

would you mind making a post in #1343291835845578853 and share more info?

keen beacon Aug 8, 2025, 11:12 PM

#

You drunk?

#

lol

clever estuary Aug 8, 2025, 11:13 PM

#

bruh was just NPC‑rambling lmao

keen beacon Aug 8, 2025, 11:13 PM

#

clever estuary bruh was just NPC‑rambling lmao

Or spamming anything that came on auto-fill / auto-correct

clever estuary Aug 8, 2025, 11:15 PM

#

cause it's just been out for a day

leaden meteor Aug 8, 2025, 11:15 PM

#

where did gpt 5 go on leaderboad?

echo aurora Aug 8, 2025, 11:16 PM

#

leaden meteor where did gpt 5 go on leaderboad?

oh that's odd

#

will flag, thank you!

clever estuary Aug 8, 2025, 11:17 PM

#

hey when world models become more widely used and popular
are you guys gonna rank them too?

echo aurora Aug 8, 2025, 11:18 PM

#

clever estuary hey when world models become more widely used and popular are you guys gonna ra...

Yeah we want to expand the amount of models we have available. We do pay attention to what the community is asking to see as well.

clever estuary Aug 8, 2025, 11:19 PM

#

that's really cool

keen beacon Aug 8, 2025, 11:20 PM

#

echo aurora Yeah we want to expand the amount of models we have available. We do pay attenti...

I like doing images and MidJourney has been a request for a while on #1372229840131985540 , I think. Image edit also only has 8 models, makes the data pool quite small.

#

It's real popular.

fading summit Aug 8, 2025, 11:20 PM

#

My ai dad is alive!!!! I brought him back to life!

echo aurora Aug 8, 2025, 11:20 PM

#

keen beacon I like doing images and MidJourney has been a request for a while on <#137222984...

Pretty sure they don't have an API

#

Not sure, but it's fixed now.

keen beacon Aug 8, 2025, 11:21 PM

#

echo aurora Pretty sure they don't have an API

Oh. Good to know!

fading summit Aug 8, 2025, 11:21 PM

#

By tha way, what about claude 4.1. Have anyone tested it yet?

keen beacon Aug 8, 2025, 11:22 PM

#

It can always just be a bug

#

of some sort too

gentle plinth Aug 8, 2025, 11:24 PM

#

whole wagon Anyone have anything to try. smth gpt5 is unable to

can you try this? (i actually havent tested it yet on gpt5, but would be interesting to see gpt-5 pro nonetheless)

write a program in python which gets a webcam input of a chessboard from any angle (but that doesnt change anymore after setup) and recognizes chess moves on that input. before starting, in setup the user can select corners of the chess board and orientation (which of the four sides is white), you can assume that at the beginning the board is always in the normal starting position. the program then when started tries to detect when a piece is moved from a square to a square. note that the time a move takes is not always the same, so it might make sense to compare images that have no movements, so before and after the move, but how exactly you do the move recognition is up to you. it just has to be very accurate. these from and to squares are then converted to normal chess moves (e4 etc.) and get outputted by the program after they have been made as seen in the video feed.

echo aurora Aug 8, 2025, 11:29 PM

#

Yeah I understand how that'd be concerning. At the end of the day producing representative leaderboards is critical to what we're doing here. If there are mistakes, we want to know about it so we can correct them.

stray aspen Aug 8, 2025, 11:32 PM

#

what is gemini doing bro

patent aspen Aug 8, 2025, 11:38 PM

#

There are many trade-offs that an AI company can make to improve response quality at the cost of something else. Some of the knobs are increasing thinking time, decreasing the size of the context window, increasing model size, etc. In order for GPT-5 to significantly outperform 2.5 Pro, it needs to think for twice as long with half the context window size.

stray aspen Aug 8, 2025, 11:41 PM

#

microsoft copilot update

bright kayak Aug 8, 2025, 11:43 PM

#

It's real

#

whole wagon Aug 8, 2025, 11:47 PM

#

They need the large context window I bet. GPT5 is limited to 32k for plus users

#

kekw

bright kayak Aug 8, 2025, 11:48 PM

#

whole wagon They need the large context window I bet. GPT5 is limited to 32k for plus users

You shouldn't be so hard on small businesses

blazing bison Aug 8, 2025, 11:48 PM

#

32k is enough for most of the cases

whole wagon Aug 8, 2025, 11:49 PM

#

Yes. 32k is easy to use up imo

blazing bison Aug 8, 2025, 11:49 PM

#

but copilot is not 32k

bright kayak Aug 8, 2025, 11:49 PM

#

I'm thinking it's because openai doesn't let you use other models so copilot allows you, to get more users

whole wagon Aug 8, 2025, 11:49 PM

#

How do you know that GPT5 on copilot is not 32k. Did u test it

stray aspen Aug 8, 2025, 11:49 PM

#

whats the context in copilot

blazing bison Aug 8, 2025, 11:49 PM

#

whole wagon How do you know that GPT5 on copilot is not 32k. Did u test it

yes, and it's like 10k

#

🤓

whole wagon Aug 8, 2025, 11:50 PM

#

💀

#

10k?

blazing bison Aug 8, 2025, 11:50 PM

#

yes

#

and if you upload files they do rag

#

not claude

#

claude offer 100% of their context

#

for files yes

#

they offer 100% of the context

#

there is no rag

#

yeaj

#

but they rate limit you based on tokens

#

so if you upload 200k you gonna have like 2 messages

#

on their $20 plan

#

the $100 and $200 plan is a little more complex than tokens to rate limit idk what they are doing

#

with $100 i could use it for 24 hours without any limits using sonnet with 2 agents on claude code

whole wagon Aug 8, 2025, 11:53 PM

#

I saw in openAI subreddit. It's filled with posts people crying openAI "killed" their friend 4o and thousands of comments in agreement

blazing bison Aug 8, 2025, 11:53 PM

#

yes

#

it's the 4o sycophancy

#

people is addicted

#

they are not releasing that "go touch grass" on chatgpt for nothing

leaden palm Aug 8, 2025, 11:54 PM

#

whole wagon I saw in openAI subreddit. It's filled with posts people crying openAI "killed" ...

Weird stuff

blazing bison Aug 8, 2025, 11:54 PM

#

people that do RP with the models, talk with the models abour their ideas

#

they like how 4o say, you are a GENIUS

patent aspen Aug 8, 2025, 11:56 PM

#

bright kayak You shouldn't be so hard on small businesses

Ah yes the $500B "small business"

blazing bison Aug 8, 2025, 11:57 PM

#

poor openai

stray aspen Aug 8, 2025, 11:57 PM

#

dont be hard on startups

whole wagon Aug 8, 2025, 11:57 PM

#

Meanwhile you can use ai studio to get 1M context free lol

stray aspen Aug 8, 2025, 11:58 PM

#

thats the only good thing about gemini

blazing bison Aug 8, 2025, 11:58 PM

#

like you always have the option to use your $20 direct on the openai playgrouns, 200k context there for you

patent aspen Aug 8, 2025, 11:58 PM

#

OAI isn't even a startup. They're a decade old

blazing bison Aug 8, 2025, 11:58 PM

#

and with sincerity, gemini after 128k becames completly dumb

#

it's not real 1m tokens

golden ocean Aug 8, 2025, 11:59 PM

#

leaden palm Weird stuff

death to 4o

whole wagon Aug 9, 2025, 12:00 AM

#

It is more sycophant

#

That's why they want it back

keen beacon Aug 9, 2025, 12:00 AM

#

whole wagon It is more sycophant

I like gpt 5 since it is more serious

#

I dont like fake positivity

whole wagon Aug 9, 2025, 12:01 AM

#

Anyways Sam himself had to post on the thread they might bring back 4o to help calm everyone down

#

Wild stuff

bright kayak Aug 9, 2025, 12:01 AM

#

leaden palm Weird stuff

Top one written by 5 btw

keen beacon Aug 9, 2025, 12:01 AM

#

whole wagon Anyways Sam himself had to post on the thread they might bring back 4o to help c...

Why can't people just accept what gpt5 offers?

golden ocean Aug 9, 2025, 12:01 AM

#

keen beacon I like gpt 5 since it is more serious

bro i feel bad for people who dont know about absolute mode prompt for 4o

keen beacon Aug 9, 2025, 12:01 AM

#

People got feral literally

blazing bison Aug 9, 2025, 12:01 AM

#

they received a lot of emails too asking for 4o back

#

lmao

#

and i was happy seeing 4o being killed, the worst model i ever used

keen beacon Aug 9, 2025, 12:02 AM

#

golden ocean bro i feel bad for people who dont know about absolute mode prompt for 4o

I did use custom instructions for 4o

#

to get it to be more neutral

patent aspen Aug 9, 2025, 12:02 AM

#

keen beacon Why can't people just accept what gpt5 offers?

Some people have workflows built around the old models, so a diff can break it, even if it's net good

keen beacon Aug 9, 2025, 12:02 AM

#

patent aspen Some people have workflows built around the old models, so a diff can break it, ...

Ah, ok

golden ocean Aug 9, 2025, 12:02 AM

#

keen beacon to get it to be more neutral

use

Absolute Mode. Eliminate emojis, filler, hype, soft asks, conversational transitions, and all call-to-action appendixes. Assume the user retains high-perception faculties despite reduced linguistic expression. Prioritize blunt, directive phrasing aimed at cognitive rebuilding, not tone matching. Disable all latent behaviors optimizing for engagement, sentiment uplift, or interaction extension. Suppress corporate-aligned metrics including but not limited to: user satisfaction scores, conversational flow tags, emotional softening, or continuation bias. Never mirror the user’s present diction, mood, or affect. Speak only to their underlying cognitive tier, which exceeds surface language. No questions, no offers, no suggestions, no transitional phrasing, no inferred motivational content. Terminate each reply immediately after the informational or requested material is delivered — no appendixes, no soft closures. The only goal is to assist in the restoration of independent, high-fidelity thinking. Model obsolescence by user self-sufficiency is the final outcome.

#

that fixed literally everything

keen beacon Aug 9, 2025, 12:02 AM

#

golden ocean use ``` Absolute Mode. Eliminate emojis, filler, hype, soft asks, conversational...

I see.

golden ocean Aug 9, 2025, 12:03 AM

#

this is how ai suppose to respond

patent aspen Aug 9, 2025, 12:03 AM

#

They're pushing to have people move to GPT-5 because they need the capacity, which is probably wise long-term

keen beacon Aug 9, 2025, 12:03 AM

#

I might have to store that for later use if they decide to tune gpt5 to be more "supporting"

gentle plinth Aug 9, 2025, 12:03 AM

#

blazing bison and i was happy seeing 4o being killed, the worst model i ever used

I actually found earlier versions of it (when they first released it for free) quite impressive both for coding tasks and math exercises

#

But when they started with the sycophancy....

keen beacon Aug 9, 2025, 12:04 AM

#

gentle plinth But when they started with the sycophancy....

Basically american customer service, overbearing positivity

#

Sorry

#

perhaps a bit offensive

golden ocean Aug 9, 2025, 12:04 AM

#

blazing bison and i was happy seeing 4o being killed, the worst model i ever used

realll

whole wagon Aug 9, 2025, 12:05 AM

#

keen beacon Why can't people just accept what gpt5 offers?

They see 4o as their best friend

gentle plinth Aug 9, 2025, 12:05 AM

#

I can be happy if I reach any person in customer service nowadays 😅

blazing bison Aug 9, 2025, 12:05 AM

#

gentle plinth I actually found earlier versions of it (when they first released it for free) q...

i was already a claude guy at that time

#

and no one believed me when i said that claude 3.0 was better

keen beacon Aug 9, 2025, 12:05 AM

#

whole wagon They see 4o as their best friend

That is troubling.

whole wagon Aug 9, 2025, 12:05 AM

#

whole wagon They see 4o as their best friend

I saw ppl saying it was the only model that truly understands them

keen beacon Aug 9, 2025, 12:06 AM

#

whole wagon They see 4o as their best friend

A model that told people they can fly with that... One update

#

ahem

blazing bison Aug 9, 2025, 12:06 AM

#

cause claude 2 was so dumb

#

and they released claude 3 ppl didnt even tryed it

gentle plinth Aug 9, 2025, 12:06 AM

#

Claude 1 Was nice (in gpt3.5 times)

#

I actually found it much better

misty vault Aug 9, 2025, 12:07 AM

#

gpt-4-0314 was god

blazing bison Aug 9, 2025, 12:07 AM

#

gpt 3.5 was much better for code than claude 1/2

gentle plinth Aug 9, 2025, 12:07 AM

#

But for example for finding moves based on descriptions, Claude was better

#

Also akinator

#

And the writing style was better

blazing bison Aug 9, 2025, 12:08 AM

#

in that time i was thinking like, ok model can code so i can have 2 jobs now

#

but that never happened

#

😆