#general

1 messages ยท Page 90 of 1

blazing bison
#

even today models can do like 20% of my job

leaden palm
#

some say that when you spend enough time with a model you begin to imitate it, even when you aren't using it

#

(see _opencv_ on x for instances of this)

misty vault
#

1 year after talking with sydney

leaden palm
misty vault
#

yes

bright kayak
blazing bison
#

it's interesting

leaden palm
inner gate
#

Whatโ€™s gpt oss?? How does it compare to gpt5?

blazing bison
#

i don't know now if my experience with sydney was so good like in memory

bright kayak
keen beacon
whole wagon
#

Those benchmarks are not representative of real world performance

#

Be warned lol

leaden palm
#

oh i saw an interesting thread about gpt oss, let me pull it up

keen beacon
leaden palm
#

it's truly fried

patent aspen
#

Talking about OpenAI as some poor little startup that we shouldn't be so hard on for needing a small context window and long thinking times to benchmaxx is the weirdest complex

bright kayak
inner gate
#

Thanks

whole wagon
#

Simple bench guy had a funny story. When he posted gpt oss result openAI guys contacted him to rerun it and then they got an even worse result

#

He talked about it in latest vid

#

Titled gpt5

tulip kindle
#

Where GPT-5?

whole wagon
#

It is there

#

It is just not the top

#

So it's hard to see

bright kayak
#

Better question is wen gpt6

cosmic salmon
bright kayak
tulip kindle
whole wagon
#

Pay the $20 for a non transparent router and 32k context window

#

Great deal

bright kayak
#

Same for me

#

Gpt 5 too expensive but grok 4 is fine

tulip kindle
#

Just strange if one time they get GPT-5 and after that delete

whole wagon
#

Well they couldn't keep gpt5 there because it's literally better than chatGPT plus for free then

#

Because you are always on the full model when selected in LM arena

#

Feels like plus tier regressed so much....

#

At least you could select o3

cosmic salmon
echo aurora
patent aspen
#

Could also just be capacity issues

clear spear
#

wait where did gpt-5 go?

patent aspen
#

Oh I thought it was chatgpt5

#

Nvm

clear spear
#

WHY'D THEY DELETE IT!?

echo aurora
#

It's coming back

clear spear
#

oh okay ๐Ÿ˜Š

cosmic salmon
#

Maybe GPT-5 were the LLMs we chatted with along the way...

thin creek
#

Okay

misty vault
#

Maybe the GPT-5 was the friends we made along the way

misty vault
whole wagon
#

But why buy plus tier when GPT5 on lmarena is better than the router

thin creek
#

Is this Claude 4.1?

whole wagon
#

Yes

clear spear
#

kinda upset to why the models can't use search actively

gentle plinth
clear spear
#

all my data?

cosmic salmon
whole wagon
#

Ngl. People are still going to be putting personal info into LM arena they should at least set up a filter

whole wagon
#

Before publishing the data

clear spear
#

so even deleted chats?

cosmic salmon
blazing bison
blazing bison
clear spear
#

I ain't got no money to pay for router

#

will someone give me their card info pretty please

cosmic salmon
echo aurora
clear spear
cosmic salmon
clear spear
echo aurora
blazing bison
#

lmao

cosmic salmon
#

welp, an attempt was made

clear spear
blazing bison
#

gpt-5 killed

#

rip

echo aurora
cosmic salmon
floral comet
clear spear
#

you dare to delete my message inferior being

stray aspen
#

its llama 1.1

keen beacon
#

@clear spear ay

#

Why is your pfp a

echo aurora
#

it's a dog

cosmic salmon
#

it's a dog

clear spear
#

furrypepe

#

OMG

#

IT'S HUGE

echo aurora
#

stap

keen beacon
#

Itโ€™s Ronaldo

#

suii

clear spear
#

cause I sure do ๐Ÿ˜„

keen beacon
echo aurora
#

kk back to AI topics please!

clear spear
keen beacon
#

GPT-5 moment.

cosmic salmon
clear spear
#

yeah I want it back NOW!

#

please

keen beacon
#

Itโ€™s bad

clear spear
keen beacon
cosmic salmon
clear spear
#

wait does gpt-5 on lmarena have the ability to use specific models for different tasks?

#

@echo aurora

cosmic salmon
keen beacon
cosmic salmon
#

Unless you mean different experts, but we don't know if GPT-5 is MoE or not

keen beacon
#

We all know it

#

Why is GPT-5 gone from LMArena

cosmic salmon
#

They killed it

#

Trying to revive it now

clear spear
#

currently in the bahamas

keen beacon
#

Bruh

misty vault
clear spear
#

is

#

that

#

about

cosmic salmon
maiden fulcrum
#

where is gpt-5

#

i cant see it on direct chat

clear spear
cosmic salmon
clear spear
#

WAIT 5 WEEKS

echo aurora
maiden fulcrum
clear spear
inner gate
echo aurora
inner gate
#

Must be why

misty vault
#

In the meantime u guys will have to use @deep adder as replacement for gpt-5

#

temporary

maiden fulcrum
#

I dont like trolls

clear spear
inner gate
#

Is that what trolls are

#

๐Ÿค”

keen beacon
#

Is there a limit to LMArena models? Iโ€™m a new to these kind of stuff

cosmic salmon
maiden fulcrum
#

@echo aurora is gpt-5 on lmarena using (high)?

cosmic salmon
inner gate
#

You should ask pineapple for more details

cosmic salmon
stray aspen
#

wheres gpt-5 normal

maiden fulcrum
#

@echo aurora I cant delete my history chat. It always reappears

cosmic salmon
echo aurora
keen beacon
inner gate
#

I thought I hit the rate limit for gpt5 when it stopped working lol

#

Then it got removed so I thought it was an error

echo aurora
#

But I will raise to the team if that's something we want to start doing

misty vault
#

crack bench

keen beacon
inner gate
echo aurora
stray aspen
#

craigbench

inner gate
#

I see

echo aurora
inner gate
#

Or is it the general error message

maiden fulcrum
#

@echo aurora I am still seeing the bug

echo aurora
inner gate
#

Thatโ€™s what I did

#

And whatever chats get removed from the history

echo aurora
inner gate
#

Also gets removed on the website

echo aurora
#

let me know if it's still there

inner gate
keen beacon
#

I really wonder how the rate limit resets if thereโ€™s no account needed

maiden fulcrum
#

@echo aurora I cant see it on Edge

cosmic salmon
keen beacon
echo aurora
pseudo magnet
#

wheres gpt-5 ๐Ÿ˜ž

inner gate
pseudo magnet
#

it will be back??

echo aurora
inner gate
#

Pineapple and his team wants you to have a great experience

echo aurora
#

yes! it will be back

inner gate
#

Yes it will

#

๐Ÿ˜‰

keen beacon
#

I will not abuse it of course, Iโ€™m just curious

maiden fulcrum
#

it wont go

inner gate
pseudo magnet
#

thanks @inner gate and @echo aurora

inner gate
#

Todayโ€™s history

#

Try after that

#

After deleting ur history also close ur tab

#

So it refreshes

#

Then check again

keen beacon
inner gate
#

It might be because of OCD

maiden fulcrum
#

shared?

keen beacon
#

???

inner gate
#

I usually delete the chats I donโ€™t use (I donโ€™t have OCD) it just looks cleaner

#

I like it that way

cosmic salmon
inner gate
keen beacon
misty vault
inner gate
#

Cus itโ€™s free but not

keen beacon
inner gate
#

I thought so

cosmic salmon
#

If the product is free, then maybe we're the product... ๐Ÿ˜”

inner gate
#

Our data is the oroduct

#

Lol

keen beacon
#

Who cares.

inner gate
#

I donโ€™t to be honest

keen beacon
#

Itโ€™s not like Iโ€™m gonna use more than 5-10 messages per chat.

inner gate
#

Lol

#

I sometimes ask them for car tunes and compare answers

keen beacon
#

I mostly use Opus for help when codex is too dumb.

whole wagon
#

4o coming back confirmed kekw

inner gate
#

My Claude 4 opus usually stops working after a few messages

#

I havenโ€™t used it recently

#

Maybe itโ€™s been fixed

keen beacon
#

Elon said Grok 5 is coming before the end of the year

cosmic salmon
#

Opus is nice for agentic coding imo, but for fine-grade adjustments o3 is better

inner gate
#

Gemini

keen beacon
whole wagon
#

All this benchmarks don't matter kek

inner gate
cosmic salmon
inner gate
#

U can tell it u murdered someone and itโ€™ll give you excuses on why you had no choice

keen beacon
cosmic salmon
inner gate
#

What grok4 model is used on LM arena?

keen beacon
#

Who cares, itโ€™s not like they donโ€™t already know all my weaknesses

inner gate
keen beacon
#

Yeah.

inner gate
#

I did not know that

keen beacon
#

Look, Iโ€™m gonna show you.

inner gate
#

Go ahead

atomic adder
#

I swear gpt 5 just disappeared

cosmic salmon
#

Please don't swear

keen beacon
# inner gate Go ahead

When I press that i it shouldโ€™ve show me the limits but itโ€™s stupid on mobile, so here are the limits.

atomic adder
#

im betting gpt 5 just disappeared lol

keen beacon
#

Ignore its gpt4.

#

The real models are here.

atomic adder
keen beacon
atomic adder
keen beacon
jade egret
#

lowkey it kinda fun talking with gemini 2.5 pro

stray aspen
#

i love gpt-5

atomic adder
keen beacon
misty vault
#

girlfriend simulator

#

oh wait... that's sydney

cosmic salmon
atomic adder
keen beacon
#

Or at least one, Iโ€™m curious about the models

jade egret
misty vault
#

no

unborn lantern
#

where is gpt 5? Lol

keen beacon
#

ASI

#

Be amazed

unborn lantern
#

Not in lmarena

jade egret
#

gemini tweaking

keen beacon
#

Ah

keen beacon
jade egret
#

prob role playing lol

stray aspen
#

why is gemini hallucinating apache 2.0 licenses and copyright crap in its python scripts

jade egret
#

his prob

keen beacon
#

You gave him PTSD

jade egret
#

and btw

#

this is the rest of the message

vale estuary
#

What happened with GPT-5 ?

jade egret
keen beacon
stray aspen
vale estuary
#

Its not there in the site anymore

jade egret
jade egret
sacred quail
keen beacon
jade egret
jade egret
#

my gemini is high

#

i will name the chat "high gemini"

cosmic salmon
#

that's what happens when your reasoning effort is too high lol

inner gate
keen beacon
#

Anyone played with verbosity parameter?

#

From my quick research itโ€™s just a yap meter

sacred quail
#

4o lovers would love the current gemini btw. Praising you for doing breathe

keen beacon
#

Wonder if it makes any difference in code

#

Beside the explanations

stray aspen
#

@keen beacon

vale estuary
#

GPT-5 isn't there anymore?

cosmic salmon
keen beacon
stray aspen
#

its roblocks

vale estuary
keen beacon
#

Roadblocks

inner gate
cosmic salmon
wheat onyx
stray aspen
wheat onyx
#

so it's beyond using the wrong model?

#

ah

buoyant dust
#

where is gpt5

cosmic salmon
inner gate
#

๐–จ ๐—Œ๐–พ๐–พ

wheat onyx
#

but that was pervasive. it works for me, and it worked for others using nano

keen beacon
inner gate
#

๐–ณ๐—๐–บ๐—‡๐—„๐—Œ๐—‡๐—†๐–บ๐—‡

keen beacon
#

New strawberry problem

#

Same I get a satisfaction

stray aspen
#

lmao

inner gate
#

๐Ÿฅฒ

stray aspen
#

if it got it right it means this is all a government conspiracy

keen beacon
#

I donโ€™t know what that 7 free evals does.

wheat onyx
#

i've heard that not adding the extra 0 decimal can cause issues (but shouldnt)

#

5.90-5.11 works, even though 5.9-5.11 should always work too

keen beacon
inner gate
keen beacon
#

The X is shy

stray aspen
#

got it wrong

inner gate
#

Mahbe it means I get gpt pro for free

#

7 days a week

keen beacon
wheat onyx
inner gate
#

Wait

wheat onyx
#

theres nano, mini, normal

inner gate
#

If it lets me evaluate models for free

#

7 times

#

Idk

keen beacon
#

@wheat onyx see

#

5 gets it wrong.

wheat onyx
#

and you don't know which it is

stray aspen
#

it does it right

keen beacon
#

Is this Claude?

wheat onyx
#

no

keen beacon
#

Or Copilot

wheat onyx
#

web gpt

stray aspen
#

its copilot

keen beacon
#

I see

stray aspen
#

yeah it works

azure sage
#
poll_question_text

the best model

victor_answer_votes

6

total_votes

9

victor_answer_id

3

victor_answer_text

gemini-2.5-pro

wheat onyx
#

they really have to fix the autorouter

keen beacon
#

AGI when

#

You know what

#

I want ASI

#

I want to focus on myself more

#

Let the AI do everything

prime mulch
#

I also want asi

stray aspen
wheat onyx
#

also something they have to fix - if you have a REALLY long conversation, it basically crashes

clear spear
#

WHY IS MY GPT 5 NOT BACK YET

tidal ginkgo
#

why is gpt-5 gone

frail ibex
#

Same here. No GPT-5. Just mini and nano

tidal ginkgo
#

maybe bcz of costs

slow sail
#

too much use?

tidal ginkgo
#

maybe

#

used it a lot here

slow sail
#

i guess pineapple can say

storm needle
tidal ginkgo
#

GUYZ WHY IS MY BEST FRE MODAL NOT HERE YATTTTT!!1!1111111!!!!! I WANT FRE STUF!!!11!!1

keen ferry
#

too much hype around gpt 5 and too many people use it without actually using the actual vote feature id say

echo aurora
#

Yeah GPT-5 isn't available atm

slow sail
#

dont just ping them man

echo aurora
#

team is looking into

tidal ginkgo
#

ok

tidal ginkgo
floral comet
#

There's o3 now

#

I noticed it's a lot faster now, do the performance degrade?

vital lake
mental fern
#

It takes too long to delete a chat. The deletion doesn't seem to complete until the page redirects to the homepage. Also, if I try to delete multiple chats at once, the process often fails and none of them get deleted. Is anyone else experiencing this?

solid brook
#

They are trying to fix it

proper spindle
#

Hey guys. Where is GPT-5?

wet sparrow
tidal ginkgo
#

lol

#

btw it back

woven turret
#

we have gpt-5 and gpt-5 chat?

stray aspen
#

gpt-5-chat is live

stray aspen
#

yeah

woven turret
random wolf
#

everyone I need help. why is it always saying "Something went wrong with this response, please try again."???

blazing bison
#

try again some hours later

tidal ginkgo
#

gpt-5 chat???

#

what is even the difference?

blazing bison
#

this is the gpt-5 in chatgpt

#

this model is finetuned for chat

tidal ginkgo
#

oh so the gpt-5 is more "pro"

blazing bison
#

yes

#

no

#

actually

tidal ginkgo
#

?

blazing bison
#

one is more nicer with the answers the other no

tidal ginkgo
#

ok? still donยดt understand lol

blazing bison
#

one gonna say, nice, you are smart, good job

#

the other not

tidal ginkgo
#

ok

#

so chat is nicer lol

blazing bison
#

yes

tidal ginkgo
#

or does it only have more temp?

tidal ginkgo
blazing bison
tidal ginkgo
#

ok

blazing bison
#

finetuned for chat

errant thorn
#

which is smarter

tidal ginkgo
#

same ig

blazing bison
#

the difference is clear

tidal ginkgo
#

oh

blazing bison
#

just the format of the answer change

tidal ginkgo
#

just dumbed down

#

that

blazing bison
#

i would use gpt -5-chat more btw

#

for day to day things

tidal ginkgo
#

for coding im using gpt-5

blazing bison
#

yes

#

for coding gpt-5

tidal ginkgo
#

for everyday chat i use gpt-5-chat

blazing bison
#

yeah

#

exactly

tidal ginkgo
#

cool

blazing bison
#

but if you are studying code

#

gpt-5-chat is better

tidal ginkgo
#

yeah

#

havenยดt used it much so

#

just testing itยดs coding skills

#

wait

blazing bison
#

claude sonnet still better coder btw

tidal ginkgo
#

is there a gpt-5-thinking?

tidal ginkgo
blazing bison
#

no

tidal ginkgo
#

oh

#

ok

blazing bison
#

wait

#

the no is for gpt-5 thinking

tidal ginkgo
#

oh

#

ok

blazing bison
#

about opus being better than sonnet, in benchs it is

#

test for yourself

tidal ginkgo
#

ok

#

lemme check

woven turret
#

is this gpt-5 still (high)

tidal ginkgo
#

idk

solid brook
#

Gpt 5 chat does not reason i think

woven turret
#

oh, my simple technical question show that:

  1. gpt-5(lmarena) = gpt-5 thinking (chatgpt+) >= new gpt-5 (lmarena)
  2. gpt-5-chat = gpt-5(plus)
  3. gpt-5 in copilot has limit context
tidal ginkgo
#

there is a thinking

solid brook
#

Guys the gpt 5 thinking is only in chatgpt website

#

In api there is reason effort

woven turret
#

i dont know, but gpt-5 on lmarena have great outputs, better than gpt-5 (plus user) and copilot, oc in my case

tidal ginkgo
#

wait

#

copilot has gpt-5?

woven turret
#

yeah sure

#

it randomly use thinking and quick answer

tidal ginkgo
woven turret
#

lmarena team did well

tidal ginkgo
#

is there a limit?

tidal ginkgo
woven turret
tidal ginkgo
#

btw someone do a bench gpt 5 and gpt 5 chat

woven turret
#

the limit is its output, maybe

tidal ginkgo
solid brook
tidal ginkgo
#

ok

solid brook
#

@echo aurora Context window limit is 400k for gpt 5 right?

tidal ginkgo
#

god yall pinging admins lol

mental fern
#

Is GPT-5 always โ€œGeneratingโ€ฆโ€?

woven turret
#

it is thinking, maybe, lol

mental fern
#

maybe my question is quite hard, thx

blazing bison
#

both answer at the same speed

woven turret
#

Yeah, my gpt-5 on plus account didnt think in my simple question test

mental fern
#

official description

blazing bison
#

atleast on lmarena

#

2 seconds for first token for both

#

no matter what the prompt

#

so are both thinking o both non thinking

mental fern
#

it looks like the gpt-5 on chatgpt.com automatically selects thinking or no-thinking

ocean fulcrum
#

Don't be confused, start super grok for sometime, then turned to gpt 5 you can clearly see what others can't

ocean fulcrum
#

Is it takes millions for subscription

solid brook
#

It is 300 doolers a month

jade egret
#

google stock doing pretty good today lol

blazing bison
#

tested on api playground, both reason

solid brook
#

Gemini gemini gemini

tidal ginkgo
mental fern
#

Mmm i refreshed the page and it appeared.

fallow trench
#

how to use veo3?

tidal ginkgo
#

but itยดs veo 3 fast

fallow trench
#

ok

#

thanks

tidal ginkgo
#

maybe lots of people using it at a time?

stray aspen
#

just use "think extremely hard"

#

at the start of you rprompt

#

yupp.ai is also a great website for trying gpt-5

#

but its limited

#

but overall microsoft copilot has hbecome a great option for gpt-5

#

in my opinion

mental fern
#

I don't think gpt-5 and gpt-5-chat are equally smart.

jade egret
#

why does gemini keep talking aboitu the current time

jade egret
#

pi + e?

mental fern
#

TREE(2) is actually 3

jade egret
#

i googled it ๐Ÿ™ƒ

stray aspen
#

isnt gpt 5 chat a router

mental fern
#

I tried switching it to Thinking, but failed.

inner gate
#

Whays the difference between gpt5 and gpt5 chat?

#

Ahhhhh that makes more sense thanks lol

patent aspen
stray aspen
#

The open ai website itself says it

solid brook
tidal ginkgo
#

oh ok

haughty tangle
solid brook
hoary elbow
#

What is the difference between GPT five and GPT five chat?

solid brook
#

Use that

tidal ginkgo
tidal ginkgo
hoary elbow
#

All right

tidal ginkgo
#

gpt-5 better at any reasoning

hoary elbow
#

Ok

woven turret
tidal ginkgo
#

hey yall

#

how much does 5 chat hallucinate?

hoary elbow
#

Whenever I chat with him only in one conversation, he hallucinated and Iโ€™ve had over 20 conversations with it

#

How about you guys?

#

Oh, youโ€™re talking about the chat version

#

I never really talk to it

tidal ginkgo
#

yeah

#

u shouldnยดt lol

hoary elbow
tidal ginkgo
#

nobody uses it in here

haughty tangle
hoary elbow
#

Do you use it?

#

Mr. Discor

tidal ginkgo
#

just test it

solid brook
#

It is bad

hoary elbow
#

Then how would you know if itโ€™s worse?

tidal ginkgo
#

for something i want a quick answer 2

solid brook
#

The chat version

tidal ginkgo
#

dumber answers

hoary elbow
#

All right

#

But is it at least faster?

tidal ginkgo
#

yes

#

lots faster

#

use it depending on your situation

hoary elbow
#

OK, so Iโ€™m gonna use GPT-5 for coding and essays etc. GPT-5-chat for simple stuff

hoary elbow
#

Because you said it was way faster

tidal ginkgo
#

yes

hoary elbow
solid brook
#

Actually

hoary elbow
#

So GPT five chat is like the direct answer version of GPT five

solid brook
#

I think gpt 5 mini is better than gpt 5 chat

tidal ginkgo
solid brook
#

Gpt 5 mini reasons

tidal ginkgo
#

mmm

#

lemme test them

hoary elbow
#

And I never tested the chat version so

tidal ginkgo
#

dude

#

chat is way faster

hoary elbow
#

Yeah, but itโ€™s worse

tidal ginkgo
#

yeah, i mean

hoary elbow
#

Someone said it was even worse than GPT five mini

solid brook
#

Yeah but gpt 5 mini is a good balance of power and speed

tidal ginkgo
#

like 2 sec of difference

#

the problem with chat is

#

itยดs dumbed down for people that only use chatgpt on their phones for a recipe on a tuesday

hoary elbow
#

People keep saying that GPT five is a one step closer to AGI but is it really I just wanna hear yโ€™allโ€™s opinions and Iโ€™m not talking about the chat version

solid brook
tidal ginkgo
#

guys

haughty tangle
#

Before GPT 5 I thought we would have AGI 2026, now Iโ€™m thinking 2027-2029

hoary elbow
#

All right, canโ€™t wait till 26-27-28 or 29 to see how AI gets

tidal ginkgo
#

lol

tidal ginkgo
#

just wait until gpt-6

#

that hopefully is an AGI

hoary elbow
#

If Gemini three pro comes out, will it be better than GPT five

hoary elbow
#

Will it be one step closer to AGI?

tidal ginkgo
#

but not announcments

tidal ginkgo
haughty tangle
tidal ginkgo
#

more like baby crawling

tidal ginkgo
#

will be even closer than gemini 3

haughty tangle
tidal ginkgo
#

by a lot

hoary elbow
#

When baby grok releases, what will it teach kids?

tidal ginkgo
#

ehhhh

#

not my field lol

hoary elbow
#

All right

#

Thatโ€™s OK

solid brook
hoary elbow
#

What happens if you show GPT five to a Victorian child

tidal ginkgo
haughty tangle
#

sigh we still wait for AGI so we can finally have an infinite self-improving feedback loop and have a dystopian future with unaligned ASI

tidal ginkgo
hoary elbow
#

All right

tidal ginkgo
#

lol

hoary elbow
#

People are saying GPT five sucks

haughty tangle
hoary elbow
#

Is that true?

haughty tangle
tidal ginkgo
haughty tangle
#

Itโ€™s like when grok 4 released

hoary elbow
#

All right

tidal ginkgo
#

very good

#

best ai yet

hoary elbow
#

Whenever I used Grok for for coding, it was damn near Gemini 2.0 pro level

tidal ginkgo
#

for now...

#

until gemini 3 or grok 5

solid brook
hoary elbow
#

So I donโ€™t know all about all the hype in for Glock four when it came out

tidal ginkgo
#

they all want an AGI by 2026

#

which for now is NOT happening

solid brook
#

I feel 2027 or 2028

haughty tangle
tidal ginkgo
#

closer to 2028 than 2027

#

sadly

haughty tangle
#

ASI though? Wonโ€™t take long once we have AGI

#

not gonna take long for obvious reasons

tidal ginkgo
#

lol

solid brook
tidal ginkgo
#

maybe 2030

haughty tangle
#

Iโ€™ve seen people say ASI will take decades after AGI

tidal ginkgo
solid brook
#

I think most people buy supergrok for ani not the actuall ai

hoary elbow
stray aspen
tidal ginkgo
#

elon chill

solid brook
stray aspen
#

It's great for math

solid brook
#

But it is not worth the price at all

stray aspen
#

Yeah it's a robbery

tidal ginkgo
#

not in a million years

stray aspen
#

Just use lm arena lmao

hoary elbow
#

If Grok 4 was actually the best AI model at its time then I would understand all the hype

solid brook
#

I love lmarena.

tidal ginkgo
#

hey

#

are there any benches on every model of the gpt-5 family?

#

like 5, 5 chat, mini and nano

solid brook
#

Forget 5 chat it is pointless

hoary elbow
#

Yeah, and itโ€™s way less expensive

solid brook
#

5 mini i think is close to o3

tidal ginkgo
hoary elbow
#

But you can use it for free on lmarena just like Grok four and you can also use it for free in ChatGPT

solid brook
#

I didn't see any limits on the gpt 5 models

hoary elbow
#

Yeah, but thereโ€™s no limit in lmarena though

tidal ginkgo
#

how is gpt-5 doing on writing?

#

creative writing?

hoary elbow
#

GPT five is doing quite great on writing

solid brook
#

I think i heard a few say that it is worse than 4o in creative writing

tidal ginkgo
#

maybe because chat is more for public

hoary elbow
#

To anyone who has tested GPT five pro how did it feel?

tidal ginkgo
#

its better in that

hoary elbow
#

How was it?

solid brook
hoary elbow
#

All right

tidal ginkgo
#

yeah

#

is there more context window on pro?

solid brook
#

No

#

The chatgpt website context window is sht

tidal ginkgo
solid brook
#

Proof please

hoary elbow
#

I canโ€™t wait for Gemini 3 pro and Grok five

#

I wonder how it is

solid brook
tidal ginkgo
#

gemini 3 closer

#

grok 5 farther

hoary elbow
#

All right

tidal ginkgo
#

but much MUCH better

hoary elbow
#

I understand it might be much better because itโ€™ll come after

tidal ginkgo
#

but gonna cost more than a house in 2025

hoary elbow
#

But what were yโ€™all experience with Grok four because there was too much hype for it when it came out

#

When it coded, bad games was I just using bad prompts

#

Is it really the best model?

solid brook
#

Let's hope all of them create good models with good prices to keep the competetion alive

tidal ginkgo
#

is that proof?

solid brook
#

Uhm i don't see any gpt pro

hoary elbow
#

Their input prices and output prices are very cheap

solid brook
#

Actually on chatgpt website context window is limited 32k. The contex window in pic are for api

tidal ginkgo
#

?

solid brook
#

And gpt 5 pro doesn't have any api

tidal ginkgo
#

sadly

hoary elbow
#

No one talks about copilot of around here so I just wanna clear this out. GPT five is now on Copilot.

tidal ginkgo
#

yeah but nobody cares about gemini today

solid brook
#

Gemini 2.5 pro becomes dumb after 200k context

tidal ginkgo
solid brook
#

The 1 mil is useless

hoary elbow
#

Ok

tidal ginkgo
#

nobody uses copilot anyways lol

solid brook
#

Gpt 5 is nerfed on copilot

#

Lmarena gpt 5 much better

tidal ginkgo
hoary elbow
#

I feel like GPT five on Copilot might be a faster version

tidal ginkgo
#

havenยดt tried copilot

hoary elbow
#

Let me test it out with GPT chat

tidal ginkgo
solid brook
hoary elbow
#

Theyโ€™re using GPT five chat

tidal ginkgo
#

EWWWWW I JUST GOT 5 CHAT IN BATTLE

hoary elbow
#

I told them the exact same thing and they gave the exact same answers

solid brook
obsidian shell
#

wait which gpt5 they have in copilot?

tidal ginkgo
solid brook
#

Nerfed

obsidian shell
#

chat?

solid brook
#

It does not reason much

obsidian shell
#

i thought it was pro normal nano and mini

tidal ginkgo
tidal ginkgo
solid brook
#

On lmarena you can get the model to think up to 3 or 4 minutes. But not on copilot

obsidian shell
#

am talking about the pro plan of copilot

tidal ginkgo
obsidian shell
#

i do

hoary elbow
#

I wanna clear something up

tidal ginkgo
#

well lmarena better

hoary elbow
#

It says itโ€™s GPT four

#

So Copilot just straight up, lied to our faces

tidal ginkgo
#

everyone does that

#

even chatgpt website

#

oh

solid brook
#

They have to give it system promt to say it is gpt t

tidal ginkgo
#

nvm

solid brook
#

Gpt 5

hoary elbow
#

Maybe they didnโ€™t change the system prompt yet

tidal ginkgo
#

maybe

hoary elbow
#

But when they change the system prompt, I might know, or it might straight up lie to my face and say itโ€™s the original version of ChatGPT five

tidal ginkgo
#

he doesnยดt lie lol

clear spear
#

so umm... why is there a gpt-5-chat? what's that about?

hoary elbow
#

Itโ€™s a small version of ChatGPT five

solid brook
#

Hmmm guys when i go use gpt 3.5 on legacy lmarena i get nostalgia

hoary elbow
#

And itโ€™s bad

tidal ginkgo
leaden palm
hoary elbow
#

Well, not that bad but like itโ€™s worse than GPT five

tidal ginkgo
#

gpt-5 is the better, more reasoning model

tidal ginkgo
clear spear
tidal ginkgo
#

lol the pfp

#

oH ITS A DOG

solid brook
#

The said they want to simplify the models. But i feel it got more complicated

tidal ginkgo
#

wwasnยดt on the livestream at all had to go out

#

just saw the first 5 min

hoary elbow
#

Is there anything better than GPT image right now?

tidal ginkgo
#

much better in text

sturdy mica
#

gpt-5 high is op

#

oh my god

tidal ginkgo
sturdy mica
#

gpt-5 high

tidal ginkgo
sturdy mica
#

yeah you guess?

tidal ginkgo
#

closest we got to AGI

#

right now

tidal ginkgo
hoary elbow
#

I feel like whenever AI gets like a very big update and they suddenly become like the biggest model I feel like when it updates it might not be like as better so I feel like in like 2029 the first AGI will come

tidal ginkgo
unborn lantern
#

In pro version?

hoary elbow
#

Well, I didnโ€™t really think about it before

#

I said or 2029

tidal ginkgo
#

i keep my ground

#

closer to 28 than 27

#

but just hypothesis lol

solid brook
#

Guys gpt 4 was 2 years ago. The diffrence between gpt 5 and gpt 4 is a whole lot. 2027 will be AGI

#

Or 2028

tidal ginkgo
#

oh wow

#

2 years ago

#

time flies

tidal ginkgo
hoary elbow
#

What video generators do they have in video Arena?

tidal ginkgo
#

and kling master

#

and wan

#

i think so

hoary elbow
#

All right

#

It also has sea dance too. I saw one of the pros.

solid brook
#

Google has crazy advancments in I

#

AI

#

Damn no image

#

Compare this to all other AI companies

solar hollow
#

yes google is king

patent aspen
# solid brook

They didn't even mention publishing 80% of the seminal AI papers of the last decade

#

ChatGPT couldn't exist without Google

rocky mauve
#

ai is surprisingly growing and evolving so fast, last year ai was garbage and couldnโ€™t do anything good

solid brook
#

I feel google has AGI internal

#

I mean classified

#

The might have it

#

Dude i am saying classified

#

How would you know

#

Hmmm

#

Overall every tech before releasing gets used by the goverment first

astral prawn
#

Rip btw I feel for u

#

Anyone else here using local system prompts with gpt5? How do you feel itโ€™s performing?

Kind of sucks we canโ€™t tweak temperature anymore

prime mulch
#

@echo aurora all image generators expect flux kontext and gpt are dead

autumn cloud
#

yo

#

what's best AI for coding atm? gpt 5?

prime mulch
#

Gemini 2.5 pro and gpt 5

solid brook
#

Use the api

#

Or lmarena

autumn cloud
#

thanks

lament bone
#

how do i delete all data in lmarena?

gentle elk
quiet pollen
solar hollow
random canyon
#

What is the difference between the regular gpt-5 and gpt-5-chat?

solar hollow
#

sota is nowhere near agi yet

#

and if you want to believe unlikely, unverifiable things, go ahead believe in "agi internally" or some god

solid brook
solar hollow
#

did you ever find any indication of it at all?

keen beacon
solid brook
#

If you look in history

lone relic
solid brook
#

Every big tech before being public was used secretly

warm fulcrum
lone relic
#

so is opus 4 itself

#

for coding

lone relic
keen beacon
warm fulcrum
warm fulcrum
lone relic
solid brook
#

Limited

keen beacon
lone relic
warm fulcrum
#

noooooo

keen beacon
solid brook
lone relic
#

yeah

solid brook
#

The api price

lone relic
#

nothing is free in this world

solid brook
keen beacon
lone relic
solid brook
lone relic
keen beacon
acoustic cliff
solid brook
lone relic
lone relic
lone relic
#

esp if agentic

#

for that warp gives limited free opus 4.1

#

wait are we allowed to share referral links here??

solid brook
#

Gpt 5 is way more efficent than opus 4.1

lone relic
#

tru

keen beacon
#

Did you try it

lone relic
#

250 creds

#

per month

#

50ish at least

#

i managed to hit 70-80 though

solid brook
#

Even if opus 4.1 is better it is very minimal not worth the price. Claude must change or it is cooked

keen beacon
#

Artificial analysis shows GPT-5 is bad at coding.

solid brook
#

Wait for gemini 3 that will blow everything

lone relic
lone relic
solid brook
lone relic
#

ye its hella good

solid brook
#

Just say think hard in promt

keen beacon
#

lol

solid brook
lone relic
#

are we allowed to share referral links here?:

solid brook
#

Idk

#

Ask the mods

lone relic
#

<@&1349916362595635286> am i allowed to share referral links here?

zealous panther
#

check rules

#

To ensure we have an inclusive and welcoming community we have some rules everyone should review and adhere to. The moderation team has final say over if violations of these rules have or have not occurred, along with the actions we take in response.

โœ… Act in accordance with Discordโ€™s Terms of Service and Community Guidelines.Violations of these terms and guidelines should be reported directly to Discord. Itโ€™s also recommended to be familiar with Discordโ€™s Safety Center for more information on how to remain safe while using Discord.

โœ… No NSFW, Harmful Content, or Spam. This includes, but isnโ€™t limited to, hate speech, harassment, racisms, sexism, homophobia, illegal content, inappropriate profile pictures, sharing of inappropriate content, and so on.

โœ… Treat others with Respect. Be kind, assume good intent from others, and keep disagreements respectful. Itโ€™s encouraged to share your disagreements, but only if itโ€™s done in a respectful and productive way.

โœ… Do not promote or advertise. This includes sharing of: social media, other Discord servers, or involved projects in an promoting manner.

โœ… Avoid political and religious content. As a space thatโ€™s inclusive to many different worldviews we ask to avoid topics related to politics and religion in order to maintain an inclusive space. It is okay to have discussion related to new policy or laws as long as itโ€™s related to AI.

โœ… Do not impersonate staff, moderators, or others. Efforts to impersonate LMArena staff, server moderators, or other community members is not allowed, even in a joking manner.

โœ… Message in English only. Please keep discussions in English.

Most importantly, remember why we are here, to advance the understanding and application of AI!

keen beacon
#

Wait this the photo from yesterday.

#

This is from now.

#

98% at AIME but somehow canโ€™t solve a equation

solid brook
#

Guys

#

It is not funny anymore

keen beacon
#

Imagine the thousands of simple equations GPT canโ€™t solve.

solid brook
#

Every single ai is garbage at math without reason

lone relic
keen beacon
solid brook
#

I mean without thinking

#

You must enable thinking

lone relic
#

today gpt 5 has been dumbed out, sam altman posted in twitter abt it

lone relic
solid brook
#

Lemme see x

lone relic
#

without thinking it was bad

#

with thinking tho unbeatable tbh

celest zenith
#

I fine-tuned OpenAIโ€™s OSS 20B reasoning model using the most popular medical reasoning dataset and published the results on Hugging Face. The model can break down complex medical cases step-by-step, identify possible diagnoses in clinical scenarios, and answer board-exam-style questions with logical reasoning.

During training, I used 4-bit optimization and enhanced the modelโ€™s performance in medical contexts while preserving its Chain-of-Thought reasoning capabilities. The training format includes โ€œquestion,โ€ โ€œComplex_CoT,โ€ and โ€œResponseโ€ fieldsโ€”allowing the model to first reason in detail, then provide the final answer.

You can check it out here:
๐Ÿ”— https://huggingface.co/dousery/medical-reasoning-gpt-oss-20b

Iโ€™d love to hear feedback from anyone working on or interested in medical AI.

keen beacon
#

They said they will double the rate limits for Plus

lone relic
#

mhm

keen beacon
#

And the model is from march

solid brook
#

Okay so what is the point of this argument?

keen beacon