#general | Arena | Page 97

sour spindle Aug 14, 2025, 6:45 PM

#

might be a google guy now

#

look what you've done openAI

eternal niche Aug 14, 2025, 6:45 PM

#

it depends on what

keen beacon Aug 14, 2025, 6:48 PM

#

Because they are pre-release models

tired herald Aug 14, 2025, 6:49 PM

#

🙂

stray aspen Aug 14, 2025, 6:50 PM

#

tired herald 🙂

how

#

are you a LMArena dev or seomthing

tired herald Aug 14, 2025, 6:50 PM

#

no

#

just having fun with extensions

#

it doesnt fully work right now tho

native flame Aug 14, 2025, 6:52 PM

#

Hii, is the legacy site down??

echo aurora Aug 14, 2025, 6:53 PM

#

native flame Hii, is the legacy site down??

I'm seeing this too. Will flag. blobthanks

keen beacon Aug 14, 2025, 6:53 PM

#

Well the code named models appear only randomly so uhm you can just hope that if you have a long convo, they will appear a lot.

tired herald Aug 14, 2025, 6:58 PM

#

I just got a really good idea for generating code with models

ocean vortex Aug 14, 2025, 6:59 PM

#

tired herald I just got a really good idea for generating code with models

Good for you

hollow imp Aug 14, 2025, 7:00 PM

#

Does the legacy site have any updates

#

Like any new models smth

tired herald Aug 14, 2025, 7:00 PM

#

i dont think so

hollow imp Aug 14, 2025, 7:00 PM

#

I really like the customisation parameters

tired herald Aug 14, 2025, 7:01 PM

#

good point

hollow imp Aug 14, 2025, 7:01 PM

#

Like when @echo aurora you guys changed into new lmarena I was asking gpt and perplexity about you guys

#

😂

tired herald Aug 14, 2025, 7:01 PM

#

503 Service Unavailable
No server is available to handle this request.

#

damn

#

legacy dead

#

#ai-creations message

echo aurora Aug 14, 2025, 7:04 PM

#

hollow imp Does the legacy site have any updates

The legacy site doesn't receive updates sorry to say.

tired herald Aug 14, 2025, 7:05 PM

#

oh

#

it

#

seems

#

file sending may break due to the limit of tokens per user message

#

Ah, who cares

#

okay, found another small bug.....

#

no

#

make it easier to generate code with the AIs, and easier to copy and view it

#

damn

#

thats ai for you

#

it guesses

#

intelligently guesses

#

no, the thing is that on LMArena, they dont have any system message at all

#

So they dont have a name per se

#

none of them is trustworthy

#

they profit by being friendly

stray aspen Aug 14, 2025, 7:28 PM

#

google deepmind guy

#

hes nice

keen beacon Aug 14, 2025, 7:30 PM

#

Demis 100 percent

stray aspen Aug 14, 2025, 7:30 PM

#

agreed

tired herald Aug 14, 2025, 7:32 PM

#

HAHAHAHAHHA

#

FINALLY

eternal niche Aug 14, 2025, 7:32 PM

#

where is V.P.

tired herald Aug 14, 2025, 7:33 PM

#

#ai-creations message

#

working system prompt

eternal niche Aug 14, 2025, 7:33 PM

#

🙁

tired herald Aug 14, 2025, 7:34 PM

#

Finally I can force them AIs to not roleplay as completely different models

#

#

yes my bruv

#

ChatGPT, trained by OpenAI

high ginkgo Aug 14, 2025, 7:41 PM

#

Stop using that dumbass chat format

trail creek Aug 14, 2025, 7:42 PM

#

why are you glasing gpt this much tf?!!

#

damn

neon idol Aug 14, 2025, 7:43 PM

#

Ehat is the prompt?

#

And then how can I have the picture

#

Uhm idk how to do

#

Lol

#

I eill give you the answer

tired herald Aug 14, 2025, 7:45 PM

#

well well well

#

gpt 5 mini high is so slow 😭

#

reasoning or no?

#

okay

neon idol Aug 14, 2025, 7:46 PM

#

I have from grok 4

stray aspen Aug 14, 2025, 7:47 PM

#

craig are you paying a claude subscription

tired herald Aug 14, 2025, 7:48 PM

#

GPT-5 Mini High

#

Claude 4.1 Opus Reasoning

#

why do mine feel worse

neon idol Aug 14, 2025, 7:49 PM

#

@tired herald bro if I dm you can you see with grok 4?

#

I have the svg

tired herald Aug 14, 2025, 7:50 PM

#

Huh

#

okay

#

dm me

golden ocean Aug 14, 2025, 7:52 PM

#

tired herald why do mine feel worse

omg fraud detected

tired herald Aug 14, 2025, 7:53 PM

#

tired herald why do mine feel worse

Actually mine look more detailed

tired herald Aug 14, 2025, 7:53 PM

#

golden ocean omg fraud detected

Nani

#

I wonder why system prompts dont work on the mistral medium model

neon idol Aug 14, 2025, 8:09 PM

#

Glm 4.5 is goodddddd

scenic salmon Aug 14, 2025, 8:09 PM

#

👋

tired herald Aug 14, 2025, 8:10 PM

#

Hi?

scenic salmon Aug 14, 2025, 8:10 PM

#

Just joined, so yeah, hi, lol

tired herald Aug 14, 2025, 8:10 PM

#

Go check out the other channels 👍

golden ocean Aug 14, 2025, 8:13 PM

#

tired herald Go check out the other channels 👍

yeah go check them out NOW @scenic salmon.

scenic crypt Aug 14, 2025, 8:14 PM

#

@echo aurora

tired herald Aug 14, 2025, 8:14 PM

#

golden ocean yeah go check them out NOW <@405963936987611146>.

Huh 😭

echo aurora Aug 14, 2025, 8:14 PM

#

scenic salmon Just joined, so yeah, hi, lol

ablobwave

echo aurora Aug 14, 2025, 8:14 PM

#

scenic crypt <@283397944160550928>

@scenic crypt

scenic crypt Aug 14, 2025, 8:14 PM

#

echo aurora <@644528429605060628>

Is the problem solved?

echo aurora Aug 14, 2025, 8:16 PM

#

scenic crypt Is the problem solved?

Which issue are you referring to?

misty vault Aug 14, 2025, 8:17 PM

#

echo aurora Which issue are you referring to?

gpt-4-0314 not being in the arena

scenic crypt Aug 14, 2025, 8:19 PM

#

echo aurora Which issue are you referring to?

There is a problem with the video creation converter counter. It says that 8 videos cannot be created per day. Veo 3 Audio no longer appears.

#

@echo aurora

echo aurora Aug 14, 2025, 8:20 PM

#

scenic crypt There is a problem with the video creation converter counter. It says that 8 vid...

Yes, the daily amount for Video Arena has been fixed.

#

Veo 3 Audio no longer appears
This was changed. There is now Veo 3 with and without audio.

eternal niche Aug 14, 2025, 8:24 PM

#

trail creek why are you glasing gpt this much tf?!!

golden ocean Aug 14, 2025, 8:24 PM

#

Russian state news

#

it does suck tho

#

wtf is this L by gpt 5?

#

obvious indication it is windows machine and provides linux commands

#

even gpt 3 dont fail that

pure falcon Aug 14, 2025, 8:26 PM

#

eternal niche

If you post this one more time, I’m reporting you

eternal niche Aug 14, 2025, 8:27 PM

#

pure falcon If you post this one more time, I’m reporting you

ok

#

he said not to spam

#

i am not spamming

pure falcon Aug 14, 2025, 8:28 PM

#

Posting the same thing 700 times is spamming

golden ocean Aug 14, 2025, 8:28 PM

#

eternal niche ok

ayo please dont attack lmarena with russian botnet

trail creek Aug 14, 2025, 8:28 PM

#

real

echo aurora Aug 14, 2025, 8:28 PM

#

We've taken action, let's move on please blobthanks

white hatch Aug 14, 2025, 8:29 PM

#

eternal niche ok

😭

trail creek Aug 14, 2025, 8:29 PM

#

https://tenor.com/view/the-warn-button-looks-very-useful-the-mute-button-looks-very-useful-ngl-ban-warn-mute-gif-14828387595483272930

Tenor

obsidian cargo Aug 14, 2025, 8:32 PM

#

theory: the stealth model 'toad' is deepseek v4

stray aspen Aug 14, 2025, 8:48 PM

#

oh damn

#

they banned the gemini ragebaiter

misty vault Aug 14, 2025, 8:57 PM

#

Huh wdym? paws is still here

#

paws day 1 without gemini 2.5 pro😔

inner gate Aug 14, 2025, 9:12 PM

#

I prefer grok 4 over gpt 5 and 2.5 I was wondering if anyone else doed

hollow imp Aug 14, 2025, 9:17 PM

#

Doed?

wicked root Aug 14, 2025, 9:25 PM

#

Hello, LMArena ranking aside, would you guys recommend GPT5 (subscription model) for coding complex (1500+ line) projects?

marsh stratus Aug 14, 2025, 9:36 PM

#

depends on what your alternatives are

#

Ideally you'd have multiple models ready to go

wicked root Aug 14, 2025, 9:47 PM

#

marsh stratus Ideally you'd have multiple models ready to go

I've been using gemini pro exclusively but I get rate limited every day

tidal ginkgo Aug 14, 2025, 9:49 PM

#

guys

#

i think i might never use LMArena again

marsh stratus Aug 14, 2025, 9:50 PM

#

That's honestly pretty boilerplate

tidal ginkgo Aug 14, 2025, 9:51 PM

#

is my personal info in risk for using this service?

leaden palm Aug 14, 2025, 9:51 PM

#

tidal ginkgo i think i might never use LMArena again

this looks like boilerplate google analytics

#

if you don't like analytics, use a browser with tracker blocking or get ublock origin

marsh stratus Aug 14, 2025, 9:53 PM

#

wicked root I've been using gemini pro exclusively but I get rate limited every day

as someone who uses both Gemini and GPT, it's nice to have them cover each other's weaknesses

tidal ginkgo Aug 14, 2025, 9:53 PM

#

yea i will use tracking blocking and vpn

#

inner gate Aug 14, 2025, 9:56 PM

#

hollow imp Doed?

DOES

wintry tinsel Aug 14, 2025, 10:00 PM

#

inner gate I prefer grok 4 over gpt 5 and 2.5 I was wondering if anyone else doed

What do you use it for that makes you prefer it

#

I prefer Claude opus for everything not, math/logic/advanced reasoning

wintry tinsel Aug 14, 2025, 10:01 PM

#

tidal ginkgo

Use brave it’s got automatic tracker blocking

tidal ginkgo Aug 14, 2025, 10:02 PM

#

leaden palm if you don't like analytics, use a browser with tracker blocking or get ublock o...

nah i already installed the thing this guy said

wintry tinsel Aug 14, 2025, 10:02 PM

#

That’s fine but brave is goated it’s got everything built it, 0 bloat ware, and many other useful features

inner gate Aug 14, 2025, 10:02 PM

#

I usually just use it as a companion I’m not that techy. I don’t prefer it because of intelligence I more meant the actual app lol. I do like how it responds tho

wintry tinsel Aug 14, 2025, 10:02 PM

#

Built in Tor as well

inner gate Aug 14, 2025, 10:02 PM

#

General things

wintry tinsel Aug 14, 2025, 10:03 PM

#

inner gate I usually just use it as a companion I’m not that techy. I don’t prefer it becau...

Bruh are you using Ani

inner gate Aug 14, 2025, 10:03 PM

#

ANI?

wintry tinsel Aug 14, 2025, 10:05 PM

#

Yeah

inner gate Aug 14, 2025, 10:05 PM

#

Idk what that is

#

As a companion I meant when I have a question it’s my go to

hollow imp Aug 14, 2025, 10:09 PM

#

inner gate DOES

Eww tobi how will you execute the PLAN like this

inner gate Aug 14, 2025, 10:09 PM

#

💔 I’ll ask it for ideas

wicked root Aug 14, 2025, 10:15 PM

#

@marsh stratusI think I'm in love with claude. Gemini's good but I keep getting rate limited, Grok is... special in a 'stay away from me by at least 5m at all times kind of way' but it froze when I asked some complex coding question, but Claude re wrote my entire code with just a single prompt.

viscid sun Aug 14, 2025, 10:25 PM

#

why chash the LMarena chat history ? how to I solve this problem ?

stark mango Aug 14, 2025, 10:33 PM

#

hello

#

every boddyy

keen beacon Aug 14, 2025, 10:35 PM

#

tidal ginkgo i think i might never use LMArena again

Every social media platform knows a bit more about you than LMArena

#

lol

#

this is silly.

tidal ginkgo Aug 14, 2025, 10:36 PM

#

well now i use tracking blocker

keen beacon Aug 14, 2025, 10:36 PM

#

tidal ginkgo well now i use tracking blocker

Everybody should use that when they install any browser

#

some dont even know ad blocking

tidal ginkgo Aug 14, 2025, 10:36 PM

#

yea

echo aurora Aug 14, 2025, 10:37 PM

#

viscid sun why chash the LMarena chat history ? how to I solve this problem ?

Is your chat history now missing?

echo aurora Aug 14, 2025, 10:46 PM

#

viscid sun why chash the LMarena chat history ? how to I solve this problem ?

Unfortunately, if your chat history is missing you won't be able to get it back. This is a problem our team is looking to address both with new features but overall site reliability as well. I am sorry you don't have access to that history anymore.

trail creek Aug 14, 2025, 10:55 PM

#

nano banana removed from the arena?

echo aurora Aug 14, 2025, 10:59 PM

#

trail creek nano banana removed from the arena?

I don't believe so, what makes you say that?

trail creek Aug 14, 2025, 11:03 PM

#

Its been very long since i got it in the battle.

#

don't lie to me yes it is

golden ocean Aug 14, 2025, 11:17 PM

#

echo aurora I don't believe so, what makes you say that?

bro thinks hes bing chat

keen beacon Aug 14, 2025, 11:30 PM

#

trail creek don't lie to me yes it is

It's not removed

#

I just got it in my testing

golden ocean Aug 14, 2025, 11:32 PM

#

trail creek Aug 14, 2025, 11:33 PM

#

Idk why you are lying but it is hundred percent removed

#

im not schizo.

golden ocean Aug 14, 2025, 11:35 PM

#

golden ocean

This is @echo aurora's reaction rn to

golden ocean Aug 14, 2025, 11:35 PM

#

trail creek Idk why you are lying but it is hundred percent removed

this message

misty vault Aug 14, 2025, 11:48 PM

#

Or

I'm sorry, but I don't believe that's accurate. I think there may be some misunderstanding here. I'm still learning, so my assessment could be mistaken, and I appreciate your understanding and patience.🙏

exotic stream Aug 14, 2025, 11:48 PM

#

tidal ginkgo yea i will use tracking blocking and vpn

Buddy your data isn't that important chill the hell down 🤣😭😭

#

Mf acting like the CIA is on his ass

golden ocean Aug 14, 2025, 11:49 PM

#

No bro it's not that, i mean like what if the data of me gooning with ai girlfriend gets leaked and it can linked to my identity irl?!?! then no one want to hire me anymore : (((

#

like claude serious asf but damn he know how to .... well i can't say more due to the rules here :(

#

🤣😭😭

stray aspen Aug 14, 2025, 11:52 PM

#

dont link your identity tot he AI

golden ocean Aug 14, 2025, 11:53 PM

#

but LMarena does that now for us

keen beacon Aug 14, 2025, 11:55 PM

#

trail creek im not schizo.

Maybe they forgot you

keen beacon Aug 14, 2025, 11:56 PM

#

golden ocean No bro it's not that, i mean like what if the data of me gooning with ai girlfri...

The stuff is anonymized, even in direct chats, unless you provide your real name and other revealing info

golden ocean Aug 14, 2025, 11:56 PM

#

other websites might have my name

#

keen beacon Aug 14, 2025, 11:58 PM

#

if it concerns you, you can always use a VPN.

golden ocean Aug 14, 2025, 11:58 PM

#

exotic stream Buddy your data isn't that important chill the hell down 🤣😭😭

that was indeed my plan, but then this mean @exotic stream interrupted us 😡 im shaking and crying rn

exotic stream Aug 14, 2025, 11:59 PM

#

golden ocean that was indeed my plan, but then this mean <@411705957849104384> interrupted us...

If privacy concerns you that much just use a local model bro

golden ocean Aug 15, 2025, 12:09 AM

#

not the mc video deleted too 😭

misty vault Aug 15, 2025, 12:10 AM

#

golden ocean not the mc video deleted too 😭

severe stream Aug 15, 2025, 12:10 AM

#

Hey does anyone here what the limits for each model are or is it just completely free

keen beacon Aug 15, 2025, 12:11 AM

#

severe stream Hey does anyone here what the limits for each model are or is it just completel...

There are some rate limits

#

especially if you put too many inputs in one minute aka rpm

misty vault Aug 15, 2025, 12:12 AM

#

severe stream Aug 15, 2025, 12:12 AM

#

keen beacon especially if you put too many inputs in one minute aka rpm

do you know how much it is

keen beacon Aug 15, 2025, 12:12 AM

#

severe stream do you know how much it is

No, but the whole service is free.

#

just dont rapid fire the chat

#

lol

severe stream Aug 15, 2025, 12:13 AM

#

keen beacon lol

Damn why is'nt this website popular then wth

keen beacon Aug 15, 2025, 12:13 AM

#

severe stream Damn why is'nt this website popular then wth

Don't know, lol

echo aurora Aug 15, 2025, 12:35 AM

#

severe stream Damn why is'nt this website popular then wth

Tell your friends!

inner gate Aug 15, 2025, 12:37 AM

#

echo aurora Tell your friends!

Have u tried advertising lol

golden ocean Aug 15, 2025, 12:40 AM

#

echo aurora Tell your friends!

am i your friend? 🥺

echo aurora Aug 15, 2025, 12:43 AM

#

golden ocean am i your friend? 🥺

You didn't wish me happy birthday

golden ocean Aug 15, 2025, 12:45 AM

#

stray aspen Aug 15, 2025, 12:46 AM

#

is that you pineapple

misty vault Aug 15, 2025, 12:46 AM

#

yes

echo aurora Aug 15, 2025, 12:51 AM

#

I take it back

pure falcon Aug 15, 2025, 12:52 AM

#

echo aurora You didn't wish me happy birthday

What! How did i miss this?? Happy bday pineapple!!

echo aurora Aug 15, 2025, 12:53 AM

#

pure falcon What! How did i miss this?? Happy bday pineapple!!

Lol not my birthday, thanks tho

pure falcon Aug 15, 2025, 12:54 AM

#

Well, you deserve blessings for a wonderful day no matter what

queen thorn Aug 15, 2025, 1:17 AM

#

is there an answer to why most of the models get all fucky and respond with that?
"Something went wrong while generating the response. Please try again." or is there maybe a temporary fix i could use to use my current chats so that the model doesn't lose the context of whatever it's working on?

inner gate Aug 15, 2025, 1:49 AM

#

queen thorn is there an answer to why most of the models get all fucky and respond with that...

Hmmm

blazing bison Aug 15, 2025, 1:56 AM

#

queen thorn is there an answer to why most of the models get all fucky and respond with that...

1 - This message generally means that you are rate limited
2 - About more context, prob no, bcs of costs

vital lake Aug 15, 2025, 1:58 AM

#

@echo aurora
How do you guys get all these expensive APIs lol? Sorry for ping

blazing bison Aug 15, 2025, 1:59 AM

#

vital lake <@283397944160550928> How do you guys get all these expensive APIs lol? Sorry f...

Why don't use the discord search function?

#

Ppl ask it 999 times everyday

echo aurora Aug 15, 2025, 2:00 AM

#

queen thorn is there an answer to why most of the models get all fucky and respond with that...

Yeah sorry to say sometimes models will error out for different reasons. We are working on making these error messages more clear about what went wrong

vital lake Aug 15, 2025, 2:01 AM

#

blazing bison Why don't use the discord search function?

What do I search lol?

golden ocean Aug 15, 2025, 2:11 AM

#

the video arena ruined search discord function

golden ocean Aug 15, 2025, 2:41 AM

#

scenic salmon Aug 15, 2025, 2:42 AM

#

vital lake <@283397944160550928> How do you guys get all these expensive APIs lol? Sorry f...

I assume they're granted API credits, if not near unlimited API access by all of the model companies since lmarena provides valuable feedback on what users like and don't like 🤷‍♂️

#

saves them from having to do the research themselves

echo aurora Aug 15, 2025, 2:44 AM

#

vital lake <@283397944160550928> How do you guys get all these expensive APIs lol? Sorry f...

Sry for delay, doorbell. We are paying for the usage. This blog post should be helpful: https://news.lmarena.ai/new-lmarena/

scenic salmon Aug 15, 2025, 2:46 AM

#

ha, you got VCs to back you, I should pick your brain if you were involved in the raising at all @echo aurora

vital lake Aug 15, 2025, 2:50 AM

#

LOL

#

Sounds to me like they want to build their own models

#

Using the data

scenic salmon Aug 15, 2025, 2:55 AM

#

vital lake Sounds to me like they want to build their own models

it's not the worst idea, they have more data on the type of responses preferred than any of the individual model companies

vital lake Aug 15, 2025, 2:56 AM

#

scenic salmon it's not the worst idea, they have more data on the type of responses preferred ...

And your getting high quality data from things like GPT-5 and Opus

#

Thats worth milions

scenic salmon Aug 15, 2025, 2:57 AM

#

yeah, no synthetic training data needed, all real human questions

scenic salmon Aug 15, 2025, 2:58 AM

#

vital lake Thats worth milions

$100M to be exact when

vernal oxide Aug 15, 2025, 3:00 AM

#

how to do i use bananaLLM

#

the uhh

#

nano banana image generation

vital lake Aug 15, 2025, 3:01 AM

#

Btw how did Solar get so good?

#

I remember they were hot garbage

#

They fr locked in

echo aurora Aug 15, 2025, 3:05 AM

#

vernal oxide how to do i use bananaLLM

Models that are using a codename can only be used in the Battle mode (random models being sampled side-by-side). Meaning you aren't able to select it specifically

vital lake Aug 15, 2025, 3:06 AM

#

echo aurora Models that are using a codename can only be used in the Battle mode (random mod...

Do you guys charge a lower AI lab if they wanted to test their AI as a codename?

rough brook Aug 15, 2025, 4:24 AM

#

hi guys what the best prompts for still frame, still camera no movement?

true tusk Aug 15, 2025, 4:43 AM

#

HI

unborn lily Aug 15, 2025, 4:58 AM

#

Hi

opaque mirage Aug 15, 2025, 5:14 AM

#

hi

hollow field Aug 15, 2025, 5:39 AM

#

Hi

keen beacon Aug 15, 2025, 6:14 AM

#

wth

wintry tinsel Aug 15, 2025, 7:24 AM

#

Countdown to Gemini 3💀

rare python Aug 15, 2025, 7:32 AM

#

echo aurora Models that are using a codename can only be used in the Battle mode (random mod...

Is the proposal of direct chat for codename models disapproved or still on your todo list?

tribal nymph Aug 15, 2025, 7:46 AM

#

Many models shown on the site aren’t the real ones or are just named versions that don’t exist (like GPT-5, Grok 4, etc.).
They are actually working on GPT-4 & Grok 2.

Kindly fix it ASAP

keen beacon Aug 15, 2025, 7:55 AM

#

wintry tinsel Countdown to Gemini 3💀

Where

#

Guys I have a question

#

Has anyone of you experienced a model becoming unable to answer some questions correctly after it was upgraded?

#

Like GPT-o4 answered some correctly, then o3 wasn't able to get it at all

#

I found that Deepseek and Qwen answer my questions correctly based on incorrect data (!) it was trained on what the heck

unborn lantern Aug 15, 2025, 8:07 AM

#

@echo aurora Brother, the chat history issue is causing a lot of trouble. Please take necessary steps to fix it.

random wolf Aug 15, 2025, 8:31 AM

#

I hope they put soon the "cancel button" it's so frustrating

#

any solution to stop the "generating"?

solid brook Aug 15, 2025, 9:04 AM

#

wintry tinsel Countdown to Gemini 3💀

What happened?

keen beacon Aug 15, 2025, 9:30 AM

#

tribal nymph Many models shown on the site aren’t the real ones or are just named versions th...

Evidence for that?

queen thorn Aug 15, 2025, 9:47 AM

#

blazing bison 1 - This message generally means that you are rate limited 2 - About more contex...

I understand, but it's weird since like it's not even related to what AI you're choosing on the website, once this error appears even changing it to a different model wouldn't fix that whatsoever, the chat itself goes to hell

#

until you're starting a new chat that is, and everything works well again, only if there was a way to maybe export the chat after it dies and maybe import it into another one so that you'll be able to feed the model context about what you're trying to build with it, instead of starting from zero and having to feed it information about what you were using it for in the previous chats

#

because it’s currently in a very unstable state, the website and the concept are cool, but that doesn’t mean much if your chats die after just three minutes of use

verbal nimbus Aug 15, 2025, 9:52 AM

#

#

https://x.com/btibor91/status/1955241562486763962

verbal nimbus Aug 15, 2025, 10:21 AM

#

For fairness, I think other reasoning levels should be included, not a configuration that even Pro users won't get to use (if X posts are accurate).

keen beacon Aug 15, 2025, 10:24 AM

#

i believe toad is a qwen series model:

it uses emojis in a similar style
"the vibes"

#

neither do deepseek models use emojis for responses nor does gemini.

The only other model i've observed is chatgpt-4o-latest

#

obtuse heart Aug 15, 2025, 10:36 AM

#

tribal nymph Many models shown on the site aren’t the real ones or are just named versions th...

0 evidence

viscid timber Aug 15, 2025, 10:41 AM

#

what happened to video arena 4

ocean vortex Aug 15, 2025, 10:47 AM

#

verbal nimbus For fairness, I think other reasoning levels should be included, not a configura...

There are some decent reasons for them to do this though. Reasoning effort changes are more impactful in cost for chatgpt than API, since it's often doing tool calls and that mostly happens while it's reasoning. They are fine going all out on API even for diminishing returns but for chatgpt this makes no sense if they get big increase in cost with little to no benefit. And then also, "128" technically still falls within high reasoning effort range for gpt5, since medium on API is 64.

blazing bison Aug 15, 2025, 10:49 AM

#

queen thorn I understand, but it's weird since like it's not even related to what AI you're ...

The rate limits are for all models, not for specific model. And you know that every prompt you send is gonna be public right? Lmarena is for you to test models not for real work

blazing bison Aug 15, 2025, 10:51 AM

#

ocean vortex There are some decent reasons for them to do this though. Reasoning effort chang...

Roon said that this juicy thing is not the reasoning effort

neon idol Aug 15, 2025, 10:51 AM

#

keen beacon i believe toad is a qwen series model: - it uses emojis in a similar style - "t...

Toad? What the hell is?

blazing bison Aug 15, 2025, 10:51 AM

#

And that gpt thinking is using high reasoning

ocean vortex Aug 15, 2025, 10:52 AM

#

tribal nymph Many models shown on the site aren’t the real ones or are just named versions th...

Kindly educate yourself please. Just because a model responds it is based on some older variant, that is completely normal behavior. Models are not trained about themselves much usually, since this is not very useful for IRL applications of solving actual tasks or problems. Fine-tuning on redundant things like that will degrade performance everywhere else

blazing bison Aug 15, 2025, 10:53 AM

#

Claude is trained to say which model they are btw

#

But others, no

ocean vortex Aug 15, 2025, 10:54 AM

#

Yeah that's why I said "usually"

#

Most of the time models will get this info from a system message

blazing bison Aug 15, 2025, 10:54 AM

#

Ye just claude do that

#

Just claude train to say which model they are i mean

#

Openai, grok, they use the system prompt generally

ocean vortex Aug 15, 2025, 10:55 AM

#

blazing bison Roon said that this juicy thing is not the reasoning effort

link? 🧐

#

I'm pretty sure it is. There's a direct link between reasoning effort and that number for sure

blazing bison Aug 15, 2025, 10:56 AM

#

I didn't saved it but it was in Tibor post

viscid lake Aug 15, 2025, 10:57 AM

#

#

Chat gpt us still better at Pokémon

fleet lintel Aug 15, 2025, 10:59 AM

#

Which company is Nano-banana? OpenAI ?

keen beacon Aug 15, 2025, 10:59 AM

#

viscid lake

Maybe you could ask it for a different style?

keen beacon Aug 15, 2025, 11:00 AM

#

fleet lintel Which company is Nano-banana? OpenAI ?

Google.

fleet lintel Aug 15, 2025, 11:00 AM

#

Does nano implies small model??

keen beacon Aug 15, 2025, 11:01 AM

#

fleet lintel Does nano implies small model??

Well we can only assume that is the case. No one knows for sure

viscid lake Aug 15, 2025, 11:03 AM

#

keen beacon Maybe you could ask it for a different style?

Maybe

unborn lantern Aug 15, 2025, 11:04 AM

#

Is there any website like lmarena?

#

Anyone know?

viscid lake Aug 15, 2025, 11:05 AM

#

imagen-4.0-ultra-generate-preview-06-06_A_steampunk_mechanic.png

#

First,banana
Second chat gpt
Third is imegen 4 ultra

ocean vortex Aug 15, 2025, 11:06 AM

#

blazing bison I didn't saved it but it was in Tibor post

Tried to look for it and only found this. Which seems to confirm it being the reasoning effort rather than deny lol

unborn lantern Aug 15, 2025, 11:07 AM

#

Is there any website like lmarena?

ocean vortex Aug 15, 2025, 11:11 AM

#

Also I don't think juice number directly translates gpt5 vs o3. For o3-medium the number is 2X compared to gpt5-medium, but it only just generates more:

#

not by much I mean

ocean vortex Aug 15, 2025, 11:12 AM

#

unborn lantern Is there any website like lmarena?

https://www.designarena.ai

blazing bison Aug 15, 2025, 11:21 AM

#

ocean vortex Tried to look for it and only found this. Which seems to confirm it being the re...

Idk them, there is another post that he said that think harder on router can reach high thinking too

#

I think that it is this one

hollow imp Aug 15, 2025, 11:26 AM

#

😭

rocky mauve Aug 15, 2025, 11:36 AM

#

anyone know an ai that’s capible of making high quality gfx’s? Is it even possible yet

hollow imp Aug 15, 2025, 11:51 AM

#

rocky mauve anyone know an ai that’s capible of making high quality gfx’s? Is it even possib...

No it isn't at all

#

I've tried that a lot

#

Please don't waste your time like me for that

ocean vortex Aug 15, 2025, 11:53 AM

#

blazing bison Idk them, there is another post that he said that think harder on router can rea...

that's different context though. Their router was the weakest link at the start and their model card technically describes it as being more significant than it was on launch. If they haven't improved that yet I'm sure they will. It honestly made no sense that it couldn't do more than low reasoning effort, especially now with it being renamed to "Auto" and describing it as "decides how long to think"

#

It should be able to do chat/low/medium (for Plus sub), the way they are presenting it

blazing bison Aug 15, 2025, 11:54 AM

#

it sucks anyway

#

like o3

#

always better on api

ocean vortex Aug 15, 2025, 11:54 AM

#

Then for Pro high as well (128)

blazing bison Aug 15, 2025, 11:55 AM

#

their chatgpt prompt sucks and the infra or whatever they do on the backend sucks

#

and gpt-5-pro feel exctly like o3-pro, i'm sure they just changed the name

ocean vortex Aug 15, 2025, 11:56 AM

#

blazing bison it sucks anyway

I mean at the start when that was named "GPT5" it did actually perform better than gpt5 with no reasoning and that was fairly obvious. Now it's still a good option for people who are confused by all the models and just want to ask a question

blazing bison Aug 15, 2025, 11:57 AM

#

now that gemini copy pasted their frontend

#

when they finish copy pasting the features too

#

it's gonna be better than chatgpt

ocean vortex Aug 15, 2025, 11:58 AM

#

blazing bison and gpt-5-pro feel exctly like o3-pro, i'm sure they just changed the name

Nah no way. It is based on gpt5 now rather than o3. Different base model. It's like comparing gpt4o from last year vs gpt4.1

blazing bison Aug 15, 2025, 11:59 AM

#

ocean vortex Nah no way. It is based on gpt5 now rather than o3. Different base model. It's l...

i retryed my old prompts btw

#

it did the same frontend

#

the same answer

ocean vortex Aug 15, 2025, 11:59 AM

#

Try something different but still with visuals, you should see the difference tbh

#

it now ranks nr1 on webdev

blazing bison Aug 15, 2025, 11:59 AM

#

yeah i tryed already to see the difference

ocean vortex Aug 15, 2025, 11:59 AM

#

while before it was bad

blazing bison Aug 15, 2025, 11:59 AM

#

there is none

#

the model that they deployed for testers on api was called o3-alpha too

#

xDD

ocean vortex Aug 15, 2025, 12:00 PM

#

I haven't tried frontend, but for svg the difference is obvious

blazing bison Aug 15, 2025, 12:00 PM

#

after that they changed it to nectarine

ocean vortex Aug 15, 2025, 12:01 PM

#

blazing bison xDD

yeah that's just random joke lol

#

they did it with gpt2-chatbot as well

blazing bison Aug 15, 2025, 12:01 PM

#

idk about that

#

the model that won the math olympiada has o3-alpha on the name too

ocean vortex Aug 15, 2025, 12:02 PM

#

They knew everyone would see that name, they did it deliberately...

blazing bison Aug 15, 2025, 12:02 PM

#

it was leaked on the git

ocean vortex Aug 15, 2025, 12:02 PM

#

Not to make it too obvious that it's gpt5 base I think

#

But still promote their stuff

ocean vortex Aug 15, 2025, 12:04 PM

#

blazing bison it was leaked on the git

it was on lmarena and webdev

#

with that name

#

It was immediately obvious to everyone that this name alias is public

#

Besides, @blazing bison , those gains wouldn't be possible otherwise without changing the base model tbh. gpt5-medium generates less than o3-medium but performs considerably better

winged burrow Aug 15, 2025, 12:06 PM

#

/video prompt

blazing bison Aug 15, 2025, 12:07 PM

#

ocean vortex Besides, <@224577039724838912> , those gains wouldn't be possible otherwise with...

i believe that gpt-5 is another base model

#

just the pro version that is strange

#

as a heavy user of the pro version

#

i noticed the differece from o1 to o3-pro clearly

#

but from o3-pro for gpt-5-pro

#

almost none

ocean vortex Aug 15, 2025, 12:07 PM

#

blazing bison i believe that gpt-5 is another base model

well then pro is different as well. Pro is literally just some code for the most part

blazing bison Aug 15, 2025, 12:07 PM

#

and i'm talking about frontend in general

ocean vortex Aug 15, 2025, 12:08 PM

#

And it is using gpt5

#

not o3

blazing bison Aug 15, 2025, 12:08 PM

#

they took 3 months to update from o1-pro to o3-pro

#

but released gpt-5-pro on the same day

#

that was already sus asf

#

and

#

there is no gpt-5 pro on api why

#

it's like they changed o3-pro low to o3-pro medium now

ocean vortex Aug 15, 2025, 12:10 PM

#

They were slow with o3-pro as well. I think it took them time to test and approve. You can probably get some unexpected results by running each prompt 10+ times and to make gains it is not super easy out the box. Small changes even to just the prompting can make a fairly big difference

ornate agate Aug 15, 2025, 12:10 PM

#

lol. Isn’t pro just gpt high now?

ocean vortex Aug 15, 2025, 12:12 PM

#

blazing bison there is no gpt-5 pro on api why

This time it may actually be capacity for real... They did big changes with new releases and are still finetuning the caps for subs. Pro API would nuke their gpus lol

blazing bison Aug 15, 2025, 12:12 PM

#

ocean vortex They were slow with o3-pro as well. I think it took them time to test and approv...

like i said, i can't proof, but i'm not the only one pro user that feels that, everyone that heavy use pro models from openai feels the same

#

i think the real gpt-5 pro is gonna appear on chatgpt only after it's avaliable on api

ornate agate Aug 15, 2025, 12:13 PM

#

I thought it was leaked that pro is just gpt high

ocean vortex Aug 15, 2025, 12:13 PM

#

blazing bison i think the real gpt-5 pro is gonna appear on chatgpt only after it's avaliable ...

conspiracy theory much. This is extremely unlikely

blazing bison Aug 15, 2025, 12:13 PM

#

ornate agate I thought it was leaked that pro is just gpt high

gpt high didn't take 10-20 minutes to answer

blazing bison Aug 15, 2025, 12:14 PM

#

ocean vortex conspiracy theory much. This is *extremely* unlikely

well i have it, i'm using it, and the AI communites on x says the same

ocean vortex Aug 15, 2025, 12:14 PM

#

ornate agate lol. Isn’t pro just gpt high now?

No, pro is parallel test-time compute. As should have been immediatelly obvious but you can read their model card if you don't want to take my word for it. 👀

blazing bison Aug 15, 2025, 12:15 PM

#

and there is a router on pro models on chatgpt too

#

it answer in seconds if your question is simple

ornate agate Aug 15, 2025, 12:16 PM

#

Hmm. What is it then?

keen beacon Aug 15, 2025, 12:16 PM

#

verbal nimbus

What? So 200 juice is reserved for API specifically, and even Pro users don't get this treatment? Is this a joke?

ocean vortex Aug 15, 2025, 12:16 PM

#

In ChatGPT, we also provide access
to gpt-5-thinking using a setting that makes use of parallel test time compute; we refer to this as
gpt-5-thinking-pro.

ornate agate Aug 15, 2025, 12:17 PM

#

verbal nimbus

Yeah this is what I was thinking of. It shows pro as juice 128

blazing bison Aug 15, 2025, 12:17 PM

#

keen beacon What? So 200 juice is reserved for API specifically, and even Pro users don't ge...

yes

ocean vortex Aug 15, 2025, 12:17 PM

#

Notice that even when writing this they knew it's gonna be available at first only on chatgpt

#

not API

keen beacon Aug 15, 2025, 12:19 PM

#

Why did they cap the juice at 200?

#

Why not 512 or 1024?

#

Why this exact number?

blazing bison Aug 15, 2025, 12:20 PM

#

no different results

keen beacon Aug 15, 2025, 12:20 PM

#

We can only wonder...

ocean vortex Aug 15, 2025, 12:20 PM

#

@blazing bison I think it "feels" the same probably because they used much of the same prompting to compute ~10 attempts into a single answer. So when it follows their instructions, answers are to look similar as previous pro, by design

ornate agate Aug 15, 2025, 12:20 PM

#

keen beacon Why did they cap the juice at 200?

I have a theory on that. If you think about it a bit it’s obvious…

keen beacon Aug 15, 2025, 12:20 PM

#

ornate agate I have a theory on that. If you think about it a bit it’s obvious…

Context length?

ornate agate Aug 15, 2025, 12:20 PM

#

Ye

blazing bison Aug 15, 2025, 12:20 PM

#

no

#

it's bcs above 200 there is no difference or the model actually lose performance

keen beacon Aug 15, 2025, 12:21 PM

#

So you want to say that at some point the context of reasoning becomes too long and the model starts to slop?

ocean vortex Aug 15, 2025, 12:21 PM

#

blazing bison no

No what. What evidence do you have to prove this absurd theory that they are using o3-pro in disguise marketed as gpt5-pro?

ornate agate Aug 15, 2025, 12:21 PM

#

Yes. So 200 is the max cot length it was trained on. Above that is slop.

blazing bison Aug 15, 2025, 12:22 PM

#

ocean vortex No what. What evidence do you have to prove this absurd theory that they are usi...

my gut feeling and my friends that actually have pro

#

idk what you're talking about on plus plan

keen beacon Aug 15, 2025, 12:22 PM

#

ornate agate Yes. So 200 is the max cot length it was trained on. Above that is slop.

But why just 200

ocean vortex Aug 15, 2025, 12:22 PM

#

blazing bison my gut feeling and my friends that actually have pro

So basically nothing. It's difficult to test models properly even doing the right things, let alone 'gut feeling'...

blazing bison Aug 15, 2025, 12:23 PM

#

ocean vortex So basically nothing. It's difficult to test models properly even doing the righ...

if you run same prompts and have the same results

#

it's enough

ocean vortex Aug 15, 2025, 12:23 PM

#

No it's not

#

o3-pro was not gpt3.5

blazing bison Aug 15, 2025, 12:23 PM

#

it is

ocean vortex Aug 15, 2025, 12:23 PM

#

?

blazing bison Aug 15, 2025, 12:23 PM

#

now i have better things to do

#

bye

warm fulcrum Aug 15, 2025, 12:23 PM

#

arguing about pointless things goes hard

#

ur being ragebaited and u cant see it

#

🔥

keen beacon Aug 15, 2025, 12:24 PM

#

By the way did you know guys that these stealth models drop their names if you ask them nicely

vapid zinc Aug 15, 2025, 12:24 PM

#

Is there any specified schedule for when the leaderboards are updated?

blazing bison Aug 15, 2025, 12:24 PM

#

🤓

keen beacon Aug 15, 2025, 12:24 PM

#

keen beacon By the way did you know guys that these stealth models drop their names if you a...

I don't know what's the system prompt of LMArena that restricts them to share their names

#

But they will at least very generously share the information about their creators

ocean vortex Aug 15, 2025, 12:25 PM

#

@blazing bison Also wait a sec, you are SERIOUSLY arguing that gpt5-pro is just a scam and that they are using o3-pro instead, based on your gut feeling and no evidence to distinguish between 2 very good models? You are ACTUALLY for real? 🤣 🤣

warm fulcrum Aug 15, 2025, 12:25 PM

#

i wasnt siding with u

ocean vortex Aug 15, 2025, 12:25 PM

#

LOL

blazing bison Aug 15, 2025, 12:25 PM

#

ocean vortex <@224577039724838912> Also wait a sec, you are SERIOUSLY arguing that gpt5-pro ...

they can call anything gpt-5-pro on their end

#

it's not a scam

ocean vortex Aug 15, 2025, 12:25 PM

#

no they can't

#

it would be an obvious scam

nimble trail Aug 15, 2025, 12:26 PM

#

blazing bison it's not a scam

It's technically is tho💀

blazing bison Aug 15, 2025, 12:26 PM

#

it's not, on their end gpt-5 can be called o3-alpha and it's ok

ocean vortex Aug 15, 2025, 12:26 PM

#

What gpt5-pro means is CLEARLY defined in their model card

#

no 2 ways about it

warm fulcrum Aug 15, 2025, 12:26 PM

#

blazing bison they can call anything gpt-5-pro on their end

ur saying that openai will break federal laws

#

good point

flint sandal Aug 15, 2025, 12:27 PM

#

Gpt5pro means that its gemini knockoff

warm fulcrum Aug 15, 2025, 12:27 PM

#

👍

novel crater Aug 15, 2025, 12:27 PM

#

dang you can use Claude 4.1 for free now too lmarena is awesome!

keen beacon Aug 15, 2025, 12:27 PM

#

Imagine if OpenAI really scammed everyone with this move 💀

nimble trail Aug 15, 2025, 12:27 PM

#

blazing bison it's not, on their end gpt-5 can be called o3-alpha and it's ok

What do you mean it's okay 😭😭😭

blazing bison Aug 15, 2025, 12:27 PM

#

warm fulcrum ur saying that openai will break federal laws

?? they can call anything gpt-5-pro bro

keen beacon Aug 15, 2025, 12:27 PM

#

blazing bison ?? they can call anything gpt-5-pro bro

You sound delusional and misinformed

blazing bison Aug 15, 2025, 12:27 PM

#

i'm not saying that they are delivering a worst model

warm fulcrum Aug 15, 2025, 12:27 PM

#

blazing bison ?? they can call anything gpt-5-pro bro

could you be more vague

keen beacon Aug 15, 2025, 12:28 PM

#

keen beacon Imagine if OpenAI really scammed everyone with this move 💀

To be honest it's not much surprising after the news about Sama being a pathological narcissistic liar

warm fulcrum Aug 15, 2025, 12:28 PM

#

ur saying openai is doing illegal stuff

#

which they cnat

blazing bison Aug 15, 2025, 12:28 PM

#

warm fulcrum could you be more vague

if they change reasoning effort of old o3-pro to high and call it gpt-5-pro it's ok

keen beacon Aug 15, 2025, 12:28 PM

#

Lol.

blazing bison Aug 15, 2025, 12:28 PM

#

they aren't breaking any laws

#

doing that

warm fulcrum Aug 15, 2025, 12:28 PM

#

no

keen beacon Aug 15, 2025, 12:28 PM

#

warm fulcrum which they cnat

Yeah, nobody can do illegal stuff

warm fulcrum Aug 15, 2025, 12:28 PM

#

because they market it as a better tool

keen beacon Aug 15, 2025, 12:28 PM

#

They will goto jail

#

Its illegal

blazing bison Aug 15, 2025, 12:28 PM

#

warm fulcrum because they market it as a better tool

if the reasoning effort is high, the result is better

#

then it's a better tool

warm fulcrum Aug 15, 2025, 12:29 PM

#

source: trust me bro

#

i guess gpt-5 is actually just gpt 2 fine tuned

#

we got played..

blazing bison Aug 15, 2025, 12:29 PM

#

i'm not saying that it's actually it

#

no

#

i tryed it

#

with the same prompts

#

and got the exactly same results

keen beacon Aug 15, 2025, 12:29 PM

#

Imagine if everything they did to gpt-5 was just scaling the reasoning up to 200 juice, once Gemini scales up to this number OpenAI will be cooked 💀

warm fulcrum Aug 15, 2025, 12:29 PM

#

blazing bison and got the exactly same results

which platform did you try this in?

warm fulcrum Aug 15, 2025, 12:30 PM

#

keen beacon Imagine if everything they did to gpt-5 was just scaling the reasoning up to 200...

zenith was 64 juice

#

was better than anything atm

wet sparrow Aug 15, 2025, 12:31 PM

#

I agree with him. I don't think the people arguing here even have Pro

#

There are even posts about it on the OpenAI forum

#

OpenAI said they are working on it

keen beacon Aug 15, 2025, 12:32 PM

#

So here's the following dilemma

keen beacon Aug 15, 2025, 12:32 PM

#

neon idol Toad? What the hell is?

the hidden model in battle mode

#

You get GPT Pro with two stupid models running at once that kind of "increases" the odds of successful completion of a task

#

Or one smart GPT High

#

Two stupid models or one smart?

blazing bison Aug 15, 2025, 12:33 PM

#

we don't really know what it is

#

or what it do

keen beacon Aug 15, 2025, 12:33 PM

#

qwen:

warm fulcrum Aug 15, 2025, 12:34 PM

#

"basically guys i think gpt-5 pro is the same as o3 pro bc my gut says so"

wet sparrow Aug 15, 2025, 12:34 PM

#

Theo even made a video about it. It's not directly about Pro Models, but it is about all GPT-5 models. There is a problem with them

warm fulcrum Aug 15, 2025, 12:34 PM

#

holy ragebait mother of 3

wet sparrow Aug 15, 2025, 12:34 PM

#

OpenAI has already addressed this

keen beacon Aug 15, 2025, 12:34 PM

#

keen beacon the hidden model in battle mode

it's trivial to jailbreak LMArena to make the models namedrop

keen beacon Aug 15, 2025, 12:35 PM

#

keen beacon it's trivial to jailbreak LMArena to make the models namedrop

? pls tell in detail

keen beacon Aug 15, 2025, 12:35 PM

#

keen beacon ? pls tell in detail

Up to you, hacker

#

But I can say that, from my experience, if it is a model that is worth attention, everyone will be talking about it

#

Be it Zenith, Toad or whatever

#

agreed

keen beacon Aug 15, 2025, 12:35 PM

#

keen beacon Be it Zenith, Toad or whatever

zenith was probably gpt 5

#

lmarena should probably reveal the hidden models after their public release

#

Toad is nothing special btw

warm fulcrum Aug 15, 2025, 12:36 PM

#

keen beacon zenith was probably gpt 5

zenith was the nectarine model

#

im only assuming because of what theo says

#

it matches with zenith's output speed and accuracy

keen beacon Aug 15, 2025, 12:36 PM

#

ahh okay

warm fulcrum Aug 15, 2025, 12:36 PM

#

not confirmed tho

keen beacon Aug 15, 2025, 12:36 PM

#

keen beacon Toad is nothing special btw

yeah its meh

keen beacon Aug 15, 2025, 12:37 PM

#

keen beacon yeah its meh

I meant that it is not by a major provider

keen beacon Aug 15, 2025, 12:37 PM

#

keen beacon I meant that it is not by a major provider

nah its just not SOTA so IDC

wet sparrow Aug 15, 2025, 12:37 PM

#

warm fulcrum "basically guys i think gpt-5 pro is the same as o3 pro bc my gut says so"

That's why I think you don't have the Pro model. If you had paid for it and didn't notice any improvements, you would be upset too

keen beacon Aug 15, 2025, 12:37 PM

#

If it is by a major provider everyone'd already losing their crap here and there

warm fulcrum Aug 15, 2025, 12:37 PM

#

wet sparrow That's why I think you don't have the Pro model. If you had paid for it and didn...

💀

#

what are u yapping about

keen beacon Aug 15, 2025, 12:37 PM

#

keen beacon If it is by a major provider everyone'd already losing their crap here and there

but i just saw it side by side with qwen 235 b n thought hey they loook similar

warm fulcrum Aug 15, 2025, 12:37 PM

#

i have the pro model

wet sparrow Aug 15, 2025, 12:37 PM

#

Show proof

keen beacon Aug 15, 2025, 12:38 PM

#

Better follow openrouter for stealth models

wet sparrow Aug 15, 2025, 12:38 PM

#

😆

warm fulcrum Aug 15, 2025, 12:38 PM

#

wowie

#

i have gpt-5 pro it seems

#

i must be a millionaire

blazing bison Aug 15, 2025, 12:39 PM

#

There is no point in discuss this btw

hollow imp Aug 15, 2025, 12:39 PM

#

@keen beacon

#

I'm watching this

#

https://youtu.be/fHRS_NOs24w?feature=shared

YouTube

fern

The Indian Genius Nobody Understands

Check out Displate and use code FERN for 23% off one Displate, 27% off two to three, or 33% off four or more*. Or click this link to get the discount automatically: https://displate.com/@fern (ad)

This is the tragic story of one of the greatest minds in history!

If you want to learn more about the crazy math that Ramanujan and Hardy came up w...

▶ Play video

warm fulcrum Aug 15, 2025, 12:39 PM

#

india

hollow imp Aug 15, 2025, 12:39 PM

#

This documentary even better than the movie

warm fulcrum Aug 15, 2025, 12:39 PM

#

🔥

keen beacon Aug 15, 2025, 12:43 PM

#

There are so many stealth models on LMArena and they are total garbage

hollow imp Aug 15, 2025, 12:43 PM

#

@keen beacon

keen beacon Aug 15, 2025, 12:43 PM

#

What?

hollow imp Aug 15, 2025, 12:43 PM

#

What is a stealth model

#

What are they hiding from

keen beacon Aug 15, 2025, 12:43 PM

#

Toad

hollow imp Aug 15, 2025, 12:43 PM

#

Toad?

keen beacon Aug 15, 2025, 12:43 PM

#

I don't know

hollow imp Aug 15, 2025, 12:43 PM

#

https://tenor.com/view/gamatatsu-naruto-toad-jiraiya-gif-15336191263686460582

Tenor

keen beacon Aug 15, 2025, 12:43 PM

#

Yes, and also these Phantoms IIRC

#

They are nothing special anyway

hollow imp Aug 15, 2025, 12:44 PM

#

Iirc?

keen beacon Aug 15, 2025, 12:44 PM

#

If I Remember Correctly

hollow imp Aug 15, 2025, 12:44 PM

#

What is IIRC

keen beacon Aug 15, 2025, 12:44 PM

#

keen beacon They are nothing special anyway

Most hype comes from stealth models from openrouter

hollow imp Aug 15, 2025, 12:44 PM

#

Ohh

#

Bruh I thought

keen beacon Aug 15, 2025, 12:45 PM

#

Here are the big things

hollow imp Aug 15, 2025, 12:45 PM

#

IIRc is a

#

Ai term

#

😭

hollow imp Aug 15, 2025, 12:45 PM

#

keen beacon Most hype comes from stealth models from openrouter

Can you give an example of a stealth model

keen beacon Aug 15, 2025, 12:47 PM

#

hollow imp Can you give an example of a stealth model

Toad, Zenith, Nano-banana

hollow imp Aug 15, 2025, 12:47 PM

#

keen beacon Toad, Zenith, Nano-banana

I know nano banana

keen beacon Aug 15, 2025, 12:48 PM

#

Me too

#

We went to the same school together

keen beacon Aug 15, 2025, 12:51 PM

#

keen beacon Toad, Zenith, Nano-banana

They all come from big startups and enterprises btw. There's no way someone can train a LLM in their garage, so think about something as big as Netflix

#

But it doesn't matter

#

None of them are SOTA

#

Claude, Gemini or Gemini no doubt

#

Gemini has genuinely impressed me today though

ocean vortex Aug 15, 2025, 12:57 PM

#

@echo aurora make this into a channel. We have community-creations, community-polls would fit and be useful. 🙂

keen beacon Aug 15, 2025, 12:58 PM

#

keen beacon Gemini has genuinely impressed me today though

My favorite anime (it is NOT Madoka Magica) is often criticized for things that are present in 99% of other anime, but failed mostly due to marketing reasons

#

There are very few models that have identified this problem

#

Qwen and Deepseek often point it out, as does GPT-5

#

But Gemini was the only one that compared it to other shows and figured out that these criticisms do not actually matter

queen thorn Aug 15, 2025, 1:43 PM

#

blazing bison The rate limits are for all models, not for specific model. And you know that ev...

I'm fully aware that the prompts are public yes, so it's essentially a platform that lets you test different models that stop working about 2 minutes after using them?

somber rover Aug 15, 2025, 1:44 PM

#

how to make video

tired herald Aug 15, 2025, 1:44 PM

#

somber rover how to make video

#1397655624103493813 message

solid brook Aug 15, 2025, 1:48 PM

#

queen thorn I'm fully aware that the prompts are public yes, so it's essentially a platform ...

No i think only the claude opus model has limits

#

Because it is so expensive

queen thorn Aug 15, 2025, 1:49 PM

#

solid brook No i think only the claude opus model has limits

nah man, it's gemini 2.5 pro, chatgpt 5, claude 4 sonnet, opus as well

#

3.7 claude

solid brook Aug 15, 2025, 1:50 PM

#

queen thorn nah man, it's gemini 2.5 pro, chatgpt 5, claude 4 sonnet, opus as well

You sure? After how many prompts did you encounter limits on lmerena?

queen thorn Aug 15, 2025, 1:50 PM

#

solid brook You sure? After how many prompts did you encounter limits on lmerena?

one

#

look

solid brook Aug 15, 2025, 1:50 PM

#

Bruh lol

queen thorn Aug 15, 2025, 1:50 PM

#

one specific prompt

solid brook Aug 15, 2025, 1:50 PM

#

I used gpt 5 high for hours

queen thorn Aug 15, 2025, 1:51 PM

#

i'll give ya a prompt you'll see

solid brook Aug 15, 2025, 1:51 PM

#

Okay send it

queen thorn Aug 15, 2025, 1:51 PM

#

solid brook Aug 15, 2025, 1:52 PM

#

queen thorn

That is not limit

terse shuttle Aug 15, 2025, 1:52 PM

#

queen thorn

try to reload page

#

or clear cookie

queen thorn Aug 15, 2025, 1:52 PM

#

queen thorn Aug 15, 2025, 1:52 PM

#

terse shuttle or clear cookie

i've done both

#

of these things

solid brook Aug 15, 2025, 1:52 PM

#

Send prompt

terse shuttle Aug 15, 2025, 1:52 PM

#

queen thorn i've done both

reopen browser

#

idk

queen thorn Aug 15, 2025, 1:52 PM

#

i'll send you the prompt

#

hold up

#

other things work fine

#

#

whoops

#

wait

#

i'mma sent it in dms

vernal oxide Aug 15, 2025, 1:56 PM

#

guys

#

how do i know if i got the banana llm

keen beacon Aug 15, 2025, 1:58 PM

#

hollow imp This documentary even better than the movie

great!

verbal nimbus Aug 15, 2025, 1:59 PM

#

Gemini is falling behind a bit when it comes to agentic abilities (hopefully the next model proves this wrong).

tired herald Aug 15, 2025, 2:01 PM

#

the next Gemini will probably topple everything

verbal nimbus Aug 15, 2025, 2:02 PM

#

tired herald the next Gemini will probably topple everything

nightride-on might be the next one

#

Not sure whether it's Flash though

queen thorn Aug 15, 2025, 2:02 PM

#

can anyone in this chat try and use this prompt and tell me if it bricks a chat for any of you on any model?

    <form method="POST" action="{{ route('register') }}">
        @csrf

        <!-- Name -->
        <div>
            <x-input-label for="name" :value="__('Name')" />
            <x-text-input id="name" class="block mt-1 w-full" type="text" name="name" :value="old('name')" required autofocus autocomplete="name" />
            <x-input-error :messages="$errors->get('name')" class="mt-2" />
        </div>

        <!-- Username -->
        <div class="mt-4">
            <x-input-label for="username" :value="__('Username')" />
            <x-text-input id="username" class="block mt-1 w-full" type="text" name="username" :value="old('username')" required autocomplete="username" />
            <x-input-error :messages="$errors->get('username')" class="mt-2" />
            <p class="text-sm text-gray-600 mt-1">This will be your unique profile URL: {{ url('/') }}/username</p>
        </div>

        <!-- Email Address --> ```

tired herald Aug 15, 2025, 2:02 PM

#

verbal nimbus `nightride-on` might be the next one

No idea, havent played at all with stealth models

verbal nimbus Aug 15, 2025, 2:02 PM

#

tired herald No idea, havent played at all with stealth models

I only encountered it once, it was pretty good.

tired herald Aug 15, 2025, 2:03 PM

#

queen thorn can anyone in this chat try and use this prompt and tell me if it bricks a chat ...

it does

verbal nimbus Aug 15, 2025, 2:03 PM

#

It would be nice if there was a mode that tested tool use/ReAct loops more.

tired herald Aug 15, 2025, 2:03 PM

#

queen thorn can anyone in this chat try and use this prompt and tell me if it bricks a chat ...

its probably escaping the data and making the packet invalid

queen thorn Aug 15, 2025, 2:03 PM

#

yeah it's weird as hell

tired herald Aug 15, 2025, 2:03 PM

#

not really, thats how code just works sometimes

queen thorn Aug 15, 2025, 2:04 PM

#

it replied with that, and after it did, it got bricked completely

#

tired herald Aug 15, 2025, 2:04 PM

#

because it accidentally created a situation that made the LMArena website start going crazy

queen thorn Aug 15, 2025, 2:05 PM

#

tired herald because it accidentally created a situation that made the LMArena website start ...

i'm trying to figure out if there's a way to fix a bricked chat

tired herald Aug 15, 2025, 2:06 PM

#

i dont think so

queen thorn Aug 15, 2025, 2:06 PM

#

nothing seems to be working lmao

tired herald Aug 15, 2025, 2:06 PM

#

prob is impossible

#

but idk

#

ask pineapple

stray aspen Aug 15, 2025, 2:07 PM

#

queen thorn

reload and try again

#

thats what pineapple told me

queen thorn Aug 15, 2025, 2:08 PM

#

stray aspen reload and try again

#

cleaned my cache, hard reload, cleaned the browser history

#

disabled all of my extensions

#

restarted my browser

#

try this prompt and tell me if it's working for ya

queen thorn Aug 15, 2025, 2:08 PM

#

queen thorn can anyone in this chat try and use this prompt and tell me if it bricks a chat ...

.

tired herald Aug 15, 2025, 2:09 PM

#

queen thorn try this prompt and tell me if it's working for ya

Its not the browser

#

its LMArena

queen thorn Aug 15, 2025, 2:09 PM

#

has to be an issue with lmarena

#

yeah

#

yeah

#

true

tired herald Aug 15, 2025, 2:09 PM

#

i know this issue

stray aspen Aug 15, 2025, 2:09 PM

#

that code is the prompt?

queen thorn Aug 15, 2025, 2:09 PM

#

yeah copy paste it

#

into lmarena

tired herald Aug 15, 2025, 2:09 PM

#

My Extension had an issue with it

stray aspen Aug 15, 2025, 2:09 PM

#

which model

tired herald Aug 15, 2025, 2:09 PM

#

All models

queen thorn Aug 15, 2025, 2:09 PM

#

any of them

stray aspen Aug 15, 2025, 2:09 PM

#

ok

queen thorn Aug 15, 2025, 2:10 PM

#

could it be that it's happening because it's inside of that text code block?

#

that?

stray aspen Aug 15, 2025, 2:10 PM

#

that prompt is broken

#

tired herald Aug 15, 2025, 2:10 PM

#

queen thorn that?

no

stray aspen Aug 15, 2025, 2:11 PM

#

theres something wrong with your prompt

tired herald Aug 15, 2025, 2:11 PM

#

no

white hatch Aug 15, 2025, 2:11 PM

#

nothing breaks for me

tired herald Aug 15, 2025, 2:11 PM

#

the prompt is not the issue

queen thorn Aug 15, 2025, 2:11 PM

#

white hatch nothing breaks for me

HUH?

#

on a new chat tho?

tired herald Aug 15, 2025, 2:11 PM

#

white hatch nothing breaks for me

then you have luck? or you added something to the prompt

stray aspen Aug 15, 2025, 2:11 PM

#

but everything works fine until i put that prompt

tired herald Aug 15, 2025, 2:11 PM

#

its how LMArena handles messages

white hatch Aug 15, 2025, 2:11 PM

#

tired herald then you have luck? or you added something to the prompt

Probably luck

white hatch Aug 15, 2025, 2:12 PM

#

queen thorn on a new chat tho?

yes

tired herald Aug 15, 2025, 2:12 PM

#

white hatch Probably luck

Can you show a pic

queen thorn Aug 15, 2025, 2:12 PM

#

yeah idk man that's insanely weird

#

idk how??

white hatch Aug 15, 2025, 2:12 PM

#

#

tired herald Aug 15, 2025, 2:13 PM

#

how very very interesting

stray aspen Aug 15, 2025, 2:13 PM

#

thats GREAT

queen thorn Aug 15, 2025, 2:13 PM

#

white hatch

WHAT?!

tired herald Aug 15, 2025, 2:13 PM

#

I think I also know why its working for you

queen thorn Aug 15, 2025, 2:13 PM

#

gpt 5 high as well???

white hatch Aug 15, 2025, 2:13 PM

#

yes

queen thorn Aug 15, 2025, 2:14 PM

#

tired herald Aug 15, 2025, 2:14 PM

#

who wouldve guessed

#

its the 1:1 the same issue I had with files for my extension

queen thorn Aug 15, 2025, 2:15 PM

#

tired herald who wouldve guessed

bruh

tired herald Aug 15, 2025, 2:15 PM

#

queen thorn

say hi first for example

#

and then give

queen thorn Aug 15, 2025, 2:15 PM

#

let's try that

tired herald Aug 15, 2025, 2:15 PM

#

<x-guest-layout>
    <form method="POST" action="{{ route('register') }}">
        @csrf

        <!-- Name -->
        <div>
            <x-input-label for="name" :value="__('Name')" />
            <x-text-input id="name" class="block mt-1 w-full" type="text" name="name" :value="old('name')" required autofocus autocomplete="name" />
            <x-input-error :messages="$errors->get('name')" class="mt-2" />
        </div>

        <!-- Username -->
        <div class="mt-4">
            <x-input-label for="username" :value="__('Username')" />
            <x-text-input id="username" class="block mt-1 w-full" type="text" name="username" :value="old('username')" required autocomplete="username" />
            <x-input-error :messages="$errors->get('username')" class="mt-2" />
            <p class="text-sm text-gray-600 mt-1">This will be your unique profile URL: {{ url('/') }}/username</p>
        </div>

        <!-- Email Address -->

white hatch Aug 15, 2025, 2:16 PM

#

```php

tired herald Aug 15, 2025, 2:16 PM

#

yes

#

thats the code block format

queen thorn Aug 15, 2025, 2:16 PM

#

tired herald Aug 15, 2025, 2:16 PM

#

You need to copy my message through the discord copying

#

try that

#

<x-guest-layout>
    <form method="POST" action="{{ route('register') }}">
        @csrf

        <!-- Name -->
        <div>
            <x-input-label for="name" :value="__('Name')" />
            <x-text-input id="name" class="block mt-1 w-full" type="text" name="name" :value="old('name')" required autofocus autocomplete="name" />
            <x-input-error :messages="$errors->get('name')" class="mt-2" />
        </div>

        <!-- Username -->
        <div class="mt-4">
            <x-input-label for="username" :value="__('Username')" />
            <x-text-input id="username" class="block mt-1 w-full" type="text" name="username" :value="old('username')" required autocomplete="username" />
            <x-input-error :messages="$errors->get('username')" class="mt-2" />
            <p class="text-sm text-gray-600 mt-1">This will be your unique profile URL: {{ url('/') }}/username</p>
        </div>

        <!-- Email Address -->

queen thorn Aug 15, 2025, 2:17 PM

#

tired herald Aug 15, 2025, 2:17 PM

#

still not copying with discord bruv

#

``

#

`

echo aurora Aug 15, 2025, 2:17 PM

#

ocean vortex <@283397944160550928> make this into a channel. We have community-creations, com...

Maybe. I'm like to try and keep channel lists light, and I don't think polls get used enough to justify their own channel atm.

tired herald Aug 15, 2025, 2:17 PM

#

you need three of `

#

at the beginning

#

and end

#

works everytime for me

white hatch Aug 15, 2025, 2:18 PM

#

📎 Presets_details.txt

tired herald Aug 15, 2025, 2:18 PM

#

yes

#

that

#

exactly that

#

copy it and paste

#

queen thorn Aug 15, 2025, 2:19 PM

#

#

interesting

tired herald Aug 15, 2025, 2:19 PM

#

its message handling stuff

queen thorn Aug 15, 2025, 2:20 PM

#

that's so freaking weird

tired herald Aug 15, 2025, 2:21 PM

#

🙂

stray aspen Aug 15, 2025, 2:21 PM

#

thats GREAT

verbal nimbus Aug 15, 2025, 2:21 PM

#

Interesting that GPT-5-High actually loses to Gemini 2.5 Pro the majority of the time

tired herald Aug 15, 2025, 2:22 PM

#

well

#

it happens

queen thorn Aug 15, 2025, 2:23 PM

#

welp, so lmarena

#

gotta fix some stuff

tired herald Aug 15, 2025, 2:23 PM

#

well, LMArena is changing its shape slowly

queen thorn Aug 15, 2025, 2:24 PM

#

new update?!

#

haha

tired herald Aug 15, 2025, 2:24 PM

#

no

queen thorn Aug 15, 2025, 2:24 PM

#

jkjk

tired herald Aug 15, 2025, 2:24 PM

#

🙂

#

Im making another small extension to change the look of LMArena into more OpenChat style yk

stray aspen Aug 15, 2025, 2:25 PM

#

can you put the model selector next to the iamge button

white hatch Aug 15, 2025, 2:25 PM

#

tired herald Im making another small extension to change the look of LMArena into more OpenCh...

yooooooo good luck

tired herald Aug 15, 2025, 2:26 PM

#

stray aspen can you put the model selector next to the iamge button

Thats what im working on

#

right this second

#

very fun stuff

verbal nimbus Aug 15, 2025, 2:27 PM

#

verbal nimbus Interesting that GPT-5-High actually loses to Gemini 2.5 Pro the majority of the...

Actually it's kinda odd, because Gemini has a higher win rate against more models, yet is ranked lower. Green = higher win rate. Red = lower win rate. Yellow = tie/no data. Statistical paradox?

queen thorn Aug 15, 2025, 2:29 PM

#

tired herald well, LMArena is changing its shape slowly

make it so that the prompts are gonna work by adding that

hello

😂

tired herald Aug 15, 2025, 2:30 PM

#

😭

queen thorn Aug 15, 2025, 2:30 PM

#

just kidding it's still not gonna do anything

tired herald Aug 15, 2025, 2:31 PM

#

the + has an issue with me

#

oh no

queen thorn Aug 15, 2025, 2:31 PM

#

tired herald the + has an issue with me

make ai give it a quick upper cut back into place

verbal nimbus Aug 15, 2025, 2:31 PM

#

verbal nimbus Actually it's kinda odd, because Gemini has a higher win rate against more model...

@echo aurora How does this work?

tired herald Aug 15, 2025, 2:31 PM

#

queen thorn make ai give it a quick upper cut back into place

ai would fail miserably 😭

solid brook Aug 15, 2025, 2:32 PM

#

queen thorn make it so that the prompts are gonna work by adding that ```php hello ``` 😂

Lol the filters turn off when adding ```

tired herald Aug 15, 2025, 2:32 PM

#

oh what

#

no way

queen thorn Aug 15, 2025, 2:32 PM

#

huh?

#

wait lemme try and use that on that website

tired herald Aug 15, 2025, 2:33 PM

#

test it out

solid brook Aug 15, 2025, 2:33 PM

#

Uhm but still the model itself has safety filters

keen beacon Aug 15, 2025, 2:33 PM

#

solid brook Uhm but still the model itself has safety filters

Yea and it won't let me generate girls kissing

#

😠

tired herald Aug 15, 2025, 2:33 PM

#

solid brook Uhm but still the model itself has safety filters

Not when you give it a really nice system prompt

solid brook Aug 15, 2025, 2:34 PM

#

Yeah....

tired herald Aug 15, 2025, 2:34 PM

#

queen thorn Aug 15, 2025, 2:34 PM

#

?

tired herald Aug 15, 2025, 2:34 PM

#

filters....

queen thorn Aug 15, 2025, 2:34 PM

#

yeah hold on

#

it's still responding it's funny

#

to me

solid brook Aug 15, 2025, 2:36 PM

#

tired herald

How to do that?

verbal nimbus Aug 15, 2025, 2:36 PM

#

tired herald the + has an issue with me

Did it fall out of the flex box?

tired herald Aug 15, 2025, 2:36 PM

#

Prob because I turned it into a circle

tired herald Aug 15, 2025, 2:36 PM

#

solid brook How to do that?

Not yet available for normal people

solid brook Aug 15, 2025, 2:37 PM

#

tired herald Not yet available for normal people

You mean you're part of lmarena team?

#

Or connected to them

random wolf Aug 15, 2025, 2:38 PM

#

how do I stop "generating"? it's been an hour, lol

verbal nimbus Aug 15, 2025, 2:38 PM

#

tired herald Prob because I turned it into a circle

That shouldn't change it

verbal nimbus Aug 15, 2025, 2:38 PM

#

random wolf how do I stop "generating"? it's been an hour, lol

Did you refresh the page?

tired herald Aug 15, 2025, 2:38 PM

#

verbal nimbus That shouldn't change it

no idea

random wolf Aug 15, 2025, 2:38 PM

#

@verbal nimbus yes brother, I did.

#

is there any solution, to fix this problem? it's so frustrating.

queen thorn Aug 15, 2025, 2:39 PM

#

do you think it's possible to save a chat that has that prompt that bricks it?

tired herald Aug 15, 2025, 2:39 PM

#

verbal nimbus That shouldn't change it

just lemme cook well

verbal nimbus Aug 15, 2025, 2:39 PM

#

random wolf <@858135822389346344> yes brother, I did.

Do new chats work?

queen thorn Aug 15, 2025, 2:39 PM

#

because something weird just happened

tired herald Aug 15, 2025, 2:39 PM

#

ups

random wolf Aug 15, 2025, 2:39 PM

#

verbal nimbus Do new chats work?

it's working. but the chat AI I used, knows all the information and important things I have

queen thorn Aug 15, 2025, 2:40 PM

#

something very weird just happened @tired herald ya gotta help me understand that

#

that's insanely weird

#

so i've had a chat that was bricked right?

#

couldn't send any messages whatsoever

#

not even a hello

tired herald Aug 15, 2025, 2:40 PM

#

yes

#

?

verbal nimbus Aug 15, 2025, 2:41 PM

#

random wolf it's working. but the chat AI I used, knows all the information and important th...

Ah yeah, that's annoying. Very odd that it hasn't timed-out.

queen thorn Aug 15, 2025, 2:41 PM

#

i've responded with ``` and then wrote the whole code thing

#

and it legit worked somehow???

verbal nimbus Aug 15, 2025, 2:41 PM

#

random wolf it's working. but the chat AI I used, knows all the information and important th...

If you want the info, you can copy the whole chat out by manually highlighting everything and ask another AI to clean the raw paste and summarize it.

queen thorn Aug 15, 2025, 2:41 PM

#

random wolf Aug 15, 2025, 2:41 PM

#

verbal nimbus Ah yeah, that's annoying. Very odd that it hasn't timed-out.

I hope they will fix it

queen thorn Aug 15, 2025, 2:41 PM

#

but now

#

tired herald Aug 15, 2025, 2:42 PM

#

queen thorn

ok, I really dont know why its happening, thats really weird

#

@echo aurora

#

we need your help

random wolf Aug 15, 2025, 2:42 PM

#

verbal nimbus If you want the info, you can copy the whole chat out by manually highlighting e...

I will try bro. thank you! that's a big help

queen thorn Aug 15, 2025, 2:43 PM

#

tired herald <@283397944160550928>

legit yeah, WE NEED ANSWERS!!!

#

haha

tired herald Aug 15, 2025, 2:44 PM

#

dem

solid brook Aug 15, 2025, 2:49 PM

#

queen thorn legit yeah, WE NEED ANSWERS!!!

Report in #1343291835845578853

echo aurora Aug 15, 2025, 2:51 PM

#

tired herald <@283397944160550928>

Hey sorry I'm in the middle of something else and haven't been following this chat closely. Can you submit a bug report and TLDR everything so I can take a look/let the team know?

queen thorn Aug 15, 2025, 2:52 PM

#

echo aurora Hey sorry I'm in the middle of something else and haven't been following this ch...

i'll do that right now

white hatch Aug 15, 2025, 2:53 PM

#

btw, try clearing cookie files

tired herald Aug 15, 2025, 2:54 PM

#

#ai-creations message

#

do the buttons look good enough?

grizzled turtle Aug 15, 2025, 2:58 PM

#

if someone know how to stop generation? my generation with chatbot bugged, its just loads endlessly

#

for example, so that the chatbot gives an error or if possible in some other way

echo aurora Aug 15, 2025, 2:59 PM

#

poll_question_text

What version do you use the most?

victor_answer_votes

13

total_votes

24

victor_answer_id

3

victor_answer_text

Direct

terse shuttle Aug 15, 2025, 3:00 PM

#

grizzled turtle if someone know how to stop generation? my generation with chatbot bugged, its j...

nohow

echo aurora Aug 15, 2025, 3:00 PM

#

grizzled turtle if someone know how to stop generation? my generation with chatbot bugged, its j...

Unfortunately, your only two options at the moment are to refresh the page (sometimes this works) or start a new chat. This is a problem we're aware of where models will continue to generate without ending.

grizzled turtle Aug 15, 2025, 3:01 PM

#

echo aurora Unfortunately, your only two options at the moment are to refresh the page (some...

unfortunately, restarting the site doesnt help, Ill have to make a new chat(

#

thanks for the answer

glossy epoch Aug 15, 2025, 3:43 PM

#

Hello to all

red sluice Aug 15, 2025, 3:46 PM

#

Why is style control not called "length control" instead?
Style control is confusing!

inner gate Aug 15, 2025, 3:55 PM

#

Howdy partners

echo aurora Aug 15, 2025, 3:56 PM

#

grizzled turtle unfortunately, restarting the site doesnt help, Ill have to make a new chat(

Really sorry to hear that it didn't work. The current state what happens when chats get stuck isn't great, and we plan to make changes to help with this.

raven helm Aug 15, 2025, 3:57 PM

#

@echo aurora , I’m sorry to interrupt, I’d just like to ask a quick question; why is it that there is no direct chat on WebDev Arena?

inner gate Aug 15, 2025, 4:03 PM

#

raven helm <@283397944160550928> , I’m sorry to interrupt, I’d just like to ask a quick que...

What’s webdev used for?

patent bane Aug 15, 2025, 4:05 PM

#

is gpt-5-high on lmarena supports tools?

terse shuttle Aug 15, 2025, 4:23 PM

#

patent bane is gpt-5-high on lmarena supports tools?

nah

#

only chat or gpt-5-search in search mode

terse shuttle Aug 15, 2025, 4:24 PM

#

inner gate What’s webdev used for?

react apps

#

but WebDeb doesn't hace direct or side by side mode 🙁

hardy lion Aug 15, 2025, 4:33 PM

#

red sluice Why is style control not called "length control" instead? Style control is confu...

because it uses others things besides length, like lists, bold and markdown formatting

ancient reef Aug 15, 2025, 4:37 PM

#

Any opinions on "folsom-0805-1"
I tried it on a logic prompt and it actually wasn't terrible.

obsidian cargo Aug 15, 2025, 4:37 PM

#

I wish we could compare toad to zenith

whole wagon Aug 15, 2025, 4:42 PM

#

“We have to make these horrible trade-offs right now,” he said. “We have better models, and we just can’t offer them because we don’t have the capacity. We have other kinds of new products and services we’d love to offer.” Sam in recent interview

queen thorn Aug 15, 2025, 4:42 PM

#

BRO HOW IS NO ONE COMPLAINING ABOUT THIS?!

#

IT'S LITERALLY ALWAYS HAPPENING LEGIT ONE SINGLE PROMPT CAUSED IT

#

WHAT THE HELL

whole wagon Aug 15, 2025, 4:42 PM

#

That's been a thing forever, seems no resolve

queen thorn Aug 15, 2025, 4:43 PM

#

bruhhhhhhhhhhhhhhhhhhhhhhhhh

stray aspen Aug 15, 2025, 4:43 PM

#

queen thorn BRO HOW IS NO ONE COMPLAINING ABOUT THIS?!

wdym lmao

#

its probably the most common complaint

queen thorn Aug 15, 2025, 4:43 PM

#

and still no solution? not even a temporary fix?

stray aspen Aug 15, 2025, 4:43 PM

#

no

queen thorn Aug 15, 2025, 4:43 PM

#

bruv

stray aspen Aug 15, 2025, 4:43 PM

#

lmarena team is working on it

tired herald Aug 15, 2025, 4:44 PM

#

uff

gentle plinth Aug 15, 2025, 4:46 PM

#

whole wagon ```“We have to make these horrible trade-offs right now,” he said. “We have bett...

even gpt-oss-120b is better then the minimal effort gpt-5 imo

queen thorn Aug 15, 2025, 4:46 PM

#

gentle plinth even gpt-oss-120b is better then the minimal effort gpt-5 imo

Something went wrong while generating the response. Please try again.

ocean vortex Aug 15, 2025, 4:48 PM

#

gentle plinth even gpt-oss-120b is better then the minimal effort gpt-5 imo

gpt5-minimal < gpt4.1

hollow imp Aug 15, 2025, 4:49 PM

#

https://cdn.discordapp.com/attachments/1261350936769724548/1405882476232048752/image.png?ex=68a07196&is=689f2016&hm=34908bdcd463852a3199f976aab3472fe8319abd9a0d1a815cae7a2dc0f73df0&

ocean vortex Aug 15, 2025, 4:49 PM

#

but to make this more confusing, this is also likely true:
gpt5-chat > gpt4.1

gentle plinth Aug 15, 2025, 4:49 PM

#

ocean vortex but to make this more confusing, this is also likely true: gpt5-chat > gpt4.1

it depends to which model you get routed

gentle plinth Aug 15, 2025, 4:49 PM

#

ocean vortex gpt5-minimal < gpt4.1

anything is better then gpt5-minimal xD

ocean vortex Aug 15, 2025, 4:50 PM

#

gentle plinth it depends to which model you get routed

gpt5-minimal is probably only in the API

gentle plinth Aug 15, 2025, 4:50 PM

#

it uses that if it "thinks" that the question doesnt require much thinking

#

which is like in 99% of the cases i feel

brittle furnace Aug 15, 2025, 4:51 PM

#

gentle plinth it uses that if it "thinks" that the question doesnt require much thinking

Something went wrong while generating the response. Please try again.

ocean vortex Aug 15, 2025, 4:52 PM

#

gentle plinth it uses that if it "thinks" that the question doesnt require much thinking

it uses gpt5-chat. That's direct replacement model for chatgpt-4o-latest. It's a different model from gpt5-minimal, this one has no reasoning at all so it most definitely performs better as reasoning wasn't the focus at all

gentle plinth Aug 15, 2025, 4:53 PM

#

terse shuttle but WebDeb doesn't hace direct or side by side mode 🙁

just use the prompt in direct chat, and paste it into react project:

📎 webdev_prompt.txt

tired herald Aug 15, 2025, 4:55 PM

#

very cool

#

ill take this and use it as my system prompt

ocean vortex Aug 15, 2025, 4:55 PM

#

instead of reasoning being something it was trained for that was then later taken away... It was fine-tuned from the get go to perform as good as possible without relying on reasoning

hollow imp Aug 15, 2025, 4:56 PM

#

@gentle plinth pls pdf support noah 🙏

gentle plinth Aug 15, 2025, 4:56 PM

#

wdym

#

i just asked it to generate a site with the prompt

#

and copied it

#

its what they are using in webdev arena apparently

hollow imp Aug 15, 2025, 4:58 PM

#

gentle plinth its what they are using in webdev arena apparently

What is webdev arena

gentle plinth Aug 15, 2025, 4:58 PM

#

but i honestly cannot guarantee that it will work out of the box

#

bc they seem to have some specific project setup

#

but maybe ai can help you with that

gentle plinth Aug 15, 2025, 4:59 PM

#

hollow imp What is webdev arena

https://web.lmarena.ai/

hollow imp Aug 15, 2025, 4:59 PM

#

What does it do

gentle plinth Aug 15, 2025, 4:59 PM

#

it generates a website

#

with two random models

hollow imp Aug 15, 2025, 4:59 PM

#

gentle plinth wdym

You are the ceo of lmaren right?

gentle plinth Aug 15, 2025, 4:59 PM

#

and you have to say which one is better

gentle plinth Aug 15, 2025, 4:59 PM

#

hollow imp You are the ceo of lmaren right?

no

tired herald Aug 15, 2025, 4:59 PM

#

whats even going on here

gentle plinth Aug 15, 2025, 5:00 PM

#

the purple name is just a role i picked xD

hollow imp Aug 15, 2025, 5:00 PM

#

gentle plinth it generates a website

Isn't creating a website very hard and you need wix for that?

indigo hazel Aug 15, 2025, 5:00 PM

#

https://x.com/Alibaba_Qwen/status/1956399490698735950

Qwen (@Alibaba_Qwen)

🚀 Qwen Chat Desktop for Windows is here!
💻 All the power of Qwen Chat — now with MCP support for smarter, faster agents.
⚡ Run up MCP Servers, supercharge your productivity, and stay in control.
📥 Download now → https://t.co/uYQIIGQAJo

gentle plinth Aug 15, 2025, 5:00 PM

#

hollow imp Isn't creating a website very hard and you need wix for that?

ai can do some small websites

hollow imp Aug 15, 2025, 5:00 PM

#

gentle plinth ai can do some small websites

I'm curious

gentle plinth Aug 15, 2025, 5:00 PM

#

for larger projects of course it can get to its limits

hollow imp Aug 15, 2025, 5:00 PM

#

I'mma go play with it

#

What should I say but

gentle plinth Aug 15, 2025, 5:01 PM

#

indigo hazel https://x.com/Alibaba_Qwen/status/1956399490698735950

why would i install something that i can use in the webbrowser

stray aspen Aug 15, 2025, 5:01 PM

#

qwen sucks lmao lol rofl

hollow imp Aug 15, 2025, 5:01 PM

#

Where is direct chat

#

In webdevarenn

stray aspen Aug 15, 2025, 5:02 PM

#

there isnt

atomic stream Aug 15, 2025, 5:02 PM

#

Qwen is good over deepseek?

stray aspen Aug 15, 2025, 5:02 PM

#

not at coding

hollow imp Aug 15, 2025, 5:02 PM

#

gentle plinth ai can do some small websites

I want it to show a photo in the website

stray aspen Aug 15, 2025, 5:02 PM

#

but the image model from qwen is great

atomic stream Aug 15, 2025, 5:02 PM

#

What are your thoughts on Kimi AI?

stray aspen Aug 15, 2025, 5:03 PM

#

it sucks

#

its only good for writing

#

they need to make a reasoning version

hollow imp Aug 15, 2025, 5:03 PM

#

stray aspen it sucks

How's Gpt 5 high

stray aspen Aug 15, 2025, 5:03 PM

#

hollow imp How's Gpt 5 high

its amazing its SotA

hollow imp Aug 15, 2025, 5:03 PM

#

stray aspen its only good for writing

Isn't Claude on top for that

whole wagon Aug 15, 2025, 5:03 PM

#

@echo aurora is it possible to add different reasoning efforts for gpt models? Because we don't get high in the chatGPT app

hollow imp Aug 15, 2025, 5:03 PM

#

stray aspen its amazing its SotA

What about gpt 5 search

bronze urchin Aug 15, 2025, 5:04 PM

#

Do you have a good explanation for prompt injection these days? GPT5 compatible?

stray aspen Aug 15, 2025, 5:04 PM

#

idk

#

i dont use search

hollow imp Aug 15, 2025, 5:04 PM

#

Why

#

Mf pays for openai subscription

#

😡

stray aspen Aug 15, 2025, 5:04 PM

#

because no

#

im not doing any reasearch

bronze urchin Aug 15, 2025, 5:04 PM

#

hollow imp Why

Want to know if my company is safe

stray aspen Aug 15, 2025, 5:04 PM

#

mostly coding

hollow imp Aug 15, 2025, 5:04 PM

#

@stray aspen why don't you compare lmarena gpt 5 high and yupp ai

stray aspen Aug 15, 2025, 5:04 PM

#

hollow imp <@612078049193885696> why don't you compare lmarena gpt 5 high and yupp ai

i dont needd to lmao

hollow imp Aug 15, 2025, 5:05 PM

#

bronze urchin Want to know if my company is safe

What company

atomic stream Aug 15, 2025, 5:05 PM

#

Does Lmarea have any limits aside from opus?

glass gulch Aug 15, 2025, 5:05 PM

#

badlydrawnmmlol

stray aspen Aug 15, 2025, 5:05 PM

#

lmarena is greater than yupp ai regarding usage of gpt-5 high

errant cave Aug 15, 2025, 5:05 PM

#

What a crash holy crap

stray aspen Aug 15, 2025, 5:05 PM

#

holy sigma

hollow imp Aug 15, 2025, 5:05 PM

#

@gentle plinth helloo

gentle plinth Aug 15, 2025, 5:05 PM

#

seems like gpt-oss-120b is around o3-mini level for hard prompts

regal river Aug 15, 2025, 5:05 PM

#

GPT 4o better than GPT 5 wtf

proud yoke Aug 15, 2025, 5:05 PM

#

How did they manage to make 5 worse than 4o 😭

quiet dust Aug 15, 2025, 5:05 PM

#

Is it possible to switch between Auto, Fast and Thinking models on the phone in ChatGPT?

proud yoke Aug 15, 2025, 5:05 PM

#

Is it even cheaper to run?

glass gulch Aug 15, 2025, 5:06 PM

#

How recent is the leaderboard for the rest of you

errant cave Aug 15, 2025, 5:06 PM

#

I think it's likely that OpenAI and lmarena came to some kind of agreement to make GPT-5 look better than it actually was on release

hollow imp Aug 15, 2025, 5:06 PM

#

regal river GPT 4o better than GPT 5 wtf

o3 search is better than gpt5 search I can say that with 100% surity

stray aspen Aug 15, 2025, 5:06 PM

#

new leaderboard

glass gulch Aug 15, 2025, 5:06 PM

#

i cant tell if the announcement implies it changed rn lol

modest prism Aug 15, 2025, 5:06 PM

#

errant cave What a crash holy crap

4o better than 4.5 ? How?

errant cave Aug 15, 2025, 5:06 PM

#

Or maybe OpenAI just deceived lmarena before release

gusty helm Aug 15, 2025, 5:06 PM

#

gemini still top of pack 😄 ?

hollow imp Aug 15, 2025, 5:06 PM

#

errant cave I think it's likely that OpenAI and lmarena came to some kind of agreement to ma...

@echo aurora 🙀

errant cave Aug 15, 2025, 5:06 PM

#

modest prism 4o better than 4.5 ? How?

4o was always top tier

quiet dust Aug 15, 2025, 5:06 PM

#

Guys. Is it possible to switch between Auto, Fast and Thinking models on the phone in ChatGPT?

stray aspen Aug 15, 2025, 5:06 PM

#

errant cave I think it's likely that OpenAI and lmarena came to some kind of agreement to ma...

gpt-5 sucked on release

#

it was trash

#

now its way better

tired herald Aug 15, 2025, 5:06 PM

#

still trash

errant cave Aug 15, 2025, 5:06 PM

#

Still sucks

gentle plinth Aug 15, 2025, 5:06 PM

#

modest prism 4o better than 4.5 ? How?

people vote for it because of its sycophancy

stray aspen Aug 15, 2025, 5:07 PM

#

no

#

its great

#

its SotA

tired herald Aug 15, 2025, 5:07 PM

#

golden trash is still trash

hollow imp Aug 15, 2025, 5:07 PM

#

GPT-3o

stray aspen Aug 15, 2025, 5:07 PM

#

lol

hollow imp Aug 15, 2025, 5:07 PM

#

SOTA

errant cave Aug 15, 2025, 5:07 PM

#

I like GPT-5 clapping back against stupid ideas in contrast to 4o

#

That's like the only thing it's better at though

atomic stream Aug 15, 2025, 5:07 PM

#

stray aspen lmarena is greater than yupp ai regarding usage of gpt-5 high

Yupp ai, is this free?

tired herald Aug 15, 2025, 5:07 PM

#

how interesting

vapid zinc Aug 15, 2025, 5:08 PM

#

i dont understand how ppl think 2.5 pro is so good

stray aspen Aug 15, 2025, 5:08 PM

#

@tired heraldhow much are eyou gonna sell the plugin for

modest prism Aug 15, 2025, 5:08 PM

#

gentle plinth people vote for it because of its sycophancy

OpenAI made people addicted to this model.

hollow imp Aug 15, 2025, 5:08 PM

#

atomic stream Yupp ai, is this free?

It's offering o3 pro for free

#

Such a scam

stray aspen Aug 15, 2025, 5:08 PM

#

vapid zinc i dont understand how ppl think 2.5 pro is so good

i mean its not bad

hollow imp Aug 15, 2025, 5:08 PM

#

Gives you credits for using the ai

tired herald Aug 15, 2025, 5:08 PM

#

stray aspen <@1395809769947660389>how much are eyou gonna sell the plugin for

im not selling, im gonna put up the code for free

stray aspen Aug 15, 2025, 5:08 PM

#

but gpt-5 is way better

tired herald Aug 15, 2025, 5:08 PM

#

prob on github

hollow imp Aug 15, 2025, 5:08 PM

#

Does anyone use webdev arena

vapid zinc Aug 15, 2025, 5:08 PM

#

stray aspen i mean its not bad

yeah, not bad but grok 4, o3 pro, 5 pro are all noticably better no question

stray aspen Aug 15, 2025, 5:09 PM

#

hollow imp Such a scam

its not a scam lmao

#

stop yapping

hollow imp Aug 15, 2025, 5:09 PM

#

It is

stray aspen Aug 15, 2025, 5:09 PM

#

but right now lmarena is way better

hollow imp Aug 15, 2025, 5:09 PM

#

100%

stray aspen Aug 15, 2025, 5:09 PM

#

vapid zinc yeah, not bad but grok 4, o3 pro, 5 pro are all noticably better no question

agreed

hollow imp Aug 15, 2025, 5:09 PM

#

There's no better worse in 2+2

atomic stream Aug 15, 2025, 5:09 PM

#

hollow imp Gives you credits for using the ai

When I went there, it asked me for a login; thank God I have not.

hollow imp Aug 15, 2025, 5:09 PM

#

@gentle plinth Vro hello

gentle plinth Aug 15, 2025, 5:09 PM

#

stop randomly tagging me

tired herald Aug 15, 2025, 5:09 PM

#

ups, LMArena doesnt like this at all

modest prism Aug 15, 2025, 5:09 PM

#

vapid zinc i dont understand how ppl think 2.5 pro is so good

The 03-25 variant was amazing. They just ruined it.

gentle plinth Aug 15, 2025, 5:09 PM

#

or i will block

vapid zinc Aug 15, 2025, 5:10 PM

#

stray aspen agreed

I think its bc on lm arena ppl ask simple questions

stray aspen Aug 15, 2025, 5:10 PM

#

i love how gemini 2.5 pro preview versions were better than the final lol

echo aurora Aug 15, 2025, 5:10 PM

#

errant cave I think it's likely that OpenAI and lmarena came to some kind of agreement to ma...

No. We wouldn't do something like that, ever. That'd go against everything we stand for and are trying to build.

hollow imp Aug 15, 2025, 5:10 PM

#

gentle plinth stop randomly tagging me

How to attach a photo in webdev

gentle plinth Aug 15, 2025, 5:10 PM

#

its not possible

#

currently

#

use the prompt i said in direct chat

hollow imp Aug 15, 2025, 5:10 PM

#

No one has done it

gentle plinth Aug 15, 2025, 5:10 PM

#

then you can

hollow imp Aug 15, 2025, 5:10 PM

#

gentle plinth use the prompt i said in direct chat

Give

gentle plinth Aug 15, 2025, 5:11 PM

#

hollow imp Give

#general message

#

but as i said it will probably not work out of the box

#

ask ai to help you run it

hollow imp Aug 15, 2025, 5:11 PM

#

gentle plinth but as i said it will probably not work out of the box

It will not run on chrome?

#

Wdym

whole wagon Aug 15, 2025, 5:11 PM

#

whole wagon <@283397944160550928> is it possible to add different reasoning efforts for gpt ...

🙁

#

I want to see where the gpt I actually use in chat ranks

modest prism Aug 15, 2025, 5:12 PM

#

stray aspen i love how gemini 2.5 pro preview versions were better than the final lol

Recently I use 2.5 flash more than 2.5 pro. DeepMind used some over-optimized technical changes to 2.5 pro, they do not know how to fix it.

atomic stream Aug 15, 2025, 5:12 PM

#

stray aspen lmarena is greater than yupp ai regarding usage of gpt-5 high

What about Gemini and others? Have anyone got limts on it?

errant cave Aug 15, 2025, 5:12 PM

#

whole wagon I want to see where the gpt I actually use in chat ranks

It's gpt-5-chat I believe

stray aspen Aug 15, 2025, 5:12 PM

#

atomic stream What about Gemini and others? Have anyone got limts on it?

on lmarena?

echo aurora Aug 15, 2025, 5:12 PM

#

whole wagon 🙁

Sorry currently busy with other stuff, will try to address later blobthanks

errant cave Aug 15, 2025, 5:12 PM

#

Fifth place at 1427 ELO

stray aspen Aug 15, 2025, 5:12 PM

#

i have only run into limits with claude 4.1 opus on lmarena

atomic stream Aug 15, 2025, 5:12 PM

#

stray aspen on lmarena?

Yeah

stray aspen Aug 15, 2025, 5:12 PM

#

atomic stream Yeah

as far as i know claude 4.1 opus has limits

whole wagon Aug 15, 2025, 5:12 PM

#

errant cave It's gpt-5-chat I believe

chatGPT has thinking variants still

stray aspen Aug 15, 2025, 5:12 PM

#

not sure about the rest

#

havent run into limits

whole wagon Aug 15, 2025, 5:13 PM

#

They just don't use the high reasoning

coarse glade Aug 15, 2025, 5:13 PM

#

You guys were lying in gpt 5 it really is gpt 4 you guys are lying to us

#

It’s really a bad thing guys

#

I have proof

whole wagon Aug 15, 2025, 5:13 PM

#

Bruh

coarse glade Aug 15, 2025, 5:13 PM

#

#

Look at this

#

And each gpt 5 model says this

hollow imp Aug 15, 2025, 5:14 PM

#

gentle plinth https://discord.com/channels/1340554757349179412/1340554757827461211/14059575082...

If I paste the prompt into gpt-5 high, will that work?

coarse glade Aug 15, 2025, 5:14 PM

#

See you guys were lying

stray aspen Aug 15, 2025, 5:14 PM

#

coarse glade And each gpt 5 model says this

the api doesnt know what model it is

stray aspen Aug 15, 2025, 5:14 PM

#

coarse glade See you guys were lying

its not stop yapping

#

theres no system prompt and the api doesnt know what it is