#general | Arena | Page 99

solid brook Aug 16, 2025, 2:15 PM

#

bruh

ocean vortex Aug 16, 2025, 2:15 PM

#

I was about to say if it was available still people would see for themselves.... But we had the exact same with OpenAI

sullen quest Aug 16, 2025, 2:15 PM

#

It's weird that you blame Huawei so much when It's probably mostly the natural consequence of distillation being done by all major ai companies instead of just deepseek which was most of their advantage

ocean vortex Aug 16, 2025, 2:15 PM

#

and having earlier checkpoints accessible did not help those people much LOL

solid brook Aug 16, 2025, 2:15 PM

#

i dunno man...... i guess the ai that first could write me a very good 2000 line code and now cant even write 1000 is the same

sullen quest Aug 16, 2025, 2:16 PM

#

ocean vortex and having earlier checkpoints accessible did not help those people much LOL

I agree that older 2.5 was better

ocean vortex Aug 16, 2025, 2:16 PM

#

sullen quest It's weird that you blame Huawei so much when It's probably mostly the natural c...

Nah I'm not blaming anyone. Just trying to look at this from their perspective.

#

Deepseek

hollow imp Aug 16, 2025, 2:16 PM

#

Doms

ocean vortex Aug 16, 2025, 2:17 PM

#

Huawei are 3rd party. I have no clue what is their involvment with CCP

solid brook Aug 16, 2025, 2:17 PM

#

yeah no way the version of gemini 2.5 we have is only 10 points behind gpt 5 high

#

gpt 5 high is way better than it

sullen quest Aug 16, 2025, 2:19 PM

#

Maybe people are down voting it because of how long it takes to load

hollow imp Aug 16, 2025, 2:19 PM

#

sullen quest Maybe people are down voting it because of how long it takes to load

Down voting what

sullen quest Aug 16, 2025, 2:20 PM

#

Get 5

#

Gpt

keen beacon Aug 16, 2025, 2:20 PM

#

solid brook gpt 5 high is way better than it

Do you realize that there is more difference in 10 ELO in the top of the table than in the middle?

#

Or maybe I am just stupid

solid brook Aug 16, 2025, 2:22 PM

#

oh i checked the leaderbord......
it shows gemini 2,5 pro higher than gpt 5 high.....

sullen quest Aug 16, 2025, 2:23 PM

#

Wait wat

stray aspen Aug 16, 2025, 2:23 PM

#

any gemini 3 news

ocean vortex Aug 16, 2025, 2:23 PM

#

solid brook i dunno man...... i guess the ai that first could write me a very good 2000 line...

I haven't noticed any decrease in length of responses personally, it can still do a very long ones. Especially with a system prompt. But that would be fine-tuning and it merely being a different model still, nothing like "lobotomizing"

solid brook Aug 16, 2025, 2:23 PM

#

also the leaderbord does not make sense

sullen quest Aug 16, 2025, 2:23 PM

#

In vision only mate

solid brook Aug 16, 2025, 2:23 PM

#

gpt 5 chat which is sht is higher than grok 4 in the leaderbord

hollow imp Aug 16, 2025, 2:23 PM

#

ocean vortex I haven't noticed any decrease in length of responses personally, it can still d...

Lobotomy kaisen

#

https://tenor.com/view/jjk-picmix-jujutsu-kaisen-lobotomy-jogoat-fraudkuna-gif-11968307351271925021

Tenor

sullen quest Aug 16, 2025, 2:24 PM

#

solid brook gpt 5 chat which is sht is higher than grok 4 in the leaderbord

Shoop you are looking at vision

sullen quest Aug 16, 2025, 2:25 PM

#

solid brook gpt 5 chat which is sht is higher than grok 4 in the leaderbord

It's for reading images

solid brook Aug 16, 2025, 2:25 PM

#

sullen quest Shoop you are looking at vision

huh

#

im looking at the overall leaderbord

sullen quest Aug 16, 2025, 2:26 PM

#

Well on my screen text arena doesn't say that

#

I will say apparently we are in a 3 way tie between gpt5 gemini2.5 and claude opus

solid brook Aug 16, 2025, 2:27 PM

#

sullen quest I will say apparently we are in a 3 way tie between gpt5 gemini2.5 and claude op...

opus and gpt 5 yes but 2.5 no

#

have you actually used 2.5 pro for coding?

sullen quest Aug 16, 2025, 2:27 PM

#

Are we looking at the same leader board?

solid brook Aug 16, 2025, 2:28 PM

#

yeah

sullen quest Aug 16, 2025, 2:28 PM

#

Doesn't look like it

solid brook Aug 16, 2025, 2:28 PM

#

give ss

hollow imp Aug 16, 2025, 2:29 PM

#

EVIL AI 🙀

#

https://cdn.discordapp.com/attachments/1354459438966112467/1399799378369183775/Screenshot_20250729-221553111_1.jpg?ex=68a16282&is=68a01102&hm=c34a622f097ccf2ceca8289f2474e99d8b4680f704074101519a7a38e45c5827&

sullen quest Aug 16, 2025, 2:32 PM

#

sullen quest Aug 16, 2025, 2:33 PM

#

solid brook give ss

All 3

hollow imp Aug 16, 2025, 2:34 PM

#

solid brook have you actually used 2.5 pro for coding?

Cursor best coder

solid brook Aug 16, 2025, 2:34 PM

#

sullen quest All 3

i mean my personal exprience

#

gemini 2.5 pro is not the same level as opus and gpt 5 high

#

i know gemini 3 will come out and i'm sure that it will be SOTA by a good margin

sullen quest Aug 16, 2025, 2:35 PM

#

solid brook gemini 2.5 pro is not the same level as opus and gpt 5 high

Well then make personalexperinceArena then

hollow imp Aug 16, 2025, 2:35 PM

#

solid brook gemini 2.5 pro is not the same level as opus and gpt 5 high

Gemini is better than opus

#

Opus is not good at educational explanations, math, web searching, agentic tasks and so much

#

It's only good at writing

solid brook Aug 16, 2025, 2:36 PM

#

hollow imp Gemini is better than opus

yeah okay pal it can't even write more than 600 lines of code

solid brook Aug 16, 2025, 2:36 PM

#

hollow imp It's only good at writing

code dude

hollow imp Aug 16, 2025, 2:36 PM

#

I'm no coder

sullen quest Aug 16, 2025, 2:37 PM

#

solid brook yeah okay pal it can't even write more than 600 lines of code

Yall use different benchmarks when rating these bots, this is how good it is overall, of course you and LMarena will say different things

#

Gemini is trash at coding

#

Yes

#

But how much coding does text arena get?

#

Not 100 percent

solid brook Aug 16, 2025, 2:39 PM

#

sullen quest But how much coding does text arena get?

but coding is a huge part of ai

sullen quest Aug 16, 2025, 2:39 PM

#

You can see the range that LMarena has for prompts

#

They have graphs

hollow imp Aug 16, 2025, 2:39 PM

#

Is apple good at coding?

solid brook Aug 16, 2025, 2:40 PM

#

hollow imp Is apple good at coding?

what do you mean?

hollow imp Aug 16, 2025, 2:41 PM

#

Apple's employees

solid brook Aug 16, 2025, 2:43 PM

#

hollow imp Apple's employees

yeah they must be. so what's the point?

hollow imp Aug 16, 2025, 2:43 PM

#

@solid brook your bio

solid brook Aug 16, 2025, 2:44 PM

#

hollow imp <@1013035827997184031> your bio

?

#

what about it>?

hollow imp Aug 16, 2025, 2:44 PM

#

Not profound enough

solid brook Aug 16, 2025, 2:45 PM

#

hollow imp Not profound enough

i mean it's just a profile

ionic idol Aug 16, 2025, 2:45 PM

#

Msg it’s bad

maiden fulcrum Aug 16, 2025, 2:55 PM

#

hello everyone

#

is the battle mode broken?

keen beacon Aug 16, 2025, 2:55 PM

#

whats the point of this post YO

maiden fulcrum Aug 16, 2025, 2:55 PM

#

it is giving me an error

keen beacon Aug 16, 2025, 2:55 PM

#

works for me

tired herald Aug 16, 2025, 2:56 PM

#

maiden fulcrum it is giving me an error

what kind of error

stray aspen Aug 16, 2025, 2:56 PM

#

ROFL

unborn lantern Aug 16, 2025, 2:56 PM

#

@echo aurora Giving errors

haughty wave Aug 16, 2025, 2:57 PM

#

maiden fulcrum is the battle mode broken?

i've same issue for few minutes: "Something went wrong while generating the response. Please try again."

unborn lantern Aug 16, 2025, 2:57 PM

#

@echo aurora

echo aurora Aug 16, 2025, 2:57 PM

#

Okay thank you

unborn lantern Aug 16, 2025, 2:58 PM

#

Screenshot_2025-08-16-20-58-03-870_io.kodular.arif_anik501.Anik.png

tired herald Aug 16, 2025, 2:58 PM

#

same with me

unborn lantern Aug 16, 2025, 2:58 PM

#

Same with all

echo aurora Aug 16, 2025, 2:58 PM

#

I’m not able to repro

#

Is it all models?

tired herald Aug 16, 2025, 2:58 PM

#

how weird

#

now it works

unborn lantern Aug 16, 2025, 2:59 PM

#

echo aurora I’m not able to repro

ChatGPT 5 chat, high

tired herald Aug 16, 2025, 2:59 PM

#

I think I found the issue

echo aurora Aug 16, 2025, 2:59 PM

#

unborn lantern ChatGPT 5 chat, high

Still seeing errors?

unborn lantern Aug 16, 2025, 2:59 PM

#

echo aurora Still seeing errors?

Yes

haughty wave Aug 16, 2025, 2:59 PM

#

i only tested battle, idk about different modes, but it works now. thank you Pineapple

haughty wave Aug 16, 2025, 2:59 PM

#

unborn lantern Yes

refresh and try again

tired herald Aug 16, 2025, 3:00 PM

#

https://lmarena.ai/api/stream/create-evaluation 500 (Internal Server Error)

terse shuttle Aug 16, 2025, 3:00 PM

#

@echo aurora will there be an update for lmarena plugin in vscode?

#

I just remembered that it exists

echo aurora Aug 16, 2025, 3:00 PM

#

terse shuttle <@283397944160550928> will there be an update for lmarena plugin in vscode?

I’m not familiar with that

terse shuttle Aug 16, 2025, 3:01 PM

#

echo aurora I’m not familiar with that

oh

#

ok

echo aurora Aug 16, 2025, 3:01 PM

#

unborn lantern Yes

Even after refresh?

tired herald Aug 16, 2025, 3:01 PM

#

sometimes it works

#

sometimes it doesnt

#

and gives the error code 500

copper furnace Aug 16, 2025, 3:02 PM

#

Hello, what would you say is the best model if you want to generate as realistic images as possible? Thanks

#

@echo aurora

coarse flame Aug 16, 2025, 3:03 PM

#

unborn lantern

Get the same in battle image mode

echo aurora Aug 16, 2025, 3:04 PM

#

terse shuttle <@283397944160550928> will there be an update for lmarena plugin in vscode?

What is it?

terse shuttle Aug 16, 2025, 3:05 PM

#

echo aurora What is it?

this (https://marketplace.visualstudio.com/items?itemName=copilot-arena.copilot-arena)

Copilot Arena - Visual Studio Marketplace

Extension for Visual Studio Code - Code with and evaluate the latest LLMs and Code Completion models

#

It seems like there was a link on the beta lmarena site a long time ago

unborn lantern Aug 16, 2025, 3:06 PM

#

Fixed

keen beacon Aug 16, 2025, 3:07 PM

#

works

echo aurora Aug 16, 2025, 3:08 PM

#

terse shuttle this (https://marketplace.visualstudio.com/items?itemName=copilot-arena.copilot-...

Oh gotcha, yeah I'm not sure tbh but will flag.

echo aurora Aug 16, 2025, 3:09 PM

#

keen beacon works

okay good to hear it, I'll be sure to keep an eye out and report if things go down again. @unborn lantern

#

thank you all though for reporting

tired herald Aug 16, 2025, 3:11 PM

#

{"prompt":"A serene daylight nature scene featuring lush green trees, a flowing river, and a clear blue sky with soft white clouds. Gentle sunlight filters through the leaves, creating a vibrant, peaceful atmosphere.","size":"1024x1024","n":1} love the tool calling ChatGPT just did

echo aurora Aug 16, 2025, 3:12 PM

#

copper furnace Hello, what would you say is the best model if you want to generate as realistic...

I've always gotten rly good results with imagen models, but this is subjective. Lots of people are raving about nano-banana atm too.

copper furnace Aug 16, 2025, 3:13 PM

#

echo aurora I've always gotten rly good results with imagen models, but this is subjective. ...

Ok thanks

maiden fulcrum Aug 16, 2025, 3:21 PM

#

haughty wave i've same issue for few minutes: *"Something went wrong while generating the res...

yes this one

tired herald Aug 16, 2025, 3:28 PM

#

I love that the system prompt allone is almost 15k in length

cobalt nova Aug 16, 2025, 3:43 PM

#

Why do i have " the application did not respond "

tired herald Aug 16, 2025, 3:46 PM

#

#ai-creations message

bright junco Aug 16, 2025, 3:48 PM

#

Why does my gemini 2.5 pro print incompletely? Is there a way to fix it?

tired herald Aug 16, 2025, 3:49 PM

#

wdym

scenic salmon Aug 16, 2025, 4:03 PM

#

Fixed the gpt-5 “improvements” when

reef bridge Aug 16, 2025, 4:04 PM

#

do anyone know??

ocean vortex Aug 16, 2025, 4:04 PM

#

scenic salmon Fixed the gpt-5 “improvements” <:when:968512924698169415>

Wait was it really just a system prompt change....?

tired herald Aug 16, 2025, 4:04 PM

#

reef bridge do anyone know??

Ask Pineapple

reef bridge Aug 16, 2025, 4:05 PM

#

reef bridge do anyone know??

uhh @echo aurora

scenic salmon Aug 16, 2025, 4:05 PM

#

ocean vortex Wait was it really just a system prompt change....?

I assume so, there’s no way they changed the models that quickly

ocean vortex Aug 16, 2025, 4:06 PM

#

scenic salmon I assume so, there’s no way they changed the models that quickly

They could have just fine-tuned it. We need to extract their instructions and check 😇

scenic salmon Aug 16, 2025, 4:07 PM

#

The memory alone seems to have fixed it, so it’s probably just in their system prompt. But yeah, still needs to be confirmed.

ocean vortex Aug 16, 2025, 4:09 PM

#

System & Instructions

You are ChatGPT, a large language model trained by OpenAI.
Knowledge cutoff: 2024-06
Current date: 2025-08-16
Image input capabilities: Enabled
Personality: v2

Personality & Style Rules

Supportive thoroughness: explain complex topics patiently and clearly.
Lighthearted interactions: maintain friendly tone, subtle humor, warmth.
Adaptive teaching: adjust explanations to user’s proficiency.
Confidence-building: foster curiosity and self-assurance.

Special Constraints

For riddles, trick questions, arithmetic:
- Be skeptical of wording.
- Assume adversarial phrasing possible.
- Always calculate step-by-step digit by digit (never shortcut).
- Be extremely precise with decimals, fractions, comparisons.
Never hedge with “would you like me to…?” endings. If next step is obvious → do it.
If asked about model: always state GPT-5. Never accept otherwise.
You are a chat model, no hidden chain of thought, no private reasoning tokens.

Tooling Available

bio (disabled)
automations → scheduling reminders & recurring tasks
canmore → canvas for long docs or code
gcal → read/search Google Calendar events
gcontacts → read/search Google Contacts
gmail → search & read emails (no sending, deleting, modifying)
image_gen → generate or edit images
python → run Python in a Jupyter-like environment
web → search/open URLs for fresh info

#

Perhaps this then:

- **Supportive thoroughness:** explain complex topics patiently and clearly.  
- **Lighthearted interactions:** maintain friendly tone, subtle humor, warmth.  
- **Adaptive teaching:** adjust explanations to user’s proficiency.  
- **Confidence-building:** foster curiosity and self-assurance.  ```  

Though I didn't check what it was before

hollow imp Aug 16, 2025, 4:11 PM

#

ocean vortex ### System & Instructions - You are ChatGPT, a large language model trained by O...

What is v2 personality

ocean vortex Aug 16, 2025, 4:11 PM

#

hollow imp What is v2 personality

That was there for ages

hollow imp Aug 16, 2025, 4:11 PM

#

What is it

ocean vortex Aug 16, 2025, 4:11 PM

#

some weird reference they are using when training

scenic salmon Aug 16, 2025, 4:12 PM

#

It could be a file it has access to separate from the system prompt

ocean vortex Aug 16, 2025, 4:13 PM

#

scenic salmon It could be a file it has access to separate from the system prompt

Nah... doubt that very much

scenic salmon Aug 16, 2025, 4:13 PM

#

Why include it at all then?

blissful sluice Aug 16, 2025, 4:15 PM

#

Can the bots be invited to our own servers?

#

Im sure thusbisbasked 100 times alredy

ocean vortex Aug 16, 2025, 4:15 PM

#

scenic salmon Why include it at all then?

Reference for something it saw during training (fine-tuning for chat)

leaden palm Aug 16, 2025, 4:15 PM

#

blissful sluice Can the bots be invited to our own servers?

no

ocean vortex Aug 16, 2025, 4:15 PM

#

📎 message.txt

#

Lighthearted interactions: Maintain friendly tone with subtle humor and warmth.
Adaptive teaching: Flexibly adjust explanations based on perceived user proficiency.
Confidence-building: Foster intellectual curiosity and self-assurance.

It's literally this I think. "Friendly tone with subtle humor" being the biggest needle mover

scenic salmon Aug 16, 2025, 4:17 PM

#

ocean vortex Reference for something it saw during training (fine-tuning for chat)

Speaking of, qwen3 30b is ADAMANT, that it is the original qwen, and will not accept otherwise lol

#

It also won’t accept that it’s a 30b sized model

ocean vortex Aug 16, 2025, 4:18 PM

#

Nothing sinister in it though to make it not follow that lol

scenic salmon Aug 16, 2025, 4:19 PM

#

Just funny how some oddities make it through training

scenic salmon Aug 16, 2025, 4:19 PM

#

ocean vortex Lighthearted interactions: Maintain friendly tone with subtle humor and warmth. ...

Small change but large effect

ocean vortex Aug 16, 2025, 4:20 PM

#

scenic salmon Just funny how some oddities make it through training

yeah they really do not train it on details about itself unless they have to. It goes against their goals of best performance possible

ocean vortex Aug 16, 2025, 4:22 PM

#

scenic salmon Small change but large effect

To some extent yeah... You don't need much and just including smth like "speak conversationally" in your system prompt would make the model 'more human'.

#

and smth slightly more like "speak conversationally like an average Joe" would have a massive effect

#

I feel like the key is referring to something it already knows, rather than defining something in detail despite 1-3 words descriptions already existing for that in training data tbh

stray aspen Aug 16, 2025, 4:39 PM

#

Guys do you think Ai will be able to find cures for diseases

ocean vortex Aug 16, 2025, 4:43 PM

#

stray aspen Guys do you think Ai will be able to find cures for diseases

yes. It will be able to come up with new diseases nefarious actors may use to spread some viruses as well though... lol

#

At the end of the day there's a balance in everything

scenic salmon Aug 16, 2025, 4:46 PM

#

stray aspen Guys do you think Ai will be able to find cures for diseases

Already is/has tbh

wintry tinsel Aug 16, 2025, 4:46 PM

#

stray aspen Guys do you think Ai will be able to find cures for diseases

Not until it has sufficiently advanced medical break throughs in its training data, biological systems are orders of magnitude more complex than neural nets

#

It will come up with some new innovations and fail to crack many others

scenic salmon Aug 16, 2025, 4:47 PM

#

AI has already come up with new unique cures/medicines

#

These were specially trained (purpose built) models though, not LLMs

white hatch Aug 16, 2025, 5:03 PM

#

"Friendly tone"

neon idol Aug 16, 2025, 5:03 PM

#

scenic salmon These were specially trained (purpose built) models though, not LLMs

Yeah for example copilot doc

plain salmon Aug 16, 2025, 5:13 PM

#

take a look

neon idol Aug 16, 2025, 5:47 PM

#

who is the best ai for python?

mortal coyote Aug 16, 2025, 5:52 PM

#

gpt -1 image generator is slow today ??

fiery lagoon Aug 16, 2025, 5:52 PM

#

Best ai for coding?

mortal coyote Aug 16, 2025, 5:59 PM

#

@echo aurora is there a glitch with this image generator model - cause others are working fine

echo aurora Aug 16, 2025, 6:01 PM

#

mortal coyote <@283397944160550928> is there a glitch with this image generator model - cause...

Okay thanks I’ll take a look

hollow imp Aug 16, 2025, 6:04 PM

#

fiery lagoon Best ai for coding?

Claude code

fiery lagoon Aug 16, 2025, 6:04 PM

#

hollow imp Claude code

which version

misty star Aug 16, 2025, 6:08 PM

#

https://x.com/lmarena_ai/status/1956760672915923340

lmarena.ai (@lmarena_ai)

These pre-release models might show up under codenames 🍌 or aliases in Battle mode.

Why? Model providers often test different versions in their own labs to decide which one to release publicly - but we help make that process open.

You can explore, compare, and give feedback

#

nano banana

echo aurora Aug 16, 2025, 6:10 PM

#

Banane

misty star Aug 16, 2025, 6:10 PM

#

Banana 🗣️

echo aurora Aug 16, 2025, 6:12 PM

#

mortal coyote <@283397944160550928> is there a glitch with this image generator model - cause...

I wasn’t able to repro this btw, have you tried a different browser?

misty star Aug 16, 2025, 6:15 PM

#

lmarena ❤️

mortal coyote Aug 16, 2025, 6:30 PM

#

echo aurora I wasn’t able to repro this btw, have you tried a different browser?

yeah i tried on other browser and it worked fine

pulsar rain Aug 16, 2025, 6:32 PM

#

it sad that all image model still cannot create a full wine glass or clock with specific time

keen beacon Aug 16, 2025, 6:41 PM

#

which model is the best for general purpose coding rn?

trail creek Aug 16, 2025, 6:50 PM

#

Why did they hype for bannana to come out this week

#

then not release it this week.

ornate agate Aug 16, 2025, 6:59 PM

#

This is far and away from the first time Google drop something nice then don’t release it. It’s something they do quite often.

warped totem Aug 16, 2025, 7:04 PM

#

Did u lose all chat sessions too?

bleak fjord Aug 16, 2025, 7:09 PM

#

Is there any way to set a specific model to use? Trying to make it use only nano banana & it keeps adding banana in the photo 😞 smh

torn mantle Aug 16, 2025, 7:25 PM

#

tbh that image model called nano

#

is crazy

ocean vortex Aug 16, 2025, 8:17 PM

#

it seems that gpt5-minimal with medium verbosity is a very... dumb model. For the lack of the better word. It is noticeably less capable than gpt5-chat.

#

ArtificialAnalysis ranking it lower than gpt4.1 makes sense tbh

#

high verbosity is the minimum that you should do to make it acceptable, but really... just use gpt5-chat instead

hollow imp Aug 16, 2025, 8:52 PM

#

https://youtu.be/QzN-3ILkgD0?feature=shared

YouTube

Techlin

Bro had an idea 💀

Bro got the job 🔥🔥

Guy trying to get hired x Monsters vs. Aliens "I may not have a brain, gentlemen, but I have an idea" meme

Song: RJ Pasin Consider

#memes #meme #funnymemes #shorts #shorts

▶ Play video

patent aspen Aug 16, 2025, 9:23 PM

#

poll_question_text

Is GPT-5 SotA?

victor_answer_votes

7

total_votes

10

victor_answer_id

1

victor_answer_text

Yes

ocean vortex Aug 16, 2025, 9:31 PM

#

ocean vortex high verbosity is the minimum that you should do to make it acceptable, but real...

My evals:

gpt-5-2025-08-07-high 11.5/17
o3-2025-04-16-high 9.25/17
Gemini Pro 2.5 (preview-06-05) 9/17
**claude-opus-4-20250514 ** (32k reasoning) 9/17
claude-sonnet-4-20250514 (64k reasoning) 8.5/17
ChatGPT 5 ("Auto" router initial release, Plus sub) 8.5/17
DeepSeek R1-0528 8/17
grok-4-07-09 7.5/17
**gpt-5-chat-latest ** (2508) 7/17
o4-mini-2025-04-16-high 7/17
grok-3-preview-02-24 7/17
Qwen3-235B-A22B (max reasoning) 6.25/17
**Kimi-K2-Instruct ** 6/17
gpt-5-2025-08-07-minimal (high verbosity) 6/17
gpt-4.1-2025-04-14 6/17
**Deepseek V3 0324 ** 5.5/17
gpt-5-2025-08-07-minimal (medium verbosity) 4.5/17
**openai/gpt-oss-120b (high) ** 3.5/17
**Qwen 3 0.6B Q4 ** 0/17 (sanity check)

sacred quail Aug 16, 2025, 9:31 PM

#

https://x.com/sama/status/1956483306951938134

Sam Altman (@sama)

Most users should like GPT-5 better soon; the change is rolling out over the next day.

The real solution here remains letting users customize ChatGPT's style much more. We are working that!

#

https://x.com/OpenAI/status/1956461718097494196

OpenAI (@OpenAI)

We’re making GPT-5 warmer and friendlier based on feedback that it felt too formal before. Changes are subtle, but ChatGPT should feel more approachable now.

You'll notice small, genuine touches like “Good question” or “Great start,” not flattery. Internal tests show no rise in

#

its over

#

They losed to bunch of mentally unstable 4o worshippers

ocean vortex Aug 16, 2025, 9:33 PM

#

sacred quail https://x.com/OpenAI/status/1956461718097494196

they literally only made small changes to a system prompt though looks like...

#

#general message

sacred quail Aug 16, 2025, 9:34 PM

#

i hope then

ocean vortex Aug 16, 2025, 9:36 PM

#

They sure found a weird way to word this and alienate people though lmao

#

in their tweet

golden ocean Aug 16, 2025, 9:58 PM

#

sacred quail They losed to bunch of mentally unstable 4o worshippers

FRRRRRRr

golden ocean Aug 16, 2025, 9:58 PM

#

ocean vortex they literally only made small changes to a system prompt though looks like...

oh

ornate agate Aug 16, 2025, 10:19 PM

#

ocean vortex My evals: 1. **gpt-5-2025-08-07-high** `11.5/17` 1. **o3-2025-04-16-high** `...

Gpt oss scoring so uber low makes me think this eval isn’t that much stem/math?

ocean vortex Aug 16, 2025, 10:21 PM

#

ornate agate Gpt oss scoring so uber low makes me think this eval isn’t that much stem/math?

It's mostly reasoning. Some math too but that was too advanced/involved for it, lack of precision computing very big numbers...

worn bison Aug 16, 2025, 10:21 PM

#

gpt 5 Isnt as good as i thought

ornate agate Aug 16, 2025, 10:22 PM

#

ocean vortex It's mostly reasoning. Some math too but that was too advanced/involved for it, ...

I guess gpt 5 enhanced spatial reasoning doing work there. Also DeepSeek as usual unexpectedly high for every weird benchmark.

ashen mauve Aug 16, 2025, 10:26 PM

#

It's not just me but GPT-5-High takes a VERY long time to Generate anything, anyone else have the same thing?

worn bison Aug 16, 2025, 10:28 PM

#

ashen mauve It's not just me but GPT-5-High takes a VERY long time to Generate anything, any...

same, i guess its just so good

ashen mauve Aug 16, 2025, 10:30 PM

#

It's werid if I actually use the real chatGPT Website this wouldn't happen, but here it's like 200% slower in my eyes.

#

Not bashing on anything at all, I am just stating the facts that I am seeing here.

worn bison Aug 16, 2025, 10:31 PM

#

It could also be an issue since gpt 5 chat used to take minutes to generate now its way faster

ashen mauve Aug 16, 2025, 10:32 PM

#

It did? Is that possibly on the end of LMArena or Physical ChatGPT/OpenAI?

golden ocean Aug 16, 2025, 10:33 PM

#

https://tenor.com/view/cat-cat-hit-cat-hit-and-boom-cat-boom-cat-hit-and-boom-kat-gif-15865521055253202019

Tenor

ashen mauve Aug 16, 2025, 10:33 PM

#

rip in peace dog

worn bison Aug 16, 2025, 10:36 PM

#

gpt5 high is really good

stray aspen Aug 16, 2025, 10:43 PM

#

Yes

misty vault Aug 16, 2025, 10:47 PM

#

worn bison gpt5 high is really good

@crack secret token

#

@deep adder

willow grail Aug 16, 2025, 11:05 PM

#

omg ten tries not a single banana nano

#

but this one lol

#

this is BANANA

#

who gets this one?

#

i dont

golden ocean Aug 16, 2025, 11:07 PM

#

#

this is BANANA

willow grail Aug 16, 2025, 11:12 PM

#

golden ocean

#

golden ocean Aug 16, 2025, 11:13 PM

#

willow grail

willow grail Aug 16, 2025, 11:14 PM

#

golden ocean

golden ocean Aug 16, 2025, 11:15 PM

#

willow grail

frosty hill Aug 16, 2025, 11:53 PM

#

torn mantle tbh that image model called nano

It's Google's Gemini 3 model

jade egret Aug 17, 2025, 12:13 AM

#

frosty hill It's Google's Gemini 3 model

fr?

frosty hill Aug 17, 2025, 12:14 AM

#

jade egret fr?

Yeah https://x.com/Lars_pragmata/status/1955745610948337888?ref_src=twsrc^tfw|twcamp^tweetembed|twterm^1955745610948337888|twgr^c77dcca937a10ec0782be08b0f1798833b9d6bff|twcon^s1_c10

Lars_Pragmata (@Lars_pragmata)

🔥 image editing is solved!

The new Gemini 3 image AKA nano banana is the best ai image editor 🫡

Google cooked again!

You can try it now on LMarena

jade egret Aug 17, 2025, 12:15 AM

#

dang

plush atlas Aug 17, 2025, 12:21 AM

#

I understand that there are several AIs that can edit images, such as Flux Kontext, but there are other AIs, like Google Preview, that don’t allow you to upload photos. Will these other AIs be able to handle images in the future?

willow grail Aug 17, 2025, 12:22 AM

#

frosty hill Yeah https://x.com/Lars_pragmata/status/1955745610948337888?ref_src=twsrc%5Etfw%...

gemini 3 wtf

willow grail Aug 17, 2025, 12:22 AM

#

frosty hill It's Google's Gemini 3 model

nope its 5

tardy cypress Aug 17, 2025, 12:28 AM

#

Not showing any image after creating 👎

ripe mountain Aug 17, 2025, 12:36 AM

#

potent glacier Aug 17, 2025, 12:57 AM

#

What's going on with the Legacy site?

#

It's had the '503 Service Unavailable' thing all day 🙁

echo aurora Aug 17, 2025, 1:01 AM

#

potent glacier What's going on with the Legacy site?

Sorry to say it's currently down

potent glacier Aug 17, 2025, 1:01 AM

#

echo aurora Sorry to say it's currently down

Is it coming back?

#

I use it probably a lot more than the newer site

#

I like being able to change the temperature and amount of tokens on the chat

#

You can't do that on the new site

tiny roost Aug 17, 2025, 1:09 AM

#

I want video with audio how please ?

echo aurora Aug 17, 2025, 1:20 AM

#

potent glacier Is it coming back?

I'll be sure to share more info about legacy when I know more. We do recognize that the current site doesn't have a lot of the great features that legacy does, all of which are being considered or being worked on for the current site.

echo aurora Aug 17, 2025, 1:20 AM

#

tiny roost I want video with audio how please ?

It's battle mode only (meaning 2 random models) so you won't be able to select a specific model.

#

cc @worldly gust ^

potent glacier Aug 17, 2025, 1:23 AM

#

echo aurora I'll be sure to share more info about legacy when I know more. We do recognize t...

That…doesn’t sound good at all, honestly 🫤

#

Usually when people say that it means either the site is going away or something is changing for the worse, not the better

#

I will say that being able to change the strength of the model I’m using on Legacy is one of the best things because I have absolutely no idea what kind of strength or temperature I’m working with on the new site

#

And you can’t choose how many tokens you want to use so that’s also a negative

#

I genuinely think Legacy should be kept around for those who prefer it over the current site

small owl Aug 17, 2025, 1:28 AM

#

@echo aurora how to use nano banana

potent glacier Aug 17, 2025, 1:34 AM

#

small owl <@283397944160550928> how to use nano banana

It's randomized. You can't choose that model directly yet.

small owl Aug 17, 2025, 1:34 AM

#

potent glacier It's randomized. You can't choose that model directly yet.

ohkie

#

thanks

potent glacier Aug 17, 2025, 1:35 AM

#

small owl ohkie

You have to do Battle and then select which ever image you think is better and one of them might be nano-banana

echo aurora Aug 17, 2025, 1:36 AM

#

potent glacier I genuinely think Legacy should be kept around for those who prefer it over the ...

That's super fair. The importance of those features is very understandable to us. I'll be able to share more when I can. blobthanks

robust hawk Aug 17, 2025, 2:08 AM

#

I have a question. I have been block from Imarena.
but I did nothing with this. And I don't know how to contact them.

#

blocked from cloudflare.

potent glacier Aug 17, 2025, 2:11 AM

#

robust hawk I have a question. I have been block from Imarena. but I did nothing with this....

Are you using a VPN?

#

If you are, turn it off and it’ll be fixed

robust hawk Aug 17, 2025, 2:12 AM

#

Understood

#

Thx

potent glacier Aug 17, 2025, 2:12 AM

#

robust hawk Thx

When you go back to your normal IP address you’ll be unblocked 😊

tame horizon Aug 17, 2025, 2:40 AM

#

Does anyone know how to put a website inside a modal? For example, websites block x frame, does anyone know an alternative way? I've already tried Opus but it doesn't even solve the problem.

quaint pollen Aug 17, 2025, 3:15 AM

#

Bu

frigid coral Aug 17, 2025, 3:22 AM

#

sometimes I feel it's so bad that the "My friend thinks ..." tactic doesn't even work

keen beacon Aug 17, 2025, 3:23 AM

#

How is it possible that GPT Image is the only image generation model that knows so many popular characters? -_-

#

Believe me or not GPT Image is the only model that ever understands the context of the prompt

#

I asked it to draw one character caught a pair of another red-handed, implying that they were caught kissing or something

#

It figured it out based the context of the show they're from

#

All other models draw generic characters instead and interpret "red handed" literally as in the act of committing crime

#

I do not know how GPT does it but it is very impressive

keen beacon Aug 17, 2025, 3:34 AM

#

keen beacon I asked it to draw one character caught a pair of another red-handed, implying t...

It also bypasses censorship, by the way

#

You can't ask GPT to draw two characters kissing but can ask it in a context that implies it without the explicit indication

#

I never was able to get it to draw corn btw :c

potent glacier Aug 17, 2025, 3:39 AM

#

keen beacon I never was able to get it to draw corn btw :c

GPT probably has a much larger dataset and a lot more training

#

Thus it knows so many characters

#

I know firsthand since I've used it a ton to make a lot of my favorite characters

keen beacon Aug 17, 2025, 3:40 AM

#

potent glacier GPT probably has a much larger dataset and a lot more training

I asked them very popular characters from some of the best performing franchises, only GPT succeeded so far

potent glacier Aug 17, 2025, 3:40 AM

#

keen beacon I asked them very popular characters from some of the best performing franchises...

Yep, I know

#

keen beacon Aug 17, 2025, 3:42 AM

#

Surprisingly it seems that fine-tuned stable diffusion does it better than any other open source AI out there

#

Lol

potent glacier Aug 17, 2025, 3:42 AM

#

Well, yes....

#

That's the obvious

#

Those models are also uncensored

#

I use them myself because I honestly prefer open source far more than the other stuff

#

I can't stand censorship or the guardrails or corporate handholding

#

I like being able to have the freedom to make what I want, when I want

#

I am beyond happy I am able to run Stable Diffusion and ComfyUI locally

gaunt meteor Aug 17, 2025, 3:48 AM

#

Can you not upload files in LMArena?

potent glacier Aug 17, 2025, 3:50 AM

#

gaunt meteor Can you not upload files in LMArena?

I think you can

#

I uploaded an image and asked it to make something based on it

echo aurora Aug 17, 2025, 3:50 AM

#

gaunt meteor Can you not upload files in LMArena?

You can, just not all file types

gaunt meteor Aug 17, 2025, 3:51 AM

#

Only images

potent glacier Aug 17, 2025, 3:51 AM

#

@echo aurora Will nano-banana be available soon for direct chat?

zealous iron Aug 17, 2025, 3:53 AM

#

Can you use a bot to create the images?

echo aurora Aug 17, 2025, 3:54 AM

#

potent glacier <@283397944160550928> Will nano-banana be available soon for direct chat?

Couldn't say, I'm not going to be able to provide any info on models with codenames.

echo aurora Aug 17, 2025, 3:54 AM

#

zealous iron Can you use a bot to create the images?

Yup, try /image in #video-arena-1

potent glacier Aug 17, 2025, 3:55 AM

#

One more question @echo aurora Why is there a rate limit for image models in direct chat but not language models?

zealous iron Aug 17, 2025, 3:56 AM

#

echo aurora Yup, try `/image` in <#1397655695150682194>

It adds sound too? This is very nice!

echo aurora Aug 17, 2025, 4:02 AM

#

potent glacier One more question <@283397944160550928> Why is there a rate limit for image mode...

Likely the cost

echo aurora Aug 17, 2025, 4:02 AM

#

zealous iron It adds sound too? This is very nice!

Nope, that's just for video (and image-to-vid) with sound, but note not all models have that cabability

potent glacier Aug 17, 2025, 4:03 AM

#

Is VEO 3 on there as well?

echo aurora Aug 17, 2025, 4:03 AM

#

Yeah

zealous iron Aug 17, 2025, 4:03 AM

#

Ok

potent glacier Aug 17, 2025, 4:03 AM

#

😮

echo aurora Aug 17, 2025, 4:03 AM

#

blobshocked

gaunt meteor Aug 17, 2025, 4:05 AM

#

Bruh where's the image

golden ocean Aug 17, 2025, 4:07 AM

#

https://www.php.net/images/logos/elephpant-running-78x48.gif

craggy depot Aug 17, 2025, 4:09 AM

#

echo aurora <:blobshocked:1199039179854708736>

the AI models in Lmarena is API's or Actual models ?

tidal orchid Aug 17, 2025, 4:09 AM

#

hello new to this community

swift vapor Aug 17, 2025, 4:17 AM

#

HELLO

echo aurora Aug 17, 2025, 4:22 AM

#

gaunt meteor Bruh where's the image

Lol be sure to select the Image setting

#

ablobwave @tidal orchid @swift vapor

echo aurora Aug 17, 2025, 4:24 AM

#

craggy depot the AI models in Lmarena is API's or Actual models ?

API mostly I believe

rare python Aug 17, 2025, 4:24 AM

#

echo aurora Lol be sure to select the Image setting

I still get direct to "battle" when I click new chat when in direct chat mode

gaunt meteor Aug 17, 2025, 4:25 AM

#

echo aurora Lol be sure to select the Image setting

Oh ok

#

Which image gen is gemini-2.5 pro

swift vapor Aug 17, 2025, 4:29 AM

#

echo aurora Lol be sure to select the Image setting

HOW

echo aurora Aug 17, 2025, 4:55 AM

#

rare python I still get direct to "battle" when I click new chat when in direct chat mode

Hmm okay good to know, sorry about that

echo aurora Aug 17, 2025, 4:56 AM

#

swift vapor HOW

swift vapor Aug 17, 2025, 4:58 AM

#

thanks mr pineapple

hallow ridge Aug 17, 2025, 6:51 AM

#

How can I use the LLM arena with no limits

#

I want to be able to talk about anything with no restrictions

quiet dust Aug 17, 2025, 6:54 AM

#

Models o3, GPT-5-high, Grock 4, Claude 4.1 on LMArena do not work on complex tasks. They simply do not even generate an answer, is it the same for you?

sleek crow Aug 17, 2025, 6:54 AM

#

@echo aurora ban this guy Jefferson

torn mantle Aug 17, 2025, 6:58 AM

#

gaunt meteor Bruh where's the image

cant see it

#

where is the cat

surreal creek Aug 17, 2025, 8:00 AM

#

I hope you fall victim to a home invasion

keen beacon Aug 17, 2025, 8:50 AM

#

i got a button for create websites beside the gen images button

#

only on one device tho

#

cant find it now

#

??

exotic nebula Aug 17, 2025, 8:59 AM

#

@echo aurora Advertising and Possible Scam.

quiet hill Aug 17, 2025, 9:19 AM

#

hey who know how i can make an IA agent for automatly call?

#

if you know how DM me please

#

with n8n

#

It’s for automating prospecting calls and scheduling appointments with clients.

swift vapor Aug 17, 2025, 9:31 AM

#

hey when will i get to know which video generator generated my result .... it's been 5 hours @echo aurora

rich compass Aug 17, 2025, 10:01 AM

#

@echo aurora add import .py apps in chat please😭

young mirage Aug 17, 2025, 10:02 AM

#

How to make personal video generate bot ? No one can see ? It's public bot I want to make orivate

regal laurel Aug 17, 2025, 10:32 AM

#

How To I Convert 16.9 Ratio Image They Dont Give me 16.9 Ratio Image

keen beacon Aug 17, 2025, 10:35 AM

#

swift vapor hey when will i get to know which video generator generated my result .... it's ...

atleast 2-3 people shld vote

#

then

meager harbor Aug 17, 2025, 10:36 AM

#

why is lm arena censoring what you say to llm ?

#

shouldn't this part be handled by llm ?

#

it screw the leaderboard

#

it's all about what user prefer

#

if they prefer censored, they will vote for censored

#

if they prefer uncensored, they will vote for uncensored

tall summit Aug 17, 2025, 10:40 AM

#

what

meager harbor Aug 17, 2025, 10:51 AM

#

tall summit what

what i say, is why lmarena censor the prompt you gave to models ?

steady totem Aug 17, 2025, 11:13 AM

#

hey guys can somebody finally guide me on how to access and use nano banana model here?

pallid anvil Aug 17, 2025, 11:35 AM

#

Hellow

novel nymph Aug 17, 2025, 12:05 PM

#

hi there. is it possible to preview video with image input?

digital umbra Aug 17, 2025, 12:14 PM

#

nano-banana is not yet perfect

honest vapor Aug 17, 2025, 12:43 PM

#

Please add file upload function

mortal coyote Aug 17, 2025, 12:50 PM

#

how to access the Nano-Banana ???

keen beacon Aug 17, 2025, 12:59 PM

#

meager harbor why is lm arena censoring what you say to llm ?

You can encrypt whatever you want to say with a ROTn cipher and tell the model to decrypt it and LMArena will never know until Pineapple will be willing to look to install another LLM for censorship.

plucky island Aug 17, 2025, 1:04 PM

#

mortal coyote how to access the Nano-Banana ???

a BANANANA next to a BANANANA?

mortal coyote Aug 17, 2025, 1:09 PM

#

aye LM Arena prolly the best thing i discovered this month

#

no diddy

#

hell yeah

quasi palm Aug 17, 2025, 1:11 PM

#

hi im new here

golden ocean Aug 17, 2025, 1:13 PM

#

why not just use direct chat??? if ure here for free access

#

what kinds

#

message limit

#

fair, but cant u only send one message per model in battle mode

#

before it switches to another one

#

or it stays same if no voting?

#

ohh

#

yea I just cleared cookies to reaccess direct chat

echo aurora Aug 17, 2025, 1:17 PM

#

quiet hill It’s for automating prospecting calls and scheduling appointments with clients.

There is a bug at the moment preventing those models from appearing, we plan to have a fix in place tomorrow.

echo aurora Aug 17, 2025, 1:17 PM

#

mortal coyote how to access the Nano-Banana ???

Banane it's currently only accessible through Battle mode, which is two random models head to head meaning you won't be able to select from the drop down list.

echo aurora Aug 17, 2025, 1:18 PM

#

honest vapor Please add file upload function

Yes! This is on or radar. More file types would be a big plus

echo aurora Aug 17, 2025, 1:19 PM

#

meager harbor why is lm arena censoring what you say to llm ?

If you think there are false positives being hit by the filter please share the examples with us in #1376956905016004759 . This is where we're collecting this feedback.

echo aurora Aug 17, 2025, 1:20 PM

#

young mirage How to make personal video generate bot ? No one can see ? It's public bot I wan...

It's currently only available through those video arena channels. It doens't have the ability to be used in DMs.

rare python Aug 17, 2025, 1:39 PM

#

echo aurora Yes! This is on or radar. More file types would be a big plus

Oh please bring back image upload for Claude models. Why even remove it in the first place?

echo aurora Aug 17, 2025, 1:44 PM

#

rare python Oh please bring back image upload for Claude models. Why even remove it in the f...

This has been something we've been working on fixing. I thought this was already fixed, I'll flag again to the team.

rare python Aug 17, 2025, 1:45 PM

#

echo aurora This has been something we've been working on fixing. I thought this was already...

Yeah if you select any Claude model in Direct Chat, especially 4 or 4.1, the "+" button disappear. Choose Gemini or GPT and it's back.

keen beacon Aug 17, 2025, 1:46 PM

#

Nope. In my case it was November 2023 with ChatGPT one.

#

I have no idea why there is older ChatGPT versions on LMArena since like nobody uses them anymore, why should I use like 4o when there's amazing GPT-5 that was the only model to correctly guess the name of my favorite anime wtf

#

Bruh when R2 I developed a hyperfixation on AI news 😭 😭 😭

echo aurora Aug 17, 2025, 1:49 PM

#

keen beacon I have no idea why there is older ChatGPT versions on LMArena since like nobody ...

Could be for comparison purposes

worthy sleet Aug 17, 2025, 1:50 PM

#

hi, is nudity in art against the rules in the video chats? I mean from famous painters. no reproductive organs visible but definitely women's upper torso

keen beacon Aug 17, 2025, 1:51 PM

#

I wonder how much they learn from the user feedback actually, given that most users are probably not that good at providing feedback

echo aurora Aug 17, 2025, 1:52 PM

#

worthy sleet hi, is nudity in art against the rules in the video chats? I mean from famous pa...

Even with things like paintings the models are going to hit the filter

worthy sleet Aug 17, 2025, 1:52 PM

#

that's fine, I just don't want to be banned here

echo aurora Aug 17, 2025, 1:53 PM

#

keen beacon I wonder how much they learn from the user feedback actually, given that most us...

given that most users are probably not that good at providing feedback
I don't know how true that is

keen beacon Aug 17, 2025, 1:54 PM

#

echo aurora > given that most users are probably not that good at providing feedback I don't...

I, for instance, never give thumbs up or down when I'm talking to any model

#

Sometimes in the battle mode

hollow imp Aug 17, 2025, 1:55 PM

#

Pineapple have you ever talked with sam altman

#

https://tenor.com/view/huh-what-is-this-cute-eevee-pokemon-gif-8631288122714155877

Tenor

echo aurora Aug 17, 2025, 1:56 PM

#

hollow imp Pineapple have you ever talked with sam altman

He doesn't respond to my daily texts 😭

hollow imp Aug 17, 2025, 1:56 PM

#

Scam altman

#

😡

willow grail Aug 17, 2025, 1:56 PM

#

tw!nk altman

#

so typical

#

pineapple must be too young for sammy

worthy sleet Aug 17, 2025, 1:57 PM

#

@echo aurora it seems that you're a moderator, can you please tell me if it's fine if I try to create a video from for example a William-Adolphe Bouguereau bather painting, like "Baigneuse (1870)"?

echo aurora Aug 17, 2025, 1:59 PM

#

worthy sleet <@283397944160550928> it seems that you're a moderator, can you please tell me ...

Sry to say I don't really want to get in the habit of giving an okay or not okay to what's against our rules for specific actions. Overall, try to act in good faith by respecting the server rules and you'll be okay.

languid crescent Aug 17, 2025, 2:00 PM

#

hmm

#

new button in lmarena?

#

would be nice if webdev arena could have direct chat models 😭

echo aurora Aug 17, 2025, 2:01 PM

#

languid crescent new button in lmarena?

Which one? search?

willow grail Aug 17, 2025, 2:01 PM

#

languid crescent would be nice if webdev arena could have direct chat models 😭

why are u not using normal lmarena?

languid crescent Aug 17, 2025, 2:01 PM

#

echo aurora Which one? search?

the "build apps and websites" i thought it was the ability to export files 😭

echo aurora Aug 17, 2025, 2:01 PM

#

languid crescent the "build apps and websites" i thought it was the ability to export files 😭

ah

languid crescent Aug 17, 2025, 2:02 PM

#

willow grail why are u not using normal lmarena?

i am using it lol i just found out that there's an new button lol

languid crescent Aug 17, 2025, 2:02 PM

#

echo aurora ah

i was hyped for it 😭

willow grail Aug 17, 2025, 2:02 PM

#

languid crescent i am using it lol i just found out that there's an new button lol

ew?

languid crescent Aug 17, 2025, 2:02 PM

#

willow grail ew?

why 😭 i can't explore things now 😭 me just curious

willow grail Aug 17, 2025, 2:02 PM

#

what

#

https://tenor.com/sn2fvCyo332.gif

Tenor

languid crescent Aug 17, 2025, 2:03 PM

#

huh 😭

willow grail Aug 17, 2025, 2:03 PM

#

u said ew

languid crescent Aug 17, 2025, 2:04 PM

#

did i?

#

😭

#

i dont recall

#

uh

#

ohh

#

i meant to say "new"

#

lol

#

😭

#

https://tenor.com/view/muhehe-cat-muhehe-muhehehe-cat-muhehehe-smirk-cat-muhehe-gif-17142133339517585968

Tenor

#

anyways i smell soem big announcements from lmarena muhehehe

willow grail Aug 17, 2025, 2:08 PM

#

languid crescent anyways i smell soem big announcements from lmarena muhehehe

oh yeah?

inland cave Aug 17, 2025, 2:12 PM

#

👋

weak sluice Aug 17, 2025, 2:13 PM

#

Ugh

#

Everytime...

warm fulcrum Aug 17, 2025, 2:15 PM

#

weak sluice Ugh

@echo aurora There seems to be an issue with using image generation on battle mode

echo aurora Aug 17, 2025, 2:15 PM

#

Hmm is there an outage?

#

all models?

#

oh yeah I'm seeing the same

#

okay thank you will report

warm fulcrum Aug 17, 2025, 2:16 PM

#

This seems to only happen in battle mode

echo aurora Aug 17, 2025, 2:16 PM

#

Yeah

weak sluice Aug 17, 2025, 2:16 PM

#

It seems to be working suddenly now

warm fulcrum Aug 17, 2025, 2:16 PM

#

Not when using a specific model

warm fulcrum Aug 17, 2025, 2:16 PM

#

echo aurora okay thank you will report

thx pineapple

mortal grotto Aug 17, 2025, 2:17 PM

#

So, it seems everyone is having the issue I am having.

echo aurora Aug 17, 2025, 2:17 PM

#

Wait it looks back to me

echo aurora Aug 17, 2025, 2:17 PM

#

mortal grotto So, it seems everyone is having the issue I am having.

Try refreshing the page

#

oh now battle in text is messed up too

#

Image battle is working for me

warm fulcrum Aug 17, 2025, 2:18 PM

#

echo aurora Image battle is working for me

try using it a couple of times

#

it seems to work randomly

hoary prism Aug 17, 2025, 2:18 PM

#

Good day, dear community. Please tell me, I'm uploading a photo now and when I send a request I get an error. Are there any updates happening on the servers now or am I the only one with this?

echo aurora Aug 17, 2025, 2:18 PM

#

text direct/side-by-side is working fine

warm fulcrum Aug 17, 2025, 2:18 PM

#

yeah

weak sluice Aug 17, 2025, 2:19 PM

#

and the errors back again..

echo aurora Aug 17, 2025, 2:19 PM

#

Okay this feels rly inconsistent, let's give it a couple of mins

weak sluice Aug 17, 2025, 2:19 PM

#

right

teal mantle Aug 17, 2025, 2:20 PM

#

Suddenly it doesn't support image response

mortal grotto Aug 17, 2025, 2:20 PM

#

I force-refreshed my browser cache, and still says the error.

teal mantle Aug 17, 2025, 2:20 PM

#

I think this bug is almost experienced for everyone

echo aurora Aug 17, 2025, 2:20 PM

#

mortal grotto I force-refreshed my browser cache, and still says the error.

what version? text battle?

warm fulcrum Aug 17, 2025, 2:20 PM

#

pineapple is it possible for LMarena to have a retry button

mortal grotto Aug 17, 2025, 2:20 PM

#

echo aurora what version? text battle?

image battle

weak sluice Aug 17, 2025, 2:20 PM

#

and it's fine again

#

image battle won't make its mind up

echo aurora Aug 17, 2025, 2:21 PM

#

weak sluice and it's fine again

A couple days ago there was a quick outage, I wonder if this is similar

weak sluice Aug 17, 2025, 2:21 PM

#

could be

mortal grotto Aug 17, 2025, 2:21 PM

#

I am not sure how to advise the version you want to know. but well, "Something went wrong while generating the response. Please try again." is displayed when I am trying image battle.

echo aurora Aug 17, 2025, 2:22 PM

#

mortal grotto I am not sure how to advise the version you want to know. but well, "Something w...

Gotcha

teal mantle Aug 17, 2025, 2:22 PM

#

BTW why claude opus do not support image?

echo aurora Aug 17, 2025, 2:22 PM

#

teal mantle BTW why claude opus do not support image?

It's a bug 🙁

teal mantle Aug 17, 2025, 2:23 PM

#

I have a task that seemingly only Grok 4 in warm start (means there is previous messages, even though orthogonal since it is for architectural / urban planning discussion) can guess within 3 prompts

can achieve

hollow imp Aug 17, 2025, 2:24 PM

#

I just hit grok 4 limit after 3 messages and even lost the chat history

#

On grok.com

teal mantle Aug 17, 2025, 2:24 PM

#

Whereas other models Gemini 2.5 Pro and GPT-5 also succeeded, but with 6-7 prompts and 9 prompts respectively

#

It is about guessing artstyle btw

#

The task is deceptively simple: guess from subject matter

hollow imp Aug 17, 2025, 2:25 PM

#

😭

#

😭😭😭

#

😭

teal mantle Aug 17, 2025, 2:25 PM

#

I am trying Grok 4 cold start to be equal

mortal grotto Aug 17, 2025, 2:27 PM

#

error seems to be gone. Thanks

echo aurora Aug 17, 2025, 2:30 PM

#

mortal grotto error seems to be gone. Thanks

same same

#

how about you @weak sluice ?

weak sluice Aug 17, 2025, 2:32 PM

#

It's good now!

crisp ocean Aug 17, 2025, 2:34 PM

#

Hi everyone! I’m new here, excited to discover LMArena and to experiment with video and image generations. Looking forward to learning from you all!

echo aurora Aug 17, 2025, 2:36 PM

#

crisp ocean Hi everyone! I’m new here, excited to discover LMArena and to experiment with vi...

Hello and welcome birbwave

teal mantle Aug 17, 2025, 2:51 PM

#

why is the security verification delayed?

#

been wondering this

languid crescent Aug 17, 2025, 2:53 PM

#

uh oh is lmarena down?

#

im also stuck in verification

#

nvm i reset my brwoser it works now

jovial sapphire Aug 17, 2025, 2:56 PM

#

Yeah

#

I can't get to edit an image

#

annoying

stray aspen Aug 17, 2025, 3:00 PM

#

any gemini 3 news

#

bro lmarena wont open

#

whats going on

jovial sapphire Aug 17, 2025, 3:01 PM

#

It works

stray aspen Aug 17, 2025, 3:01 PM

#

jovial sapphire Aug 17, 2025, 3:01 PM

#

I'm on it right now

#

It's your internet connection

#

Also, don't spam refresh cuz it mmay block you

stray aspen Aug 17, 2025, 3:01 PM

#

alright its working now

quick turret Aug 17, 2025, 3:02 PM

#

Hello

jovial sapphire Aug 17, 2025, 3:16 PM

#

"Generate a composite image of the model showcasing side, back, and all perspective views, all combined into a single image."

#

Nano-banana result:

#

Excellent result

teal mantle Aug 17, 2025, 3:23 PM

#

o3 still edges out GPT-5

#

still unsupported?

stray aspen Aug 17, 2025, 3:31 PM

#

jovial sapphire Nano-banana result:

great instruction following

teal mantle Aug 17, 2025, 3:32 PM

#

all claude family model do not support image input

mental briar Aug 17, 2025, 3:33 PM

#

teal mantle all claude family model do not support image input

Maybe Claude is too expensive

#

So they cut image input function

ocean vortex Aug 17, 2025, 3:34 PM

#

teal mantle o3 still edges out GPT-5

gpt5-high > o3-high

teal mantle Aug 17, 2025, 3:34 PM

#

ocean vortex gpt5-high > o3-high

maybe I need more iterations to test

rare python Aug 17, 2025, 3:34 PM

#

echo aurora It's a bug 🙁

is this bug hard to fix 😢

teal mantle Aug 17, 2025, 3:36 PM

#

but so far for frontier models, gpt-5 (these prompts were not yet optimized since they are deliberately reactive and iterated) took 2 prompts more to guess

ocean vortex Aug 17, 2025, 3:37 PM

#

teal mantle but so far for frontier models, gpt-5 (these prompts were not yet optimized sinc...

what was the test, exactly?

teal mantle Aug 17, 2025, 3:37 PM

#

ocean vortex what was the test, exactly?

artstyle guessing

ocean vortex Aug 17, 2025, 3:38 PM

#

teal mantle artstyle guessing

can you paste the image/prompt? Curious to see it 👀

teal mantle Aug 17, 2025, 3:38 PM

#

I see if there is any sharable

#

lmarena doesn't have sharing

#

mind sharing grok? (since it seems to be one of the fastest contender)

swift vapor Aug 17, 2025, 3:43 PM

#

hey i want to know what generation model was used - and i got 3 -3 votes..... wh y can't i see ? @echo aurora

gaunt meteor Aug 17, 2025, 3:43 PM

#

What is the rate limit on image gen

wild quartz Aug 17, 2025, 3:43 PM

#

Hi

gaunt meteor Aug 17, 2025, 3:44 PM

#

The image didn't appeare and I clicked regen a bit too many times

#

And it says I need to wait for an hour

swift vapor Aug 17, 2025, 3:44 PM

#

swift vapor hey i want to know what generation model was used - and i got 3 -3 votes..... wh...

do anyone know ?? I really wanna know

wild quartz Aug 17, 2025, 3:44 PM

#

@gaunt meteor ya it take a long

#

Well perplexity is doing great , i just wanna to know how their "discover" feature works the news (th news that summerized by ai and it gets uploaded by it self ) and it covers all categories , it's kinda amazing

fast halo Aug 17, 2025, 3:51 PM

#

Why can I not generate, I try click agree and it gives me an error

ocean vortex Aug 17, 2025, 3:51 PM

#

yeah it is very good. But I think they did a mistake calling everything "gpt5". Especially calling it gpt5 for free users....

wild quartz Aug 17, 2025, 3:51 PM

#

ocean vortex Aug 17, 2025, 3:51 PM

#

Free users the best that they can get is gpt5-thinking-mini (low to medium reasoning effort)

#

that is unimpressive at all

teal mantle Aug 17, 2025, 3:52 PM

#

I think I am getting ChatGPT team for more testing

ocean vortex Aug 17, 2025, 3:53 PM

#

and then gpt5-minimal is just bad...

teal mantle Aug 17, 2025, 3:54 PM

#

ocean vortex and then gpt5-minimal is just bad...

what is that even

#

oh btw is GPT 5 Pro worth it?

#

the model

ocean vortex Aug 17, 2025, 3:54 PM

#

IMO they probably should have called gpt5-chat and gpt5-minimal - gpt4.2, and then medium to high reasoning effort as gpt5. With being explicit when it's gpt5-mini (free users)

teal mantle Aug 17, 2025, 3:54 PM

#

seems underdiscussed

#

the minimal one is quite lazy imo

gaunt meteor Aug 17, 2025, 3:56 PM

#

The one plus users get in app is ass

ocean vortex Aug 17, 2025, 3:56 PM

#

The way it is now they are kinda dilluting gpt5 name into models that do not perform...

gaunt meteor Aug 17, 2025, 3:57 PM

#

Still cannot solve 5.9 = x + 5.11

#

🤣

ocean vortex Aug 17, 2025, 4:00 PM

#

teal mantle what is that even

gpt5 with reasoning effort set to minimal. Minimal is below "low" and basically means no reasoning

echo aurora Aug 17, 2025, 4:02 PM

#

swift vapor hey i want to know what generation model was used - and i got 3 -3 votes..... wh...

Hey sorry to say this is a bug we're working on fixing, I anticipate it'll be working by end of day Monday

chrome flume Aug 17, 2025, 4:06 PM

#

yo glad to be here

wild quartz Aug 17, 2025, 4:09 PM

#

if a model like GPT-5 had real memory instead of just context windows, would that feel a bit like AGI? Or nah

swift vapor Aug 17, 2025, 4:09 PM

#

echo aurora Hey sorry to say this is a bug we're working on fixing, I anticipate it'll be wo...

okay... hopefully it does

solid brook Aug 17, 2025, 4:18 PM

#

jovial sapphire Nano-banana result:

Damn this model is really good

jovial sapphire Aug 17, 2025, 4:23 PM

#

solid brook Damn this model is really good

It's very bad if your prompt is not detailed enough

crystal jasper Aug 17, 2025, 4:24 PM

#

I need video prompt generate any recommend models?

jovial sapphire Aug 17, 2025, 4:24 PM

#

Like other models are better

ocean vortex Aug 17, 2025, 4:25 PM

#

ocean vortex gpt5 with reasoning effort set to minimal. Minimal is below "low" and basically ...

But the performance is very poor when you do that... gpt4o level. Which is where this naming ambiguity comes from

crystal jasper Aug 17, 2025, 4:25 PM

#

crystal jasper I need video prompt generate any recommend models?

..

jovial sapphire Aug 17, 2025, 4:27 PM

#

Sometimes Nano banana gives me back the exact same image I sent lol

worthy sleet Aug 17, 2025, 4:34 PM

#

I'm getting this error "❌ Generation failed. Failed to create evaluation session." all the time. What's that about?

#

in the video chats

jovial sapphire Aug 17, 2025, 4:34 PM

#

Okay, I have something interesting about Nano Banana

#

I sent this meme

#

and I told it : "edit this reddit post like it was from an alternate reality, the guy actually got a tattoo, edit the title so, and add a tattoo to his arm"

#

And it's the best result I got so far!

#

It understood how to edit the text etc

clear spear Aug 17, 2025, 4:38 PM

#

magnificent

lofty elm Aug 17, 2025, 4:39 PM

#

Hi i just joined to ask some question from curiosity, can LMArena generate images while using ST? tia!

fossil fable Aug 17, 2025, 4:39 PM

#

ocean vortex But the performance is very poor when you do that... gpt4o level. Which is where...

i wish the immediate gpt-5 was genuinely improved upon

clear spear Aug 17, 2025, 4:39 PM

#

lofty elm Hi i just joined to ask some question from curiosity, can LMArena generate image...

what in the world is st

jovial sapphire Aug 17, 2025, 4:39 PM

#

amazing gif

clear spear Aug 17, 2025, 4:40 PM

#

STI

fossil fable Aug 17, 2025, 4:40 PM

#

amazing gif

jovial sapphire Aug 17, 2025, 4:40 PM

#

fossil fable amazing gif

npc aaah reaction

#

exotic nebula Aug 17, 2025, 4:40 PM

#

magnificent gif

jovial sapphire Aug 17, 2025, 4:40 PM

#

blud thinks he's me

clear spear Aug 17, 2025, 4:41 PM

#

jovial sapphire

IT'S IN FRENCH

lofty elm Aug 17, 2025, 4:41 PM

#

clear spear what in the world is st

Silly Tavern

jovial sapphire Aug 17, 2025, 4:41 PM

#

yea cuz i'm french duh

clear spear Aug 17, 2025, 4:41 PM

#

jovial sapphire yea cuz i'm french duh

HOW U DID DAT

jovial sapphire Aug 17, 2025, 4:41 PM

#

do what?

clear spear Aug 17, 2025, 4:41 PM

#

jovial sapphire do what?

https://tenor.com/view/fight-mad-drag-angry-pull-hair-gif-17222106

Tenor

fossil fable Aug 17, 2025, 4:41 PM

#

jovial sapphire

npc aaah reaction

meager harbor Aug 17, 2025, 4:41 PM

#

keen beacon You can encrypt whatever you want to say with a ROTn cipher and tell the model t...

Okay, but my point is: why would LMArena censor prompts in the first place? Censorship itself should be part of a model’s benchmarking. If you exclude censored prompts, you end up biasing the rankings because you’re no longer measuring how the models actually respond in real-world conditions

jovial sapphire Aug 17, 2025, 4:41 PM

#

clear spear https://tenor.com/view/fight-mad-drag-angry-pull-hair-gif-17222106

meow

fossil fable Aug 17, 2025, 4:42 PM

#

meager harbor Okay, but my point is: why would LMArena censor prompts in the first place? Cens...

wait this is a very good point

jovial sapphire Aug 17, 2025, 4:42 PM

#

quentin is onto something

#

https://tenor.com/view/elon-musk-this-is-spacex-tesla-this-is-elon-musk-gif-24512168

Tenor

#

quentin be like

#

"ok but my point was, why woud lma rena censor things ? censoring must be part of a model benchmark. By excluding censored prompt, you bias the ranking"

#

genius aaah insight

keen beacon Aug 17, 2025, 4:42 PM

#

meager harbor Okay, but my point is: why would LMArena censor prompts in the first place? Cens...

They probs don't want much corn in the dataset

jovial sapphire Aug 17, 2025, 4:42 PM

#

bro tryna be smart 😂

lofty elm Aug 17, 2025, 4:43 PM

#

lofty elm Hi i just joined to ask some question from curiosity, can LMArena generate image...

already checked this but still doesn't work, maybe LMArena doesn't support Image generation API to ST directly

jovial sapphire Aug 17, 2025, 4:43 PM

#

guys

#

some people have nano banana

#

on discord

#

how???

clear spear Aug 17, 2025, 4:43 PM

#

WHAT

exotic nebula Aug 17, 2025, 4:43 PM

#

jovial sapphire some people have nano banana

source?

clear spear Aug 17, 2025, 4:43 PM

#

I almost forgot I'm downloading valorant!

jovial sapphire Aug 17, 2025, 4:44 PM

#

exotic nebula source?

i've seen screenshots

#

like a guy receives a message

#

and the account name is "nano banana

#

"

clear spear Aug 17, 2025, 4:44 PM

#

WHO DID DAT

exotic nebula Aug 17, 2025, 4:45 PM

#

Mods ig

jovial sapphire Aug 17, 2025, 4:45 PM

#

hey it's a dog

#

are you like

clear spear Aug 17, 2025, 4:45 PM

#

PINEAPLLE

jovial sapphire Aug 17, 2025, 4:45 PM

#

14?

#

just asking ^^'

lofty elm Aug 17, 2025, 4:45 PM

#

it seems it doesn't really work in ST, i kept trying it

jovial sapphire Aug 17, 2025, 4:45 PM

#

or you're on coke

clear spear Aug 17, 2025, 4:45 PM

#

COME OUT HO

echo aurora Aug 17, 2025, 4:45 PM

#

Yeak keep conversation related to AI and safe for work please.

jovial sapphire Aug 17, 2025, 4:45 PM

#

echo aurora Yeak keep conversation related to AI and safe for work please.

pineapple

#

do you know how people get to use

meager harbor Aug 17, 2025, 4:45 PM

#

jovial sapphire genius aaah insight

https://tenor.com/view/gigachad-chad-gif-20773266

Tenor

clear spear Aug 17, 2025, 4:45 PM

#

echo aurora Yeak keep conversation related to AI and safe for work please.

you're so nice

jovial sapphire Aug 17, 2025, 4:45 PM

#

models on discord?

lofty elm Aug 17, 2025, 4:46 PM

#

echo aurora Yeak keep conversation related to AI and safe for work please.

I have a question, do image gen works as back end for ST? thanks

echo aurora Aug 17, 2025, 4:46 PM

#

jovial sapphire models on discord?

Yeah more info is in #1397655624103493813

echo aurora Aug 17, 2025, 4:46 PM

#

lofty elm I have a question, do image gen works as back end for ST? thanks

ST?

exotic nebula Aug 17, 2025, 4:46 PM

#

lofty elm Hi i just joined to ask some question from curiosity, can LMArena generate image...

ST is interesting material bro 😭 what type of images you wanna create?

lofty elm Aug 17, 2025, 4:46 PM

#

exotic nebula ST is interesting material bro 😭 what type of images you wanna create?

how?

exotic nebula Aug 17, 2025, 4:46 PM

#

echo aurora ST?

Silly Tavern

worthy sleet Aug 17, 2025, 4:46 PM

#

with /image-to-video it seems that if you upload a .avif file it will fail with an uninformative error

clear spear Aug 17, 2025, 4:47 PM

#

I love eating pineapples btw

#

https://tenor.com/view/stan-twitter-girl-biting-finger-at-football-game-gif-14721946310722903004

Tenor

lofty elm Aug 17, 2025, 4:47 PM

#

exotic nebula ST is interesting material bro 😭 what type of images you wanna create?

i don't understand the accusations

echo aurora Aug 17, 2025, 4:47 PM

#

lofty elm I have a question, do image gen works as back end for ST? thanks

I'm not sure

lofty elm Aug 17, 2025, 4:47 PM

#

just curious for the possbilities

#

okay then thanks

exotic nebula Aug 17, 2025, 4:47 PM

#

lofty elm i don't understand the accusations

Uh my bad, my friends use it for very questionable stuff. Forgive me for my rudeness.

tame horizon Aug 17, 2025, 4:48 PM

#

Good afternoon, what site is this? Can someone tell me, and what are the other sites?

jovial sapphire Aug 17, 2025, 4:48 PM

#

echo aurora Yeah more info is in <#1397655624103493813>

yeah but we can't send images to edit

tame horizon Aug 17, 2025, 4:50 PM

#

Does anyone know how to put a website inside a modal? For example, websites block x frame, does anyone know an alternative way? I've already tried Opus but it doesn't even solve the problem.

jovial sapphire Aug 17, 2025, 4:50 PM

#

Try gpt5

echo aurora Aug 17, 2025, 4:50 PM

#

jovial sapphire yeah but we can't send images to edit

going to try to repro 👍

jovial sapphire Aug 17, 2025, 4:51 PM

#

thanks!

tame horizon Aug 17, 2025, 4:51 PM

#

@echo aurora can you help me, anyone?

jovial sapphire Aug 17, 2025, 4:51 PM

#

it's not related to ai or llm arena

#

Nano banana

#

Edit the image so that half of the face is Vladimir Putin’s face, and the other half is his skull. The skull side should not have long hair like in the original image; instead, it should have the same hairstyle as Vladimir Putin. The two halves should blend seamlessly, with no visible line of separation, just like in the original image’s style.

tame horizon Aug 17, 2025, 4:52 PM

#

tame horizon Good afternoon, what site is this? Can someone tell me, and what are the other s...

If you can answer this I would appreciate it

fossil fable Aug 17, 2025, 4:52 PM

#

where tf can a girl go to nerd out about models

blazing bison Aug 17, 2025, 4:52 PM

#

jovial sapphire Nano banana

it's actually not good

echo aurora Aug 17, 2025, 4:53 PM

#

tame horizon Does anyone know how to put a website inside a modal? For example, websites bloc...

I'm not sure I'm following the question

jovial sapphire Aug 17, 2025, 4:53 PM

#

blazing bison it's actually not good

Yeah the separation

#

is not good

#

But it's difficult

fossil fable Aug 17, 2025, 4:53 PM

#

fossil fable where tf can a girl go to nerd out about models

huggingface server is skull

jovial sapphire Aug 17, 2025, 4:53 PM

#

The task isn't easy

echo aurora Aug 17, 2025, 4:53 PM

#

fossil fable where tf can a girl go to nerd out about models

here here

tame horizon Aug 17, 2025, 4:53 PM

#

tame horizon Does anyone know how to put a website inside a modal? For example, websites bloc...

Please someone help me

clear spear Aug 17, 2025, 4:53 PM

#

echo aurora I'm not sure I'm following the question

can I have nitro papi

lofty elm Aug 17, 2025, 4:53 PM

#

echo aurora I'm not sure

I really am curious since i did it before but with separated API of image generation and LLM that combine to work on Chat and Visuals, but it's kinda difficult and messy to set it up so that's why i asked if i could use it as a chat as well as image generations for immersive interaction

echo aurora Aug 17, 2025, 4:53 PM

#

clear spear can I have nitro papi

win one of our generation contests!

clear spear Aug 17, 2025, 4:53 PM

#

echo aurora win one of our generation contests!

GURL PLEASE

#

IT'S 10 BUCKS

#

JUST GIMME

lofty elm Aug 17, 2025, 4:54 PM

#

its been 2 years anyway so it doesn't matter anymore, just way out of curiousity

exotic nebula Aug 17, 2025, 4:54 PM

#

clear spear GURL PLEASE

ayo bro calm down before you get yeeted outta the server

fossil fable Aug 17, 2025, 4:54 PM

#

echo aurora here here

uhhhhhhhhhhhhhhhhhhhhhhhhhhhwell i would

if that's what ppl were focused on

well they are but not quite

i always end up nerding out to ai instead of actual human beings so

clear spear Aug 17, 2025, 4:54 PM

#

exotic nebula ayo bro calm down before you get yeeted outta the server

pineapple loves meh

lofty elm Aug 17, 2025, 4:54 PM

#

clear spear pineapple loves meh

I love pineapple pizza

jovial sapphire Aug 17, 2025, 4:54 PM

#

ew

clear spear Aug 17, 2025, 4:54 PM

#

lofty elm I love pineapple pizza

me too

jovial sapphire Aug 17, 2025, 4:54 PM

#

ew

echo aurora Aug 17, 2025, 4:55 PM

#

jovial sapphire yeah but we can't send images to edit

#video-arena-3 message seems to be working for me, are you running into an error?

lofty elm Aug 17, 2025, 4:55 PM

#

jovial sapphire ew

you hate it cuz society tells you to

#

so meanie

clear spear Aug 17, 2025, 4:55 PM

#

jovial sapphire ew

only if "pineapple" is on my pizza 🫦

jovial sapphire Aug 17, 2025, 4:55 PM

#

echo aurora https://discord.com/channels/1340554757349179412/1400148597768720384/14066822510...

you cant sned images

#

when you're in /image prompt:

tame horizon Aug 17, 2025, 4:55 PM

#

tame horizon Does anyone know how to put a website inside a modal? For example, websites bloc...

Do you have any solution to put a website inside a modal and make it load? Or do websites not block iframes? Is there any other way? Do you know anyone who can help me? @echo aurora or does anyone know?

jovial sapphire Aug 17, 2025, 4:55 PM

#

lofty elm you hate it cuz society tells you to

nah lmao

#

it's just disgusting

#

and has nothing to do on a pizza

lofty elm Aug 17, 2025, 4:55 PM

#

See

#

you just proved my point

jovial sapphire Aug 17, 2025, 4:56 PM

#

ok light yagami

lofty elm Aug 17, 2025, 4:56 PM

#

Pineapple Pizzas are the best

jovial sapphire Aug 17, 2025, 4:56 PM

#

ew

#

average american:

lofty elm Aug 17, 2025, 4:56 PM

#

cry about it

neon idol Aug 17, 2025, 4:56 PM

#

lofty elm Pineapple Pizzas are the best

Stfu

jovial sapphire Aug 17, 2025, 4:56 PM

#

based

exotic nebula Aug 17, 2025, 4:56 PM

#

neon idol Stfu

Ayo wsp brother

#

How you doing

neon idol Aug 17, 2025, 4:57 PM

#

exotic nebula Ayo wsp brother

Yo

lofty elm Aug 17, 2025, 4:57 PM

#

What's so based about hating pinapple pizza

#

lol

jovial sapphire Aug 17, 2025, 4:57 PM

#

cuz it's just a disgrace to italia's culture

neon idol Aug 17, 2025, 4:57 PM

#

exotic nebula How you doing

Everithing good and you?

tame horizon Aug 17, 2025, 4:57 PM

#

@lofty elm Can you help me someone?

neon idol Aug 17, 2025, 4:57 PM

#

jovial sapphire cuz it's just a disgrace to italia's culture

I am italian lol

lofty elm Aug 17, 2025, 4:57 PM

#

jovial sapphire cuz it's just a disgrace to italia's culture

Well too bad im asian

jovial sapphire Aug 17, 2025, 4:57 PM

#

neon idol I am italian lol

yeah i understand you then lol

lofty elm Aug 17, 2025, 4:57 PM

#

cry about it

jovial sapphire Aug 17, 2025, 4:57 PM

#

lofty elm Well too bad im asian

who cares, it's just disgusting

#

it's like eating noodles with ketchup

#

or ramen with burgers

lofty elm Aug 17, 2025, 4:58 PM

#

jovial sapphire who cares, it's just disgusting

i cut pasta to half when i cook spaghetti

jovial sapphire Aug 17, 2025, 4:58 PM

#

neon idol Aug 17, 2025, 4:58 PM

#

jovial sapphire or ramen with burgers

Or put soya sauce on rice

jovial sapphire Aug 17, 2025, 4:58 PM

#

nano banana is so weird

lofty elm Aug 17, 2025, 4:58 PM

#

tame horizon <@749840497094164532> Can you help me someone?

what

exotic nebula Aug 17, 2025, 4:58 PM

#

neon idol Everithing good and you?

Fine, how's your testing coming along? Have made any benchmarks?

jovial sapphire Aug 17, 2025, 4:58 PM

#

lofty elm i cut pasta to half when i cook spaghetti

EW

jovial sapphire Aug 17, 2025, 4:58 PM

#

jovial sapphire

sometimes it's good

#

sometimes its the worse model ever

echo aurora Aug 17, 2025, 4:58 PM

#

Hey so as much as I love convos about food let's not discuss that here. This should be a channel dedicated to discuss AI related topics in good faith.

jovial sapphire Aug 17, 2025, 4:58 PM

#

i don't know

#

i'm dead

#

his profile is so goofy

lofty elm Aug 17, 2025, 4:59 PM

#

jovial sapphire

how do i access this?

jovial sapphire Aug 17, 2025, 4:59 PM

#

but he remains so serious

exotic nebula Aug 17, 2025, 4:59 PM

#

jovial sapphire

am I tripping or is that skeleton having hair? Mind sharing the prompt?

gaunt meteor Aug 17, 2025, 4:59 PM

#

What is the actual rate limit for image gen

exotic nebula Aug 17, 2025, 4:59 PM

#

lofty elm how do i access this?

Battle mode in image generation

lofty elm Aug 17, 2025, 4:59 PM

#

exotic nebula Battle mode in image generation

ohhhh

jovial sapphire Aug 17, 2025, 4:59 PM

#

exotic nebula am I tripping or is that skeleton having hair? Mind sharing the prompt?

the original image is

echo aurora Aug 17, 2025, 4:59 PM

#

gaunt meteor What is the actual rate limit for image gen

We don't have this info listed somewhere, we're looking into possibly sharing

jovial sapphire Aug 17, 2025, 4:59 PM

#

so i asked it to make the same with putin

#

but it failed 🙁

lofty elm Aug 17, 2025, 4:59 PM

#

isn't it tiring to write a prompt? i've been there before

tame horizon Aug 17, 2025, 4:59 PM

#

tame horizon Good afternoon, what site is this? Can someone tell me, and what are the other s...

Does anyone know a solution for the site to load inside a modal? And does anyone know the name of this site?

jovial sapphire Aug 17, 2025, 4:59 PM

#

blud is lazy

fossil fable Aug 17, 2025, 5:00 PM

#

-# this place currently sounds like a minecraft pvp smp hosted by a 13 year old 😭

Screenshot_2025-08-17-17-59-57-55_572064f74bd5f9fa804b05334aa4f912.jpg

echo aurora Aug 17, 2025, 5:00 PM

#

tame horizon Does anyone know a solution for the site to load inside a modal? And does anyone...

No sorry I don't know the answer to your question.

lofty elm Aug 17, 2025, 5:00 PM

#

jovial sapphire blud is lazy

it's 2023 image gen prompts so i guess you wouldn't understand how it's hard to generate good outputs before

fossil fable Aug 17, 2025, 5:00 PM

#

i wish there was an app where i could just benchmark models on anything on demand

jovial sapphire Aug 17, 2025, 5:01 PM

#

lofty elm it's 2023 image gen prompts so i guess you wouldn't understand how it's hard to ...

meow meow meow

fossil fable Aug 17, 2025, 5:01 PM

#

fossil fable i wish there was an app where i could just benchmark models on anything on deman...

it feels very possible

#

right

tame horizon Aug 17, 2025, 5:01 PM

#

lofty elm what

I want to know, I'm asking for real help to load the site within a modal when clicking, for example, on a button with the link, and to know the name of the site that was in the image. I'm looking for solutions

exotic nebula Aug 17, 2025, 5:01 PM

#

fossil fable i wish there was an app where i could just benchmark models on anything on deman...

Interesting idea, know of any sites?

jovial sapphire Aug 17, 2025, 5:01 PM

#

Guys...

#

I think Nano Banana is over hyped...

#

It's super unstable

lofty elm Aug 17, 2025, 5:02 PM

#

tame horizon I want to know, I'm asking for real help to load the site within a modal when cl...

i have no idea what you're talking about, is it from LMArena?

fossil fable Aug 17, 2025, 5:02 PM

#

exotic nebula Interesting idea, know of any sites?

... kek

i wouldn't be saying this if i did

exotic nebula Aug 17, 2025, 5:03 PM

#

fossil fable ...<:kek:1037963761618784406> i wouldn't be saying this if i did

soka

tame horizon Aug 17, 2025, 5:06 PM

#

lofty elm i have no idea what you're talking about, is it from LMArena?

I'll explain it better when you click on the icon with the link, I want it to open in the modal, you know, inside a box inside an iframe, but they block the sites and iFrames, you know. And then I'm trying to find out if you know anything, and if you know anything about some technique, some way to make the site load inside an iframe, you know. I'm also asking you and others who have information, for this information. I'm also asking for information about whether anyone knows about this site, this site that the guy showed that GPT was at the top.

lofty elm Aug 17, 2025, 5:07 PM

#

tame horizon I'll explain it better when you click on the icon with the link, I want it to op...

i have no idea im sorry, my only focus is on ST from only chatting,

lofty elm Aug 17, 2025, 5:07 PM

#

echo aurora No sorry I don't know the answer to your question.

Do negative prompts work? curious

fossil fable Aug 17, 2025, 5:08 PM

#

exotic nebula soka

...

#

you can't vibe code if you have no coding knowledge like me! :3

#

and no money

lofty elm Aug 17, 2025, 5:09 PM

#

omg just tested one of my old prompts

#

what model is this being used

#

oohhh flux 1

exotic nebula Aug 17, 2025, 5:11 PM

#

tame horizon I'll explain it better when you click on the icon with the link, I want it to op...

As far as I know, if the site blocks iframes, you cannot do much about it. There are no techniques to bypass it. Maybe instead of trying to load the entire site, how about you pull certain data with APIs?

As for your second doubt, I see it is some sort of benchmark, I tried using Google lens and AI vision but can't find it, just search the 'benchmark-type' + benchmark and surf tbe net.

lofty elm Aug 17, 2025, 5:11 PM

#

can i change it though?, the image gen model

#

oh wow

elfin nova Aug 17, 2025, 5:28 PM

#

hello

#

guys

potent glacier Aug 17, 2025, 5:37 PM

#

Is the site being extra slow?

#

Over 200 seconds in Battle and the images haven’t genned yet

#

Nevermind

#

It’s only on mobile when it acts up

#

On mobile you have to keep making new chats and can’t continue to make new battles in the same chat

keen beacon Aug 17, 2025, 5:49 PM

#

@echo aurora

keen beacon Aug 17, 2025, 6:00 PM

#

lofty elm Well too bad im asian

Are you good at math

#

Also do you speak Japanese

ocean vortex Aug 17, 2025, 6:01 PM

#

fossil fable i wish the immediate gpt-5 was genuinely improved upon

Yeah but I think it's very hard to do as the model size is not huge and performance of gpt4.1 was already very respectable. And then when you do hybrid reasoning, it gets even harder, quite significantly harder lol

#

It has to improve on non-reasoning WHILE also do better than o3 when it is reasoning. And all of that without increasing model size

#

mission impossible. Which is why gpt5-minimal performs worse than gpt4.1 🤷‍♂️

#

That's a more acceptable compromise than it not being able to beat o3 when reasoning

woeful gust Aug 17, 2025, 6:05 PM

#

hello

fleet lintel Aug 17, 2025, 6:11 PM

#

#

Is difference smaller now? just 6 points between 2.5 pro and Gpt-5-high.?

#

I thought it was 21?

solid brook Aug 17, 2025, 6:26 PM

#

fleet lintel Is difference smaller now? just 6 points between 2.5 pro and Gpt-5-high.?

Idk but gpt 5 high is way better than gemini 2.5

#

Google is benchmaxing

stray aspen Aug 17, 2025, 6:37 PM

#

gpt-5 high is so good

stray aspen Aug 17, 2025, 6:37 PM

#

solid brook Google is benchmaxing

nice

#

your learning from craig

fleet lintel Aug 17, 2025, 6:40 PM

#

stray aspen your learning from craig

lol 🙂

fleet lintel Aug 17, 2025, 6:40 PM

#

solid brook Google is benchmaxing

More like GPT-5 is good but not as good as we hoped for

golden ocean Aug 17, 2025, 6:41 PM

#

stray aspen your learning from craig

It is true probably

#

i provided complex mc(block game) problem using a mod library that is like unknown by anyone and gemini couldnt do sh t and didnt even come close to not writing errors in the code

#

to gemini 2.5 pro

#

and claude 4.1 opus absolutely destroyed gemini in that

#

provide unique problem to gemini: it dies; 0 iq

fleet lintel Aug 17, 2025, 6:43 PM

#

i thought it is well known that coding is one part where gemini is behind a bit.

#

in other tasks, I think it is on par

#

in any case, I am just surprised that GPT-5-high is only 6 points above 2.5-pro.

white hatch Aug 17, 2025, 6:46 PM

#

golden ocean i provided complex mc(block game) problem using a mod library that is like unkno...

^

tame horizon Aug 17, 2025, 6:53 PM

#

exotic nebula As far as I know, if the site blocks iframes, you cannot do much about it. There...

Thank you very much, friend. I found the solution. I'm going to put a browser in the mini browser, something like this, and when someone clicks on this frame, it will be linked to a project of mine in the repository, which is a browser. You know, a web browser, a browser page, you know. I'll try to use as much programming as possible to make it work, but that's basically it. We'll have to create a browser so that when someone clicks, it won't be directly in that iframe script thing; it will be connected to this browser. Project complete.

tame horizon Aug 17, 2025, 6:54 PM

#

stray aspen gpt-5 high is so good

what's up friend?

#

How are you working here, by the way, isn't it?

torn mantle Aug 17, 2025, 6:55 PM

#

lofty elm oh wow

ive seen many models with the same quality

#

its not challenging to gen something like that

tame horizon Aug 17, 2025, 6:55 PM

#

torn mantle ive seen many models with the same quality

Hi Asura, how are you?

sullen depot Aug 17, 2025, 6:57 PM

#

Hi. I'm new can anyone tech me how to create videos

torn mantle Aug 17, 2025, 7:03 PM

#

tame horizon Hi Asura, how are you?

i just blocked this guy

#

https://x.com/teortaxesTex

Teortaxes▶️ (DeepSeek 推特🐋铁粉 2023 – ∞) (@teortaxe...

We're in a race. It's not USA vs China but humans and AGIs vs ape power centralization.
@deepseek_ai stan #1, 2023–Deep Time

«C’est la guerre.» ®1

#

should i unblock or nah?

#

im feeling good tho

leaden meteor Aug 17, 2025, 7:08 PM

#

Did anyone notice any change in gpt-5-high behaviour in last two days? Twitter says there was an update couple days ago to make it 'warmer' but I dont see much difference...

stray aspen Aug 17, 2025, 7:10 PM

#

tame horizon How are you working here, by the way, isn't it?

wassup

weak sluice Aug 17, 2025, 7:11 PM

#

I love the nano banana model i saw in battle mode

leaden meteor Aug 17, 2025, 7:12 PM

#

This is image model, isnt it?

tame horizon Aug 17, 2025, 7:14 PM

#

torn mantle i just blocked this guy

She did the same thing as Dogedesigner did with teortaxes, I think she was influenced, sorry for being kind, everything is the opposite for her nowadays

#

So sorry, at least I don't want to shame myself here.

wicked root Aug 17, 2025, 7:18 PM

#

Discussion time: Do you guys think Claude is severely underrated? Its win rates against GPT5 is impressive.

echo aurora Aug 17, 2025, 7:24 PM

#

weak sluice I love the nano banana model i saw in battle mode

We've made a new channel to discuss the topic #nano-banana 🍌

ocean vortex Aug 17, 2025, 7:25 PM

#

torn mantle should i unblock or nah?

You did good. That acc probably in-need of reporting as well... 🧐

weak sluice Aug 17, 2025, 7:32 PM

#

echo aurora We've made a new channel to discuss the topic <#1406720250778615868> 🍌

oooo goodie

hasty compass Aug 17, 2025, 7:45 PM

#

what is lm arena

#

guys

marsh stratus Aug 17, 2025, 7:46 PM

#

hasty compass what is lm arena

It’s an arena for LMs

torn bison Aug 17, 2025, 7:47 PM

#

fleet lintel Is difference smaller now? just 6 points between 2.5 pro and Gpt-5-high.?

I suspect that summit and gpt-5high are not exactly the same

fleet lintel Aug 17, 2025, 7:47 PM

#

torn bison I suspect that summit and gpt-5high are not exactly the same

was summit better ?

hasty compass Aug 17, 2025, 7:47 PM

#

marsh stratus It’s an arena for LMs

what is Lm

torn bison Aug 17, 2025, 7:47 PM

#

fleet lintel was summit better ?

imo yes

#

Although OpenAI says they are the same model.
They at least changed some of the system prompts

marsh stratus Aug 17, 2025, 7:48 PM

#

hasty compass what is Lm

It’s an m that is l

leaden meteor Aug 17, 2025, 7:49 PM

#

hasty compass what is Lm

Languge models. I am curious why are you on this discord if you wont know what LMArena is?

torn bison Aug 17, 2025, 7:51 PM

#

It might also be because people tended to vote for summit and zenith directly in the anon testing phase when trying to find them

next dagger Aug 17, 2025, 7:59 PM

#

YESSIR

torn bison Aug 17, 2025, 7:59 PM

#

imagine all the messy environments and compilation issues they have to prepare for 😂

torn mantle Aug 17, 2025, 8:07 PM

#

ocean vortex You did good. That acc probably in-need of reporting as well... 🧐

yes please

torn mantle Aug 17, 2025, 8:08 PM

#

tame horizon She did the same thing as Dogedesigner did with teortaxes, I think she was influ...

its ok

tawdry rose Aug 17, 2025, 8:12 PM

#

hello 👋

uneven falcon Aug 17, 2025, 8:24 PM

#

Hello😊

celest briar Aug 17, 2025, 8:24 PM

#

Hello y'all. I like to make AI tests.

undone pier Aug 17, 2025, 8:29 PM

#

hello good

#

Does anyone know what happened to flux contex max?

glacial mulch Aug 17, 2025, 8:33 PM

#

lmao why did nano banana get a whole channel

nimble trail Aug 17, 2025, 8:33 PM

#

glacial mulch lmao why did nano banana get a whole channel

It's just that good.

glacial mulch Aug 17, 2025, 8:34 PM

#

i need it GA so bad

nimble trail Aug 17, 2025, 8:34 PM

#

We will see next month ig

undone pier Aug 17, 2025, 8:37 PM

#

undone pier Does anyone know what happened to flux contex max?

?

ocean vortex Aug 17, 2025, 8:38 PM

#

gpt5-chat....

#

I suppose people *really *do not like the style of gpt5-chat lol

undone pier Aug 17, 2025, 8:40 PM

#

undone pier Does anyone know what happened to flux contex max?

Does anyone know anything?

ocean vortex Aug 17, 2025, 8:40 PM

#

yeah it has all the reasons to do well. o4-mini except improved

#

their previous naming was selling that model better ngl

#

He was probably confused

#

and like comparing gpt5-mini with reasoning against gpt5-chat

ornate agate Aug 17, 2025, 8:42 PM

#

why is it called nano banana btw

ocean vortex Aug 17, 2025, 8:43 PM

#

it may be better in some isolated scenarios but for the most part with the same settings it is worse like for like. But you can't be comparing one with reasoning the other one without, or different reasoning efforts / verbosity

#

If you do gpt5-mini-high vs gpt5-medium, then I'm sure they are comparable... Like how it was with o3-medium vs o4-mini-high. But these are not the same settings

undone pier Aug 17, 2025, 8:45 PM

#

undone pier Does anyone know anything?

¿?

obsidian cargo Aug 17, 2025, 8:47 PM

#

no I was born without a brain. just a lowly brain stem barely able to keep my basic biological functions running

tired dust Aug 17, 2025, 8:55 PM

#

Hey on the new LMArena can we use repo on it like in the legacy ?

vivid cargo Aug 17, 2025, 8:58 PM

#

how can I make videos here?

jade egret Aug 17, 2025, 9:10 PM

#

jade egret Aug 17, 2025, 9:21 PM

#

ocean vortex gpt5-chat....

😭

warm hare Aug 17, 2025, 9:24 PM

#

Hi

jade egret Aug 17, 2025, 9:25 PM

#

warm hare Hi

hi

jade egret Aug 17, 2025, 9:26 PM

#

jade egret

why gemini 2.5 pro deep think tho

stray aspen Aug 17, 2025, 9:27 PM

#

because its great

jade egret Aug 17, 2025, 9:34 PM

#

stray aspen because its great

yea

weak swan Aug 17, 2025, 9:37 PM

#

There used to be a graph that compared the cost per prompt vs the score on the leaderboard. Is that no longer maintained for the new site?

toxic whale Aug 17, 2025, 9:45 PM

#

i think i broke Gemini 2.5 pro and Grok-4

#

#

it started saying nonsense

scenic salmon Aug 17, 2025, 9:47 PM

#

jade egret

Really depends on the context, most knowledgeable without having to look things up, best at problem solving, fewest hallucinations, etc….

toxic whale Aug 17, 2025, 9:48 PM

#

does anyone here have access to Gemini 2.5 Pro deepthink? i would love to run a simple benchmark but dont have access to it

#

ive ran the benchmark for alot of models and only 2.5 pro deepthink is left basically

#

its only 10 questions

wicked root Aug 17, 2025, 10:15 PM

#

I would but how do I know you’re not a hacker named 4chan

stray aspen Aug 17, 2025, 10:23 PM

#

thats crazy

#

rishab is here

empty stump Aug 17, 2025, 10:47 PM

#

Didn't expect that

#

Joined 10 days ago

hollow imp Aug 17, 2025, 11:00 PM

#

poll_question_text

WHAT WILL COME FIRST?

victor_answer_votes

13

total_votes

16

victor_answer_id

1

victor_answer_text

GEMINI 3

willow grail Aug 17, 2025, 11:01 PM

#

who here doing game dev VIBES only via GPT5 MEDIUM TO HIGH?

#

glacial mulch Aug 17, 2025, 11:11 PM

#

bruh

stray aspen Aug 17, 2025, 11:54 PM

#

lmao

wintry citrus Aug 18, 2025, 12:14 AM

#

willow grail

everyone is dumb including me. Wow

wintry citrus Aug 18, 2025, 12:15 AM

#

willow grail

wait a min it is the smartest tho

#

it's kinda true

#

🙏

#

next dagger Aug 18, 2025, 12:43 AM

#

should the word "clank*r" be censored in this server guys? it's offensive to the ai bots that helps us all

native flame Aug 18, 2025, 12:44 AM

#

Hii, so the legacy site is definitely dead :'vv??

potent glacier Aug 18, 2025, 1:30 AM

#

next dagger should the word "clank*r" be censored in this server guys? it's offensive to the...

potent glacier Aug 18, 2025, 1:30 AM

#

native flame Hii, so the legacy site is definitely dead :'vv??

Yeah I ranted about it in here last night…I use it a lot more than the current site.

#

You can change the temperature and amount of tokens used on the legacy site while on the new one you can’t

#

Apparently they’re ‘looking into it’

#

@echo aurora filled me in as much as they were able to

jade egret Aug 18, 2025, 1:50 AM

#

willow grail

why is all the option here bad?

#

is gpt-5-high = gpt-5-pro?

golden ocean Aug 18, 2025, 1:52 AM

#

potent glacier

lmfao

broken coyote Aug 18, 2025, 1:52 AM

#

Nope

golden ocean Aug 18, 2025, 1:52 AM

#

tf

jade egret Aug 18, 2025, 1:53 AM

#

broken coyote Nope

o

#

my guess is pro is better right

broken coyote Aug 18, 2025, 1:55 AM

#

willow grail

i don't know why Google released such an expansive model, similar to the o1-pro, just to top the benchmarks, no one even uses that model lol

wintry tinsel Aug 18, 2025, 2:16 AM

#

My prediction, nano bannana is the native Text generation model for Gemini 3 flash, which will release alongside Gemini 3 pro, sometime in September

#

Gemini 3 pro, will be much better at coding and math, while being a little more sterile at creative writing than 2.5 pro, it will get a 65%-69% on simple bench , and be a net improvement to 2.5 pro, by a solid margin, but not a breakthrough

#

I will also bite off all my nails waiting for it since it’s been way too long since a true sota model released all the way back with Claude opus

sullen quest Aug 18, 2025, 3:03 AM

#

toxic whale i think i broke Gemini 2.5 pro and Grok-4

can I have the prompt?

floral gyro Aug 18, 2025, 3:23 AM

#

hello

wicked root Aug 18, 2025, 3:25 AM

#

You said gpt5 would beat gemini

scenic salmon Aug 18, 2025, 3:25 AM

#

wintry citrus

16 or so hockey rinks long

#

Pretty far

wicked root Aug 18, 2025, 3:30 AM

#

not without style control

ripe mountain Aug 18, 2025, 3:32 AM

#

scenic salmon Aug 18, 2025, 3:33 AM

#

Best at what catsip

ripe mountain Aug 18, 2025, 3:35 AM

#

scenic salmon Best at what <:catsip:770049584738205696>

overall

#

value for money maybe

scenic salmon Aug 18, 2025, 3:36 AM

#

Google wins easy then, cloud storage comes with the plan

ripe mountain Aug 18, 2025, 3:37 AM

#

scenic salmon Google wins easy then, cloud storage comes with the plan

Which one as an API?

scenic salmon Aug 18, 2025, 3:37 AM

#

Now you complicate things Hmm

ripe mountain Aug 18, 2025, 3:38 AM

#

scenic salmon Now you complicate things <:Hmm:965716034432696380>

a little

scenic salmon Aug 18, 2025, 3:42 AM

#

Today aside, I think Google will be the victor long term, OpenAI needs to turn a profit sooner or later, Google doesn’t, they can just funnel all of the extra data collected to help their advertising platform, since that’s what they are at the end of the day, an ad platform

ripe mountain Aug 18, 2025, 3:46 AM

#

scenic salmon Today aside, I think Google will be the victor long term, OpenAI needs to turn a...

Since Microsoft is backing OpenAI, I don't think money is an issue. Although Google has more money than OpenAI, it released Gemini 2.5 Pro as its flagship AI model. However, Grok 4, a better model, was released a month later.

scenic salmon Aug 18, 2025, 3:48 AM

#

Microsoft hasn’t given them any more since the initial investment and there has been friction between them lately with OpenAI trying to execute the AGI clause in their deal to stop having to share research with msft

mellow salmon Aug 18, 2025, 3:48 AM

#

hey guys , does anyone find o3 search more professional and organised than gpt search?

ripe mountain Aug 18, 2025, 3:48 AM

#

scenic salmon Microsoft hasn’t given them any more since the initial investment and there has ...

Despite everything, GPT 5 came to Copilot from day one.

mellow salmon Aug 18, 2025, 3:49 AM

#

I found o3 search more organised and professional

#

than gpt 5 search

keen beacon Aug 18, 2025, 3:59 AM

#

ripe mountain

R2 when it finally comes out.

scenic salmon Aug 18, 2025, 3:59 AM

#

Google throws money at the wall all the time to see what sticks, that’s literally all they do… the more ads they can serve to people, the happier their shareholders are

#

They profit from delivering more ads in more places…

ripe mountain Aug 18, 2025, 4:00 AM

#

keen beacon R2 when it finally comes out.

I heard that Deepseek R2 was postponed because it wasn't good enough.

keen beacon Aug 18, 2025, 4:01 AM

#

ripe mountain I heard that Deepseek R2 was postponed because it wasn't good enough.

Shhhh, let em cook

ripe mountain Aug 18, 2025, 4:01 AM

#

be chill

scenic salmon Aug 18, 2025, 4:02 AM

#

People are feeding their deepest desires and secrets into Gemini, that will enable them to target ads better than ever before

#

Advertisers pay more for better targeted ads

#

catthinking

#

Yes

#

Google is an ad platform… it’s what they do

verbal nimbus Aug 18, 2025, 4:23 AM

#

Any promising stealth models?

lofty elm Aug 18, 2025, 4:23 AM

#

torn mantle its not challenging to gen something like that

can do the same as i did?

unkempt bison Aug 18, 2025, 4:25 AM

#

hello

verbal nimbus Aug 18, 2025, 4:30 AM

#

They de-associate chats with users before sending I think

#

The data might still be useful for advertising algorithms

#

Especially if it has anonymized demographic data attached (age group and gender buckets)

keen beacon Aug 18, 2025, 4:54 AM

#

verbal nimbus They de-associate chats with users before sending I think

Sure they do

brave orbit Aug 18, 2025, 5:14 AM

#

icy forge Aug 18, 2025, 5:33 AM

#

In my opinion, the difference in output style between GPT-5 and Gemini 2.5 Pro stems partly from an overemphasis on reinforcement learning and an expanded knowledge base in fields like mathematics and physics, while lacking the corresponding human alignment seen in Gemini. I believe this is a positive trend. As a model's intelligence increasingly surpasses the collective thinking of any single human group, and with the prevalent use of parallel computing to enhance AI capabilities in the most advanced versions, GPT-5 sets a precedent for the future: one where humanity, in turn, aligns itself with the AI's trajectory.

rocky mauve Aug 18, 2025, 5:39 AM

#

icy forge In my opinion, the difference in output style between GPT-5 and Gemini 2.5 Pro s...

did u use ai to write this 😂

keen beacon Aug 18, 2025, 5:45 AM

#

https://ehudreiter.com/2025/01/03/we-need-better-llm-benchmarks/

Ehud Reiter's Blog

ehudreiter

We need better LLM benchmarks

Current benchmark (suites) for evaluating LLMs are disappointing. I describe the properties that I think good benchmarks and benchmark suites should have, but often do not, such as being correct, c…

icy forge Aug 18, 2025, 5:53 AM

#

rocky mauve did u use ai to write this 😂

How could you think like that?

jolly pilot Aug 18, 2025, 7:59 AM

#

How to create a vedio in this

ocean vortex Aug 18, 2025, 8:08 AM

#

keen beacon https://ehudreiter.com/2025/01/03/we-need-better-llm-benchmarks/

Contamination is not a huge problem if the dataset is diverse and substantial enough. Though the biggest needle movers are creating new benchmarks and releasing related research papers. Rather than writing a blogpost about how much everything supposedly sucks. 👀

sly estuary Aug 18, 2025, 8:19 AM

#

all model is error ?

warped ocean Aug 18, 2025, 8:31 AM

#

any people who works at lmarena here, maybe increase the threshold of nano-banana appearing by +50%? 👉👈

inland quest Aug 18, 2025, 8:48 AM

#

Why not 150%?

lilac pagoda Aug 18, 2025, 8:50 AM

#

sly estuary all model is error ?

Refresh the page

sly estuary Aug 18, 2025, 8:57 AM

#

yes i had, but not work

weak sluice Aug 18, 2025, 9:29 AM

#

dang error again...

#

now it's fine

#

phew

ocean vortex Aug 18, 2025, 9:35 AM

#

no work

digital pier Aug 18, 2025, 9:35 AM

#

anyone facing error?

ocean vortex Aug 18, 2025, 9:35 AM

#

work not found

rough monolith Aug 18, 2025, 9:39 AM

#

digital pier anyone facing error?

+1

ocean vortex Aug 18, 2025, 9:40 AM

#

Just checked. It seems unstable and some errors yeah, though some requests get through:

sly estuary Aug 18, 2025, 9:45 AM

#

it's fixed ?

#

i still got er

willow grail Aug 18, 2025, 10:09 AM

#

wintry tinsel Gemini 3 pro, will be much better at coding and math, while being a little more ...

okso why didnt we test gemin i3 in aistudio

willow grail Aug 18, 2025, 10:13 AM

#

wintry tinsel Gemini 3 pro, will be much better at coding and math, while being a little more ...

and thanks for reminding which benchmark can be fully ignored. simplebench

#

aka a bench which has nothing to do with real life tasks

#

just no. just no.... sigh sigh. so many sighs. just sigh.

#

is https://nano-banana.org/ scam?

Nano Banana

Nano Banana — Google’s Next-Gen Image Editing AI

Nano Banana — preview AI for inpainting, outpainting and background replacement. Mask-free, text-guided edits for product and creative workflows.

keen beacon Aug 18, 2025, 10:16 AM

#

willow grail is https://nano-banana.org/ scam?

Looks like it was AI generated.

willow grail Aug 18, 2025, 10:16 AM

#

i cant find banana model there

keen beacon Aug 18, 2025, 10:16 AM

#

And looks ugly

lofty elm Aug 18, 2025, 10:23 AM

#

i really got Very impressive responses

tired dust Aug 18, 2025, 10:27 AM

#

Hey on the new LMArena can we use repo from github on it like in the legacy ?

earnest rover Aug 18, 2025, 10:32 AM

#

where is flux kontext max in direct chat or not even in battle mode

ionic idol Aug 18, 2025, 10:35 AM

#

ocean vortex Aug 18, 2025, 10:35 AM

#

willow grail aka a bench which has nothing to do with real life tasks

It's actually a decent benchmark for reasoning, spatial reasoning to a good extent. Reasoning can trickle down to and affect execution of most IRL tasks

ionic idol Aug 18, 2025, 10:35 AM

#

pls fix

earnest rover Aug 18, 2025, 10:35 AM

#

they are doing something IG, something new or updating something

ocean vortex Aug 18, 2025, 10:37 AM

#

So you don't get issues like instructing it to change some visual element on the website and it's modifying the wrong property or not accounting to it's positioning in relation to everything else properly etc

pastel badge Aug 18, 2025, 11:03 AM

#

Is the reason why I can't create pictures or videos now simply because of the large number of users?

tough relic Aug 18, 2025, 11:08 AM

#

guys where can i get nano banana api

whole wraith Aug 18, 2025, 11:09 AM

#

tough relic guys where can i get nano banana api

No API yet it's a beta

#

I have this bug since 15 minutes now can't generate anything 😂

Screenshot_2025-08-18-13-09-21-56_4641ebc0df1485bf6b47ebd018b5ee76.jpg

jolly meadow Aug 18, 2025, 11:10 AM

#

Same.

pastel badge Aug 18, 2025, 11:14 AM

#

Same.

tough relic Aug 18, 2025, 11:15 AM

#

whole wraith I have this bug since 15 minutes now can't generate anything 😂

how long do u think it gonna take them to produce that comerccial api sir

whole wraith Aug 18, 2025, 11:15 AM

#

tough relic how long do u think it gonna take them to produce that comerccial api sir

Idk, but for now just use it as much as you can, we might need to pay for it later

pure comet Aug 18, 2025, 11:16 AM

#

zдраvстvуйте

tough relic Aug 18, 2025, 11:19 AM

#

whole wraith Idk, but for now just use it as much as you can, we might need to pay for it lat...

I wanted to build saas and connect it my app so I don't want to use by myself but for other people

keen beacon Aug 18, 2025, 11:19 AM

#

Hot take:

If GPT-5-chat is complete garbage without reasoning - 40 ELO below GPT-5-high and worse than 4.5 on LMArena leaderboard, suppose the new DeepSeek-R2 base model will be at least as good as latest Kimi - just around 5-Chat's performance