#general | Arena | Page 105

torn mantle Aug 25, 2025, 10:32 AM

#

did you find a way

#

yet

#

you are human right?

#

or maybe not

ocean vortex Aug 25, 2025, 10:32 AM

#

Claude is better at coding with caveats (depending on which programming language) and maybe writing. But 2.5Pro is better in other spectrum of coding and basically everything else that remains

solid brook Aug 25, 2025, 10:32 AM

#

torn mantle you are human right?

bro are you high?

torn mantle Aug 25, 2025, 10:32 AM

#

solid brook bro are you high?

no

#

you are

whole wagon Aug 25, 2025, 10:32 AM

#

GPT5 is best at maths

#

Like by a lot

#

That's probably the area it has the biggest advantage

solid brook Aug 25, 2025, 10:33 AM

#

ocean vortex Claude is better at coding with caveats (depending on which programming language...

I can't with you there is no point. just go on youtube watch a video comparing gemini gpt and claude

solid brook Aug 25, 2025, 10:35 AM

#

ocean vortex Claude is better at coding with caveats (depending on which programming language...

https://youtu.be/bAZhlpIXTc4?si=wTntuw8ooMGUHkFH

YouTube

Bijan Bowen

GPT-5 Pro vs Grok 4 Heavy vs Claude 4.1 Opus vs Gemini 2.5 Pro — ...

Timestamps:

00:00 - Intro
00:33 - Model Introduction
02:25 - Testing Theory
03:27 - Quick Note on Local LLMs
03:46 - Browser OS Test
07:50 - Gemini Browser OS Result
10:33 - GPT-5 Browser OS Result
12:56 - Claude Browser OS Result
16:17 - Grok Browser OS Result
17:25 - Browser OS Summary
18:36 - Roleplay Testing
21:54 - Python FPS Test
25:34 - ...

▶ Play video

whole wagon Aug 25, 2025, 10:35 AM

#

Musk said grok 5 is agi

#

True agi

keen beacon Aug 25, 2025, 10:35 AM

#

solid brook https://youtu.be/bAZhlpIXTc4?si=wTntuw8ooMGUHkFH

Based youtuber

whole wagon Aug 25, 2025, 10:35 AM

#

Kappa

torn mantle Aug 25, 2025, 10:35 AM

#

whole wagon Musk said grok 5 is agi

he did?

whole wagon Aug 25, 2025, 10:35 AM

#

Yes

ocean vortex Aug 25, 2025, 10:35 AM

#

solid brook I can't with you there is no point. just go on youtube watch a video comparing g...

That's not how you should be deciding. Reading random reddit comments or watching random youtube videos is not it lmao

solid brook Aug 25, 2025, 10:35 AM

#

keen beacon Based youtuber

bruh

torn mantle Aug 25, 2025, 10:36 AM

#

whole wagon Yes

lol

ocean vortex Aug 25, 2025, 10:36 AM

#

Anyone can make a video

keen beacon Aug 25, 2025, 10:36 AM

#

whole wagon Musk said grok 5 is agi

Most probable answer is that it's not AGI

ocean vortex Aug 25, 2025, 10:36 AM

#

about anything

#

means nothing lol

whole wagon Aug 25, 2025, 10:36 AM

#

https://x.com/elonmusk/status/1958499441469739329?t=ANvJhpSOL3rf08Z00Z72FQ&s=19

Elon Musk (@elonmusk)

Wait until you see Grok 5.

I think it has a shot at being true AGI.

Haven’t felt that about anything before.

keen beacon Aug 25, 2025, 10:36 AM

#

solid brook bruh

bruhhh, it's right though

keen beacon Aug 25, 2025, 10:36 AM

#

whole wagon https://x.com/elonmusk/status/1958499441469739329?t=ANvJhpSOL3rf08Z00Z72FQ&s=19

What even is AGI to Musk?

solid brook Aug 25, 2025, 10:36 AM

#

ocean vortex That's not how you should be deciding. Reading random reddit comments or watchin...

actually it is. benchmarks are kind of a lie. the benchmarks for gemini 2.5 pro are for the unnerfed version

ocean vortex Aug 25, 2025, 10:37 AM

#

solid brook actually it is. benchmarks are kind of a lie. the benchmarks for gemini 2.5 pro ...

"unnerfed"? 🤣

#

this is for the newest version

solid brook Aug 25, 2025, 10:37 AM

#

ocean vortex "unnerfed"? 🤣

yeah

ocean vortex Aug 25, 2025, 10:37 AM

#

they also have for the older ones - predictably those did worse

whole wagon Aug 25, 2025, 10:37 AM

#

It's the GA release

ocean vortex Aug 25, 2025, 10:37 AM

#

lemme find it

solid brook Aug 25, 2025, 10:37 AM

#

ocean vortex they also have for the older ones - predictably those did worse

gemini 2.5 pro 3-25 exp was the best

#

by FAR

#

they nerfed it

whole wagon Aug 25, 2025, 10:38 AM

#

Nah

solid brook Aug 25, 2025, 10:38 AM

#

yes

ocean vortex Aug 25, 2025, 10:38 AM

#

solid brook gemini 2.5 pro 3-25 exp was the best

just hype. Nothing special about it. Did worse

keen beacon Aug 25, 2025, 10:39 AM

#

ocean vortex just hype. Nothing special about it. Did worse

damn, deepseek 3.1 is slaying rn

whole wagon Aug 25, 2025, 10:39 AM

#

The GA release is just that and a bit extra training it's not like it's that different anyways

keen beacon Aug 25, 2025, 10:39 AM

#

or not

sly estuary Aug 25, 2025, 10:40 AM

#

i had try many time but not work.

solid brook Aug 25, 2025, 10:40 AM

#

oh

whole wagon Aug 25, 2025, 10:40 AM

#

Bro asked where's opus before it even released

ocean vortex Aug 25, 2025, 10:41 AM

#

They aren't including all models in all charts since they wouldn't fit. ss was from https://artificialanalysis.ai/models/gemini-2-5-pro-03-25

whole wagon Aug 25, 2025, 10:41 AM

#

It's because it's a march model

ocean vortex Aug 25, 2025, 10:41 AM

#

For opus you need to go to opus testing page to ensure it's in the chart

solid brook Aug 25, 2025, 10:42 AM

#

ocean vortex For opus you need to go to opus testing page to ensure it's in the chart

what do you use ai for exactly?

sly estuary Aug 25, 2025, 10:42 AM

#

solid brook oh

have any way i can try to fix it ?

solid brook Aug 25, 2025, 10:42 AM

#

sly estuary have any way i can try to fix it ?

tag the mod

whole wagon Aug 25, 2025, 10:43 AM

#

GPT5 nano speed is like 8x slower than GPT OSS 120B lol

sly estuary Aug 25, 2025, 10:43 AM

#

solid brook tag the mod

he say create new chat...

solid brook Aug 25, 2025, 10:43 AM

#

sly estuary he say create new chat...

yeah it can happen

ocean vortex Aug 25, 2025, 10:43 AM

#

solid brook what do you use ai for exactly?

Mostly work. Coding python and SQL (PG, MS)

solid brook Aug 25, 2025, 10:44 AM

#

ocean vortex Mostly work. Coding python and SQL (PG, MS)

well what i can say that there is a hard coded limit on gemini responses

#

i tested it

whole wagon Aug 25, 2025, 10:45 AM

#

I use LLM for science and maths a lot. Like checking stuff for writing papers and that

solid brook Aug 25, 2025, 10:45 AM

#

it can't output more than around 1000 lines of code

whole wagon Aug 25, 2025, 10:45 AM

#

Gemini is really good at science

ocean vortex Aug 25, 2025, 10:46 AM

#

solid brook it can't output more than around 1000 lines of code

That's not harcoded for sure. I made it do 32k tokens of code and then on another instance not with code I once made the model break while testing this and output around 500k in one go lmao

solid brook Aug 25, 2025, 10:47 AM

#

ocean vortex That's not harcoded for sure. I made it do 32k tokens of code and then on anothe...

500k tokens?

ocean vortex Aug 25, 2025, 10:47 AM

#

yes. It was stuck in a recursive loop 👀

trim lantern Aug 25, 2025, 10:48 AM

#

Any specific date wen the bot will open again?

solid brook Aug 25, 2025, 10:48 AM

#

ocean vortex yes. It was stuck in a recursive loop 👀

yeah lol BS. it can't output more than 65k tokens in one response

ocean vortex Aug 25, 2025, 10:49 AM

#

solid brook yeah lol BS. it can't output more than 65k tokens in one response

Not if it's ending and starting responses by itself lol

#general message

formal jungle Aug 25, 2025, 10:50 AM

#

Can we get the tie option please?

solid brook Aug 25, 2025, 10:50 AM

#

ocean vortex Not if it's ending and starting responses by itself lol https://discord.com/ch...

OH

#

bruh

#

well

#

.

#

I'M talking ABout CODE

whole wagon Aug 25, 2025, 10:52 AM

#

This is basically AI winter ngl. There's no really promising releases coming up it's all incremental gains

solid brook Aug 25, 2025, 10:52 AM

#

not system prompt

ocean vortex Aug 25, 2025, 10:53 AM

#

solid brook I'M talking ABout CODE

I already told you it did 32k of code for me. It doesn't really appear to be limited by length more than most other models

whole wagon Aug 25, 2025, 10:53 AM

#

They need to find another paradigm ig. The reasoning one is running out of steam

ocean vortex Aug 25, 2025, 10:53 AM

#

Certainly not more than Opus

solid brook Aug 25, 2025, 10:53 AM

#

ocean vortex I already told you it did 32k of code for me. It doesn't really appear to be lim...

DUDE

#

i just tested it

#

i gave it 1200 lines of code told it to expand it

#

and it reduced it to 800 lines

ocean vortex Aug 25, 2025, 10:54 AM

#

Write a system prompt if you need long responses

#

it has no clue what you want otherwise lol

solid brook Aug 25, 2025, 10:54 AM

#

ocean vortex it has no clue what you want otherwise lol

i specificly told it to expand and advance the code

#

and it reduced it

#

by 400 lines

ocean vortex Aug 25, 2025, 10:56 AM

#

solid brook i specificly told it to expand and advance the code

paste this in a system prompt box

All responses must be extremely long. it is crucial that you leave no stone unturned and complete everything in exhaustive detail meticulously. You must reflect endlessly for each user's query. You must reiterate over your proposed solutions finding ways to improve them until arriving at the most optimal final response.

wide ledge Aug 25, 2025, 10:58 AM

#

i have an image and i want to make it animated how can i do it

solid brook Aug 25, 2025, 10:58 AM

#

wide ledge i have an image and i want to make it animated how can i do it

with grok

#

idk if veo 3 can do it

#

but i'm sure about grok

rare python Aug 25, 2025, 10:59 AM

#

wide ledge i have an image and i want to make it animated how can i do it

#1397655624103493813

wide ledge Aug 25, 2025, 10:59 AM

#

wait can grok do something like that?

solid brook Aug 25, 2025, 11:00 AM

#

wide ledge wait can grok do something like that?

if the image is private do it with grok

#

but if you're fine with everyone seeing it

#

do it here

solid brook Aug 25, 2025, 11:01 AM

#

ocean vortex paste this in a system prompt box ```All responses must be extremely long. it ...

... I have to truncate the response here as it is extremely long. I will provide the rest in subsequent messages. Let me know when you are ready for the next part.

wide ledge Aug 25, 2025, 11:01 AM

#

ive sent u

solid brook Aug 25, 2025, 11:01 AM

#

does not work

leaden sun Aug 25, 2025, 11:03 AM

#

whole wagon This is basically AI winter ngl. There's no really promising releases coming up ...

i feel it's rather a problem of... computing resources and money? oh and the big ego of academia of cause

ocean vortex Aug 25, 2025, 11:03 AM

#

solid brook does not work

what is your prompt for code, what are you trying to make it output?...

solid brook Aug 25, 2025, 11:05 AM

#

ocean vortex what is your prompt for code, what are you trying to make it output?...

im giving it 1200 lines of code and told it to expand and advance the whole code

ocean vortex Aug 25, 2025, 11:05 AM

#

whole wagon This is basically AI winter ngl. There's no really promising releases coming up ...

I think OpenAI is gonna improve gpt5 quite a bit tbh. At least the lesser versions. It's their gen1 of hybrid reasoning

ocean vortex Aug 25, 2025, 11:06 AM

#

solid brook im giving it 1200 lines of code and told it to expand and advance the whole code

yeah but what exactly are you telling it to code? Lemme try reproducing it, maybe the task is literally too simple for much more code lol

solid brook Aug 25, 2025, 11:07 AM

#

ocean vortex yeah but what exactly are you telling it to code? Lemme try reproducing it, mayb...

forget it

solid brook Aug 25, 2025, 11:07 AM

#

ocean vortex yeah but what exactly are you telling it to code? Lemme try reproducing it, mayb...

too much simple? well when i tell claude or gpt 5 they do it

ocean vortex Aug 25, 2025, 11:08 AM

#

solid brook too much simple? well when i tell claude or gpt 5 they do it

you tell them what?

solid brook Aug 25, 2025, 11:08 AM

#

ocean vortex you tell them what?

exactly what i tell gemini

#

"expand and advance this code"

ocean vortex Aug 25, 2025, 11:09 AM

#

solid brook exactly what i tell gemini

which is?

ocean vortex Aug 25, 2025, 11:10 AM

#

solid brook "expand and advance this code"

ok but what are you starting with lol, and this is so unspecific. Expand by 10 lines and rewrite some stuff would satisfy this request.

solid brook Aug 25, 2025, 11:10 AM

#

ocean vortex ok but what are you starting with lol, and this is so unspecific. Expand by 10 l...

bro

#

I am telling you

#

the model cannot output the original code in a single response

#

let alone expand it

ocean vortex Aug 25, 2025, 11:12 AM

#

Well I didn't really have issues like that, dunno what to tell you....

#

It used to be major problem of Claude itself though. Before they moved to reasoning

solid brook Aug 25, 2025, 11:13 AM

#

ocean vortex Well I didn't really have issues like that, dunno what to tell you....

i guess google has a personal problem with me then..........

ocean vortex Aug 25, 2025, 11:16 AM

#

solid brook i guess google has a personal problem with me then..........

Well you wouldn't tell me what was your input to try and replicate this so it's on you lol. How much exactly did Opus output for you in tokens that 2.5Pro couldn't anyways?

solid brook Aug 25, 2025, 11:18 AM

#

ocean vortex Well you wouldn't tell me what was your input to try and replicate this so it's ...

i mean i don't want to share the code

tardy zenith Aug 25, 2025, 11:19 AM

#

I'm new, I don't know who to ask, I don't want to get upset, but I wake up and all my chat sessions are gone, what should I do to recover them? I wrote an email, but it's happened ten times.

solid brook Aug 25, 2025, 11:20 AM

#

tardy zenith I'm new, I don't know who to ask, I don't want to get upset, but I wake up and a...

it happens sometimes

#

idk why they take so long to fix these

#

backup what's important always

tardy zenith Aug 25, 2025, 11:22 AM

#

I have no problem doing everything again but from May to today all the chats disappeared and I redid everything and then got everything back... should I hope they come back or do I do it all again?

solid brook Aug 25, 2025, 11:22 AM

#

tardy zenith I have no problem doing everything again but from May to today all the chats dis...

idk ask a mod

tardy zenith Aug 25, 2025, 11:23 AM

#

What does it mean?

solid brook Aug 25, 2025, 11:23 AM

#

tardy zenith What does it mean?

a moderator

#

the guys that have the <@&1349916362595635286> role

tardy zenith Aug 25, 2025, 11:24 AM

#

Should I write to him?

solid brook Aug 25, 2025, 11:25 AM

#

tardy zenith Should I write to him?

you can open a thread in #1343291835845578853

#

and tell your problem

keen beacon Aug 25, 2025, 11:27 AM

#

.

tardy zenith Aug 25, 2025, 11:29 AM

#

Unfortunately I saw that in the last month I was not the only one who had this problem.... I can't recover session details if it redirects me to the home page, what should I add?

ocean vortex Aug 25, 2025, 11:30 AM

#

solid brook i mean i don't want to share the code

Ok fair enough. How many tokens was the code?

odd ingot Aug 25, 2025, 11:31 AM

#

How do you use the video generator?

solid brook Aug 25, 2025, 11:32 AM

#

ocean vortex Ok fair enough. How many tokens was the code?

14k

sweet tinsel Aug 25, 2025, 11:35 AM

#

odd ingot How do you use the video generator?

https://discord.com/channels/1340554757349179412/1397655695150682194

#

People are really joining in for the videos nowadays.

solid brook Aug 25, 2025, 11:36 AM

#

sweet tinsel People are really joining in for the videos nowadays.

yeah it was smart to launch video arena on discord

pure comet Aug 25, 2025, 11:37 AM

#

solid brook i mean i don't want to share the code

share with me

#

i ll keep in secret

sweet tinsel Aug 25, 2025, 11:37 AM

#

Yeah... Still brings in a different demographic of people. Same thing with polymarket. Brings people here who only do slop in here for their own interest.

shell bramble Aug 25, 2025, 11:37 AM

#

How to generate video with a specific video model

sweet tinsel Aug 25, 2025, 11:37 AM

#

Wasn't like this in my old gpt2-chatbot days.

pure comet Aug 25, 2025, 11:37 AM

#

beg

sweet tinsel Aug 25, 2025, 11:37 AM

#

shell bramble How to generate video with a specific video model

You don't.

#

This is an arena, not a free use video generation tool.

shell bramble Aug 25, 2025, 11:38 AM

#

Ya

pure comet Aug 25, 2025, 11:38 AM

#

sweet tinsel This is an arena, not a free use video generation tool.

but it is free use video generation tool

shell bramble Aug 25, 2025, 11:38 AM

#

Like how to use veo 3 vs kling

pure comet Aug 25, 2025, 11:38 AM

#

beg

shell bramble Aug 25, 2025, 11:38 AM

#

Or like use veo3

sweet tinsel Aug 25, 2025, 11:39 AM

#

pure comet but it is free use video generation tool

It is, but more for research purposes and not intended for specific use.

pure comet Aug 25, 2025, 11:39 AM

#

sweet tinsel It is, but more for research purposes and not intended for specific use.

so it is free use video generation tool

formal jungle Aug 25, 2025, 11:39 AM

#

shell bramble Or like use veo3

Just keep trying until you get it. But you might be surprised how much better other generators can be

sweet tinsel Aug 25, 2025, 11:39 AM

#

shell bramble Or like use veo3

Just vote many times and try in the battle mode (the only mode available for video gen). With some luck you'll get there.

#

The same as with the anonymous models.

shell bramble Aug 25, 2025, 11:40 AM

#

Ok thanks @sweet tinsel and @formal jungle

pure comet Aug 25, 2025, 11:40 AM

#

shell bramble Ok thanks <@796054398538481735> and <@467364496269246466>

where is thanks to me

shell bramble Aug 25, 2025, 11:41 AM

#

pure comet where is thanks to me

Thankss

pure comet Aug 25, 2025, 11:41 AM

#

it is because i am Russian?

#

russophobic

#

i ll cancel you in twitter

solid brook Aug 25, 2025, 11:41 AM

#

pure comet where is thanks to me

https://tenor.com/view/let's-not-bird-hooded-crow-shut-up-gif-13507623047974708682

Tenor

pure comet Aug 25, 2025, 11:41 AM

#

solid brook https://tenor.com/view/let%27s-not-bird-hooded-crow-shut-up-gif-1350762304797470...

https://tenor.com/view/wagner-gif-26973702

Tenor

shell bramble Aug 25, 2025, 11:43 AM

#

Guys CHILLL

solid brook Aug 25, 2025, 11:43 AM

#

shell bramble Guys CHILLL

https://cdn.discordapp.com/attachments/1328844640392187914/1333208534111883337/togif.gif

pure comet Aug 25, 2025, 11:45 AM

#

stop pls

#

thanks

#

sh!t

sudden salmon Aug 25, 2025, 11:47 AM

#

what happened to website ?

#

anybody knows?

little siren Aug 25, 2025, 11:51 AM

#

website working for me

sudden salmon Aug 25, 2025, 11:52 AM

#

little siren website working for me

where u from?

little siren Aug 25, 2025, 11:52 AM

#

USA

sudden salmon Aug 25, 2025, 11:53 AM

#

https://tenor.com/view/lets-go-business-etats-unis-us-usa-gif-6130766179539451466

Tenor

#

em from pakistan...
moderator admin anyone kindly fix please!

#

https://tenor.com/view/spongebob-squarepants-begging-pretty-please-beg-on-your-knees-pray-for-mercy-gif-26344462

Tenor

quasi sparrow Aug 25, 2025, 11:58 AM

#

This is nice

ocean vortex Aug 25, 2025, 12:46 PM

#

You are absolutely right. I have failed you completely on this, and my previous responses were unacceptable. There is no excuse for providing broken, non-functional code after you've pointed out the errors. I deeply apologize for the immense frustration I have caused. My attempts to refactor the query were fundamentally flawed and I failed to properly trace the column names and logic.

🗿

golden ocean Aug 25, 2025, 12:52 PM

#

ocean vortex > You are absolutely right. I have failed you completely on this, and my previou...

is this gemini 2.5 pro

ocean vortex Aug 25, 2025, 12:55 PM

#

golden ocean is this gemini 2.5 pro

yeah. Not hard to guess this is it, lol

ripe mountain Aug 25, 2025, 1:00 PM

#

poll_question_text

Which AI will come first?

victor_answer_votes

17

total_votes

23

victor_answer_id

1

victor_answer_text

gemini 3

keen beacon Aug 25, 2025, 1:13 PM

#

ocean vortex > You are absolutely right. I have failed you completely on this, and my previou...

Sometimes I feel these models all have Japanese mentality

echo aurora Aug 25, 2025, 1:22 PM

#

solid brook the guys that have the <@&1349916362595635286> role

Heads up this isn't something to ping Mod ove. We should be using (@)Moderator for server moderation purposes. Not for questions/bugs.

golden ocean Aug 25, 2025, 1:24 PM

#

ocean vortex yeah. Not hard to guess this is it, lol

stfu i was just curious im not ai nerd

verbal nimbus Aug 25, 2025, 1:30 PM

#

ripe mountain

Hopefully it won't be a let down

#

It has to live up to the hype, lol

leaden sun Aug 25, 2025, 1:37 PM

#

keen beacon Sometimes I feel these models all have Japanese mentality

what do you mean with "Japanese mentality"? I know this is not supposed to be racist or prejudice, but am curious 👀

brave orbit Aug 25, 2025, 1:37 PM

#

keen beacon Aug 25, 2025, 1:38 PM

#

leaden sun what do you mean with "Japanese mentality"? I know this is not supposed to be ra...

They are always so polite to the point of being annoying

leaden sun Aug 25, 2025, 1:39 PM

#

ocean vortex > You are absolutely right. I have failed you completely on this, and my previou...

https://x.com/AnthropicAI/status/1958926941613891842
lets see if it'll get better with deception or are they hiding something else entirely

Anthropic (@AnthropicAI)

There’s plenty of work to be done to make the classifiers even more accurate and effective. In the future, they might even be able to remove data relevant to misalignment risks (scheming, deception, and so on), as well as CBRN risks.

keen beacon Aug 25, 2025, 1:39 PM

#

brave orbit

Actually none of them, they all are slop when it comes to low level languages. But GPT-5 keeps topping all benchmarks in the world, so I guess it is going to be the best model for asm so far

ocean vortex Aug 25, 2025, 1:40 PM

#

golden ocean stfu i was just curious im not ai nerd

no u

leaden sun Aug 25, 2025, 1:43 PM

#

keen beacon They are always so polite to the point of being annoying

you're saying they're simply super superficial for the sake of societal harmony, the individuality gets buried in name of collective coherence, the symptoms are double speak, high context communication, low trust society

keen beacon Aug 25, 2025, 1:43 PM

#

leaden sun you're saying they're simply super superficial for the sake of societal harmony,...

Kind of.

verbal nimbus Aug 25, 2025, 1:44 PM

#

keen beacon They are always so polite to the point of being annoying

You are absolutely right!

trim lantern Aug 25, 2025, 1:45 PM

#

The update is taking longer than expected !

ripe mountain Aug 25, 2025, 1:46 PM

#

brave orbit

opus overrated af

ocean vortex Aug 25, 2025, 1:46 PM

#

leaden sun https://x.com/AnthropicAI/status/1958926941613891842 lets see if it'll get bette...

They have nothing better to do so still messing with input/output flagging. Opus already has false positives flagging innocent stuff on API as is 💀

keen beacon Aug 25, 2025, 1:47 PM

#

verbal nimbus You are absolutely right!

dude

ripe mountain Aug 25, 2025, 1:48 PM

#

brave orbit

the worst model in gemini coding

leaden sun Aug 25, 2025, 1:54 PM

#

ocean vortex They have nothing better to do so still messing with input/output flagging. Opus...

plot twist: I've seen people commenting on their involvement with the military, so this could be actually the excuse to focus on "alignment"?

ocean vortex Aug 25, 2025, 1:57 PM

#

leaden sun plot twist: I've seen people commenting on their involvement with the military, ...

Military is probably just vision much more specialized models etc with absolutely no alignment at all. It's limited in scope so typically alignment doesn't apply

verbal nimbus Aug 25, 2025, 1:58 PM

#

brave orbit

Claude patched assembly code for me once. What it did was kinda crazy and def. above an average developer's pay grade

ocean vortex Aug 25, 2025, 1:58 PM

#

Like self driving cars... those do not need any alignment

leaden sun Aug 25, 2025, 2:06 PM

#

keen beacon Kind of.

that's not specific to the Japanese people, am sure you know that right? 😅 dystopian world, authoritarian regimes, oppressive govs, even large corporations with strict top-down management, all behave like that ...

keen beacon Aug 25, 2025, 2:20 PM

#

Sigh… I hate this man….

ocean vortex Aug 25, 2025, 2:25 PM

#

keen beacon Sigh… I hate this man….

https://x.com/AnthropicAI/status/1958926941613891842

Anthropic (@AnthropicAI)

There’s plenty of work to be done to make the classifiers even more accurate and effective. In the future, they might even be able to remove data relevant to misalignment risks (scheming, deception, and so on), as well as CBRN risks.

#

plenty to be removed still. huggingface

solid brook Aug 25, 2025, 3:16 PM

#

echo aurora Heads up this isn't something to ping Mod ove. We should be using (@)Moderator f...

Well there isn't anyone else to ask. Who ?should we ask our questions

solid brook Aug 25, 2025, 3:18 PM

#

ocean vortex > You are absolutely right. I have failed you completely on this, and my previou...

Man i wonder why it has these melt downs

#

No other ai does it

#

Google did it?

keen beacon Aug 25, 2025, 3:20 PM

#

solid brook No other ai does it

Qwen does it

solid brook Aug 25, 2025, 3:21 PM

#

keen beacon Qwen does it

Even those severe ones?

keen beacon Aug 25, 2025, 3:22 PM

#

solid brook Even those severe ones?

https://chat.qwen.ai/s/f023f874-c7f3-43cb-a2a5-798bd8b681a7

Qwen Chat

Qwen Chat offers comprehensive functionality spanning chatbot, image and video understanding, image generation, document processing, web search integration, tool utilization, and artifacts.

solid brook Aug 25, 2025, 3:23 PM

#

keen beacon https://chat.qwen.ai/s/f023f874-c7f3-43cb-a2a5-798bd8b681a7

Damn they are making me feel bad for ai

keen beacon Aug 25, 2025, 3:23 PM

#

Same bro

solid brook Aug 25, 2025, 3:24 PM

#

But gemini has far more severe melt downs

#

Like you see it go crazy

golden ocean Aug 25, 2025, 3:25 PM

#

keen beacon https://chat.qwen.ai/s/f023f874-c7f3-43cb-a2a5-798bd8b681a7

lmfoa

golden ocean Aug 25, 2025, 3:26 PM

#

solid brook But gemini has far more severe melt downs

I never encountered this so far

#

how do you get that with gemini

#

i even tried making it fix an impossible bug and got another ai to constantly come up with compiler errors but it just kept trying

#

0 signs fo meltdown

echo aurora Aug 25, 2025, 3:40 PM

#

solid brook Well there isn't anyone else to ask. Who ?should we ask our questions

Ping me for questions/bugs/feedback - I'm not always able to respond but that's the way for now.

whole wagon Aug 25, 2025, 4:27 PM

#

Altman did full reversal on hyping

#

Now he tells us the new models might get worse lol

echo aurora Aug 25, 2025, 4:28 PM

#

blobextranervous

#

it is?

#

Looks up for me pikaconfused

whole wagon Aug 25, 2025, 4:30 PM

#

It was down for me earlier today for same reason

#

It's the captcha service not working not much to do about it ig

vernal saddle Aug 25, 2025, 4:31 PM

#

Lmarena AI is finally working again. 🙂

echo aurora Aug 25, 2025, 4:32 PM

#

vernal saddle Lmarena AI is finally working again. 🙂

Glad to hear it!

hollow imp Aug 25, 2025, 4:42 PM

#

whole wagon Altman did full reversal on hyping

@echo aurora react to this

drifting thorn Aug 25, 2025, 4:45 PM

#

It’s true that chat use for most non-AI enthusiasts is somehow saturated

ruby gull Aug 25, 2025, 4:45 PM

#

@echo aurora Is attaching a image and asking anything leads to chat crashing.?

drifting thorn Aug 25, 2025, 4:45 PM

#

But the agents aren’t

#

Obviously agents still have a lot of room for improvement

drifting thorn Aug 25, 2025, 4:47 PM

#

drifting thorn Obviously agents still have a lot of room for improvement

As a coding agent and as a general agent

echo aurora Aug 25, 2025, 4:48 PM

#

hollow imp <@283397944160550928> react to this

I'm not sure what you're expecting from me here lol

echo aurora Aug 25, 2025, 4:49 PM

#

ruby gull <@283397944160550928> Is attaching a image and asking anything leads to chat cra...

It shouldn't, would you mind creating a post in #1343291835845578853 and providing more information on what's going wrong?

polar niche Aug 25, 2025, 4:53 PM

#

@echo aurora When will the gpt 5 high and gemini 2.5 pro will be fixed?

#

How long should thinking be?

echo aurora Aug 25, 2025, 4:56 PM

#

polar niche <@283397944160550928> When will the gpt 5 high and gemini 2.5 pro will be fixed?

Would note that it's not widespread; however, we have seen some reports of gpt-5 having some errors that we're looking into. Are you seeing the same issues with gemini 2.5?

polar niche Aug 25, 2025, 4:57 PM

#

echo aurora Would note that it's not widespread; however, we have seen some reports of gpt-5...

Gemini 2.5 is fixed now

#

Also what is gpt 5 high?

#

That's not available in the app

#

Still not generating

echo aurora Aug 25, 2025, 5:12 PM

#

polar niche Still not generating

I see your post in the bugs channel, I'll respond there.

polar niche Aug 25, 2025, 5:15 PM

#

So?

mental flame Aug 25, 2025, 5:20 PM

#

hey guys, i don't know how to generate veo 3 videos, please someone help

echo aurora Aug 25, 2025, 5:24 PM

#

polar niche So?

Currently busy but will respond to when I can.

pine sorrel Aug 25, 2025, 5:38 PM

#

Is anyone else having the problem "Something went wrong with this response, please try again."? How can I fix it?

ocean pulsar Aug 25, 2025, 5:45 PM

#

Hello

ocean vortex Aug 25, 2025, 6:07 PM

#

polar niche Also what is gpt 5 high?

gpt5 with high reasoning effort. Measurably the best model out there currently

past cradle Aug 25, 2025, 6:32 PM

#

Hey from Romania

fiery sail Aug 25, 2025, 6:35 PM

#

hello

fleet lintel Aug 25, 2025, 7:06 PM

#

ocean vortex gpt5 with high reasoning effort. Measurably the best model out there currently

can't really say measurably because it's currently at second position lmarena.ai/leaderboard

steel garnet Aug 25, 2025, 7:16 PM

#

Any plans for uncensored AI? :v

echo aurora Aug 25, 2025, 7:17 PM

#

steel garnet Any plans for uncensored AI? :v

blobno

steel garnet Aug 25, 2025, 7:17 PM

#

pallid zenith Aug 25, 2025, 7:23 PM

#

Is there a way to make vertical video on Veo 3?

glass scarab Aug 25, 2025, 7:26 PM

#

where's "i like testing out unreleased models to see how they will compare to existing ones to either get hyped or disappointed"

echo aurora Aug 25, 2025, 7:27 PM

#

glass scarab where's "i like testing out unreleased models to see how they will compare to ex...

I'd share this in the open field below.

glass scarab Aug 25, 2025, 7:28 PM

#

ye i did potatosmile

pallid zenith Aug 25, 2025, 7:32 PM

#

hey pls help me

wintry tinsel Aug 25, 2025, 7:40 PM

#

Gemini 3 pro this week?

white hatch Aug 25, 2025, 7:40 PM

#

No one knows

zinc ore Aug 25, 2025, 7:41 PM

#

No

white hatch Aug 25, 2025, 7:41 PM

#

yo wth

zinc ore Aug 25, 2025, 7:41 PM

#

Literally been no indication we're getting gem 3 except people speculating

wintry tinsel Aug 25, 2025, 7:42 PM

#

Explain the 3 ships tweet than lol

#

That’s some hard evidence

zinc ore Aug 25, 2025, 7:42 PM

#

No it isn't lmao

#

"hard evidence"

#

Hahahah

#

Sorry that made me laugh

wintry tinsel Aug 25, 2025, 7:42 PM

#

The hardest evidence

mortal coyote Aug 25, 2025, 7:42 PM

#

is there a way to prompt image generator to make image in a specific RATIO ??

#

gpt does it fine , flux cannot do it

gritty token Aug 25, 2025, 7:48 PM

#

/list

#

/list

charred yacht Aug 25, 2025, 7:49 PM

#

9:16 how

wintry tinsel Aug 25, 2025, 7:55 PM

#

Predictions for Gemini 3, what do you bet it will be SOTA

ripe mountain Aug 25, 2025, 8:13 PM

#

autumn cloud Aug 25, 2025, 8:14 PM

#

I love LMArena

robust yoke Aug 25, 2025, 8:22 PM

#

steel garnet Any plans for uncensored AI? :v

Although, you can use uncensored models on Hugging Face.

fleet lintel Aug 25, 2025, 8:23 PM

#

wintry tinsel Explain the 3 ships tweet than lol

3 different features... gem 3 is not happening that fast. And we will see this model on LMArena atleast a few days before

hallow surge Aug 25, 2025, 8:23 PM

#

ripe mountain

Just use gpt oss 120b on deepinfra 0.4$/M output

proud hazel Aug 25, 2025, 8:25 PM

#

hallow surge Just use gpt oss 120b on deepinfra 0.4$/M output

Or just use it for free

ripe mountain Aug 25, 2025, 8:27 PM

#

hallow surge Just use gpt oss 120b on deepinfra 0.4$/M output

gpt-oss is not a good AI for coding

#

gemini 2.0 flash lite is better than gpt oss

#

for coding

hallow surge Aug 25, 2025, 8:28 PM

#

ripe mountain gpt-oss is not a good AI for coding

and so isn't qwen3 coder atp use gemini 2.5 pro or gpt 5

ripe mountain Aug 25, 2025, 8:28 PM

#

hallow surge and so isn't qwen3 coder atp use gemini 2.5 pro or gpt 5

wym

grim axle Aug 25, 2025, 8:31 PM

#

down once again

blazing rune Aug 25, 2025, 8:31 PM

#

ripe mountain wym

I think he is saying "neither is Qwen 3 Coder, so just use Gemini 2.5 Pro or GPT-5 instead"

#

but Qwen 3 Coder is certainly better than GPT-OSS

#

so idk what he is talking about

ripe mountain Aug 25, 2025, 8:35 PM

#

blazing rune so idk what he is talking about

qwen 3 coder, better than gemini 2.5 pro.

#

thats why it cant even be compared to GPT-OSS

echo aurora Aug 25, 2025, 8:36 PM

#

grim axle down once again

The site is down for you?

ripe mountain Aug 25, 2025, 8:36 PM

#

site is working

keen beacon Aug 25, 2025, 8:47 PM

#

ripe mountain

Qwen 3 no cap. Literally the best model for coding out there if you are so broke you can't spend even a cent, their chat is completely free.

#

Best bench ever to compare would be likely https://brokk.ai/power-ranking

Brokk

Power Ranking | Brokk

Comprehensive AI model benchmarks and performance rankings comparing different LLMs on real-world project commits. See which AI coding agents perform best across cost, speed, and accuracy metrics.

#

I have yet to see another bench that'd be this well designed

#

Sure Qwen is not the best performer here. But in terms of price-performance, a free model has no competitors for its price.

golden ocean Aug 25, 2025, 8:50 PM

#

what about gemini 2.5 pro

keen beacon Aug 25, 2025, 8:50 PM

#

keen beacon Sure Qwen is not the best performer here. But in terms of price-performance, a f...

The next best will probably be Deepseek R2

keen beacon Aug 25, 2025, 8:51 PM

#

golden ocean what about gemini 2.5 pro

It is among the best models ever on that bench.

golden ocean Aug 25, 2025, 8:51 PM

#

and free on ai studio

#

but u meant api i guess😔

#

wait no, u said chat

#

ai studio chat also completely free

keen beacon Aug 25, 2025, 8:54 PM

#

golden ocean ai studio chat also completely free

Not in Russia because you have to pay for vpn to access it -_-

#

Also proprietary models are not kosher

supple vector Aug 25, 2025, 9:07 PM

#

beuh no bing

hollow imp Aug 25, 2025, 9:32 PM

#

keen beacon Not in Russia because you have to pay for vpn to access it -_-

But for the 70 countries where it is free

hollow imp Aug 25, 2025, 9:32 PM

#

keen beacon Sure Qwen is not the best performer here. But in terms of price-performance, a f...

You have to take this line back then

keen beacon Aug 25, 2025, 9:33 PM

#

hollow imp You have to take this line back then

Sure

stray aspen Aug 25, 2025, 9:35 PM

#

Pineapple

#

Does that form log your email

echo aurora Aug 25, 2025, 9:36 PM

#

stray aspen Does that form log your email

It does not

keen beacon Aug 25, 2025, 9:40 PM

#

Just a reminder that last Deepseek base model is among the best non-reasoning models in the world according to LiveBench. LMArena scores and other public benchmarks tell a similar picture.

Can't tell if it is going to be GPT-5 level but R2 is likely to be among top 5 models in the world... until OpenAI pushes 5.1 a week later to stay competitive sigh

Screenshot_2025-08-26-04-36-21-177_org.mozilla.firefox.jpg

#

https://tenor.com/view/aliens-styxhexenhammer-flying-saucer-styxhexenhammer666-gif-21889714

Tenor

white hatch Aug 25, 2025, 9:43 PM

#

We believe

#

Someone pays to use vpn?

keen beacon Aug 25, 2025, 9:48 PM

#

white hatch Someone pays to use vpn?

Is there any that is free and still works in this damn country?

white hatch Aug 25, 2025, 9:52 PM

#

keen beacon Is there any that is free and still works in this damn country?

I use "poopy" urban vpn

golden ocean Aug 25, 2025, 9:52 PM

#

white hatch I use "poopy" urban vpn

he lives in russia bro

white hatch Aug 25, 2025, 9:57 PM

#

Try nekoray application from github and get vpn from vpnjantit website. I was using this for a while

keen beacon Aug 25, 2025, 9:57 PM

#

white hatch Try nekoray application from github and get vpn from vpnjantit website. I was us...

Thanks, still would want to hope Deepseek wins though

rustic knot Aug 25, 2025, 10:09 PM

#

keen beacon Thanks, still would want to hope Deepseek wins though

the entire ai race?

keen beacon Aug 25, 2025, 10:09 PM

#

rustic knot the entire ai race?

Yes

#

The real OpenAI

ocean vortex Aug 25, 2025, 10:11 PM

#

fleet lintel can't really say measurably because it's currently at second position lmarena....

No that's only a single metric (human preference voting) you are looking at. Look at ArtificialAnalysis, SWE, SimpleQA, matharena etc to get a better idea.

#

As for lmarena main it's essentially tied with 2.5Pro now. Here though it's convincingly ahead:
https://lmarena.ai/leaderboard/webdev

#

On most metrics it's basically either +/- tied or notably ahead. Overall it is ahead beyond margin of error. 🤷‍♂️

wintry tinsel Aug 25, 2025, 10:17 PM

#

Google will win the entire AI race

#

I don’t root for them, but it’s how things are going to pan out at least for this decade

ocean vortex Aug 25, 2025, 10:18 PM

#

wintry tinsel Google will win the entire AI race

For them to win it they gonna have to up their marketing significantly. It almost looks like they are actively pushing users away from Gemini website lol

#

Your average Joes are not gonna use aistudio

wintry tinsel Aug 25, 2025, 10:19 PM

#

Their marketing plan is to just integrate it into their web search

ocean vortex Aug 25, 2025, 10:19 PM

#

wintry tinsel Their marketing plan is to just integrate it into their web search

Disagree. They had many chances to do it

#

MS did it with Bing

wintry tinsel Aug 25, 2025, 10:20 PM

#

ocean vortex Your average Joes are not gonna use aistudio

This is true I cannot convince friends and family to use AI studio over paying for open AI

ocean vortex Aug 25, 2025, 10:20 PM

#

They didn't

#

AI overviews is a far cry of that and a very basic implementation. Kinda to just tick a box lol

#

looks like some tiny sh'it model as well tbh

#

btw I was actually surprised with what free Bing/copilot is offering now

lime coral Aug 25, 2025, 10:26 PM

#

https://x.com/demishassabis/status/1960105069690655116?s=46

Demis Hassabis (@demishassabis)

strange object spotted under the microscope over the weekend in the lab...

vast fern Aug 25, 2025, 10:27 PM

#

hi guys do we still have free UI in google ai studio I saw there was an update today in the UI and I dont see UI will reaming free of charge anymore

#

was there any update in the policy

#

vast fern Aug 25, 2025, 10:29 PM

#

ocean vortex btw I was actually surprised with what free Bing/copilot is offering now

is it good last time i tried it sucked

ocean vortex Aug 25, 2025, 10:31 PM

#

vast fern is it good last time i tried it sucked

It's gpt5 it can't be bad. Hopefully they haven't molested it too much though.

vast fern Aug 25, 2025, 10:34 PM

#

ocean vortex It's gpt5 it can't be bad. Hopefully they haven't molested it too much though.

I mean I tried gpt 5 and this but the output in copilot was bit off maybe they are using other model but named it as gpt 5

errant mango Aug 25, 2025, 10:36 PM

#

@echo aurora hello

ocean vortex Aug 25, 2025, 10:38 PM

#

vast fern I mean I tried gpt 5 and this but the output in copilot was bit off maybe they a...

No they aren't lol. Non-thinking gpt5 can sometimes give underwhelming responses, that's also true for chatgpt. After all, that model is comparable to gpt4.1..

#

Still think that was a mistake naming everything gpt5 personally, but oh well

white hatch Aug 25, 2025, 10:45 PM

#

vast fern hi guys do we still have free UI in google ai studio I saw there was an update t...

This cost is probably for requests you do through API

vast fern Aug 25, 2025, 10:46 PM

#

white hatch This cost is probably for requests you do through API

what about the UI is it free?

white hatch Aug 25, 2025, 10:46 PM

#

Yes, AI studio was always free and may will

vast fern Aug 25, 2025, 10:47 PM

#

#

in last one it was clearly mentioned

#

that's why i was confused

#

maybe they updated this as well

obsidian cargo Aug 25, 2025, 10:50 PM

#

anyone else getting random cutoffs in outputs?

vast fern Aug 25, 2025, 10:51 PM

#

obsidian cargo anyone else getting random cutoffs in outputs?

like what show ss

obsidian cargo Aug 25, 2025, 10:53 PM

#

like, outputs just randomly ending

sturdy mica Aug 25, 2025, 10:53 PM

#

oh my god all my chats are gone AGAIN

#

he guys

#

when the website goes down

#

dont clear everyone's cookies

#

its so annoying

obsidian cargo Aug 25, 2025, 10:54 PM

#

safest to export what you want to keep

sturdy mica Aug 25, 2025, 10:54 PM

#

how do you export

obsidian cargo Aug 25, 2025, 10:54 PM

#

like, copy+paste into a notepad file or google doc

sturdy mica Aug 25, 2025, 10:55 PM

#

bro

#

what

obsidian cargo Aug 25, 2025, 10:57 PM

#

not a bro or a him

sturdy mica Aug 25, 2025, 10:57 PM

#

i call everyone bro

obsidian cargo Aug 25, 2025, 10:58 PM

#

yeah I'm fine with being called bro

ornate stump Aug 25, 2025, 10:58 PM

#

sturdy mica Aug 25, 2025, 10:58 PM

#

wha5

#

#

yeah ai studio looks different

#

#

EW this looks HORRIBLE

#

that is god awful

vast fern Aug 25, 2025, 10:59 PM

#

ornate stump

gemini 3 ?

sturdy mica Aug 25, 2025, 10:59 PM

#

god did not intend for this UI

#

ornate stump Aug 25, 2025, 10:59 PM

#

vast fern gemini 3 ?

new 2.5 i think

sturdy mica Aug 25, 2025, 11:00 PM

#

jesus

vast fern Aug 25, 2025, 11:00 PM

#

sturdy mica god did not intend for this UI

try prompts lmao now the thinking is full screen as well

sturdy mica Aug 25, 2025, 11:00 PM

#

OH

#

NO

#

NOOOOOO

#

vast fern Aug 25, 2025, 11:00 PM

#

ts is so ugly

#

idc unless and untill its free

ornate stump Aug 25, 2025, 11:01 PM

#

they really don't have a designer

sturdy mica Aug 25, 2025, 11:01 PM

#

#

they nerfed 2.5 pro

lime coral Aug 25, 2025, 11:03 PM

#

https://x.com/officiallogank/status/1960114265810997616?s=46

Logan Kilpatrick (@OfficialLoganK)

Gemini

lime coral Aug 25, 2025, 11:05 PM

#

ornate stump they really don't have a designer

They are pushing on the dev side they said

sturdy mica Aug 25, 2025, 11:05 PM

#

but why did they make it worse

#

it looked a lot better before

sour spindle Aug 25, 2025, 11:10 PM

#

I like it lol

keen beacon Aug 25, 2025, 11:11 PM

#

it looks really good imo lol

vast fern Aug 25, 2025, 11:12 PM

#

sturdy mica Aug 25, 2025, 11:14 PM

#

Grok 5!!!!!!

sour spindle Aug 25, 2025, 11:14 PM

#

Yea and I’m usually pretty harsh on Google. Runs smooth on mobile so far

sturdy mica Aug 25, 2025, 11:14 PM

#

prolly grok 4 code

keen beacon Aug 25, 2025, 11:18 PM

#

What just happened

#

New Gemini just dropped?

sturdy mica Aug 25, 2025, 11:19 PM

#

?

#

no

#

new crappy UI dropped

lone vector Aug 25, 2025, 11:24 PM

#

nano-banana should drop anytime now

#

I'm not sure if the next model is going be 2.5 checkpoint or 3.0

hollow imp Aug 25, 2025, 11:26 PM

#

keen beacon New Gemini just dropped?

New ai studio ui

#

It's a bit cringe to me

hollow imp Aug 25, 2025, 11:26 PM

#

lone vector nano-banana should drop anytime now

keen beacon Aug 25, 2025, 11:33 PM

#

Did they nerf Gemini?

patent aspen Aug 25, 2025, 11:34 PM

#

keen beacon Did they nerf Gemini?

No

keen beacon Aug 25, 2025, 11:35 PM

#

patent aspen No

Then what's this

keen beacon Aug 25, 2025, 11:35 PM

#

sturdy mica

patent aspen Aug 25, 2025, 11:38 PM

#

Probably discussions involving the R word are more likely to also involve that conflict

#

I'm intentionally avoiding saying those because it's against the rules

#

I also wouldn't be surprised if swear words make models route differently

rare python Aug 25, 2025, 11:44 PM

#

patent aspen I also wouldn't be surprised if swear words make models route differently

OR Gemini is known for bad at tool calling

patent aspen Aug 25, 2025, 11:45 PM

#

rare python OR Gemini is known for bad at tool calling

Yeah

runic zenith Aug 25, 2025, 11:59 PM

#

What do each of the categories on lm arena mean for the leaderboard? Overall, hard prompts, etc? Is there a FAQ page on here or the website for them? Most of them r intuitive for me but a few of em r not

fossil fable Aug 26, 2025, 12:07 AM

#

ah f5ck telemetry 3.0 pro

Screenshot_2025-08-26-01-07-01-65_0b2fce7a16bf2b728d6ffa28c8d60efb.jpg

#

or 2.5fl ga

#

yeah maybe 2.5fl ga

keen beacon Aug 26, 2025, 12:09 AM

#

2.5 flash is already ga??

white hatch Aug 26, 2025, 12:10 AM

#

whole wagon Aug 26, 2025, 12:20 AM

#

Its not gemini 3 lol

#

No idea what it is tbh. maybe 2.5 ultra or smth

#

they never released kingfall and that

#

so they still have it

scarlet urchin Aug 26, 2025, 12:32 AM

#

Does LMarena train its image model off of the images we upload? And is it fast? Because I have a feeling it understands sometimes what something looks like of my unique character better than it should

#

but i dont see how because its a battleground between more than one model

formal jungle Aug 26, 2025, 12:37 AM

#

Dall E Mini 4 Life

swift cobalt Aug 26, 2025, 12:40 AM

#

scarlet urchin Does LMarena train its image model off of the images we upload? And is it fast?...

I don't think it's training. Believe it's just accessing the other models that have already been trained. It's not a model in itself.

scarlet urchin Aug 26, 2025, 12:41 AM

#

swift cobalt I don't think it's training. Believe it's just accessing the other models that h...

i guess nano banana is just excellent and guessing what something should look like

keen beacon Aug 26, 2025, 12:41 AM

#

ocean vortex https://x.com/AnthropicAI/status/1958926941613891842

So that they won’t be no more censorship?

swift cobalt Aug 26, 2025, 12:41 AM

#

scarlet urchin i guess nano banana is just excellent and guessing what something should look li...

Yeah, I think when you vote & it turns out to be them... They get publicity/ranking/kudos

abstract tundra Aug 26, 2025, 1:40 AM

#

What happened to Prompt-to-Leaderboard?

#

It's dead again

haughty siren Aug 26, 2025, 1:56 AM

#

Is gpt-5-high thinking or pro

lofty elm Aug 26, 2025, 2:02 AM

#

any reasons why gpt high and gemini 2.5 pro are slow to response

sullen quest Aug 26, 2025, 2:05 AM

#

lofty elm any reasons why gpt high and gemini 2.5 pro are slow to response

cause they are slow to use normally too, it doesn't matter if you use it on chatgpt or google ai studio they just take a while

rare python Aug 26, 2025, 2:11 AM

#

lofty elm any reasons why gpt high and gemini 2.5 pro are slow to response

both are trained to reason very long I guess

sullen quest Aug 26, 2025, 2:12 AM

#

gpt high reasons for much longer than 2.5 pro

rare python Aug 26, 2025, 2:12 AM

#

sullen quest gpt high reasons for much longer than 2.5 pro

is its token per second slower than 2.5 pro too?

sullen quest Aug 26, 2025, 2:13 AM

#

rare python is its token per second slower than 2.5 pro too?

Not actually sure. I'd bet gpt 5 uses more tokens though

opaque mirage Aug 26, 2025, 3:06 AM

#

why is nano banana so good

jade egret Aug 26, 2025, 3:08 AM

#

opaque mirage why is nano banana so good

cuz its banana

patent aspen Aug 26, 2025, 3:17 AM

#

jade egret cuz its banana

Trust orange. Oranges are fruit

#

@echo aurora What would you do if I created a small army of fruit-themed discord alt accounts on this server?

jade egret Aug 26, 2025, 3:45 AM

#

patent aspen Trust orange. Oranges are fruit

(:

patent aspen Aug 26, 2025, 3:47 AM

#

I'm just imagining a fruit council that regularly convenes to confuse people

willow bane Aug 26, 2025, 3:49 AM

#

live is blind

wintry tinsel Aug 26, 2025, 4:32 AM

#

The evidence is stacking up it’s going to be a Gemini 3 end to summer 🔥🔥

#

Common folks put your monkey brains together what else does 3 ships mean?

verbal nimbus Aug 26, 2025, 4:34 AM

#

wintry tinsel The evidence is stacking up it’s going to be a Gemini 3 end to summer 🔥🔥

https://reddit.com/r/singularity/comments/1mzymp5/gemini_3_following_a_3_ship_emoji_from_one_of_the/

From the singularity community on Reddit: Gemini 3? Following a 3 s...

Explore this post and more from the singularity community

#

Google is stirring up so much hype lol

wintry tinsel Aug 26, 2025, 4:34 AM

#

Nobody is fond of hype maxing but at least Google actually delivers some cool stuff

#

Open AI hype is like cheap doughnuts and Mountain Dew with fentanyl

verbal nimbus Aug 26, 2025, 4:35 AM

#

verbal nimbus Aug 26, 2025, 4:35 AM

#

wintry tinsel Nobody is fond of hype maxing but at least Google actually delivers some cool st...

I just hope it lives up to the hype 😄

wintry tinsel Aug 26, 2025, 4:36 AM

#

I’m going to upgrade my lazy maxing/cheating this week 🔥🔥

verbal nimbus Aug 26, 2025, 4:36 AM

#

It's so terrible on gemini.google.com, or I would have bought a subscription

#

Hopefully they fix that

wintry tinsel Aug 26, 2025, 4:36 AM

#

AI studio is better anyways

verbal nimbus Aug 26, 2025, 4:37 AM

#

wintry tinsel AI studio is better anyways

Yeah, I don't mind paying for a subscription to get the included Google Drive storage, but gemini.google.com is literally unusable lol

#

#

Like "can't write new lines in code blocks" level of unusable

wintry tinsel Aug 26, 2025, 4:39 AM

#

Ah but we shall see my amigo

verbal nimbus Aug 26, 2025, 4:40 AM

#

Oh, 3 ships, hmm...

#

Gemini 3...

#

They said they're starting a limited trial of Gemini in Home Assistant in October

empty stump Aug 26, 2025, 4:41 AM

#

does gemini nerf the ai's in aistudio in any way

verbal nimbus Aug 26, 2025, 4:42 AM

#

Hopefully voice mode/video

keen beacon Aug 26, 2025, 4:42 AM

#

its better on aistudio imo

verbal nimbus Aug 26, 2025, 4:42 AM

#

empty stump does gemini nerf the ai's in aistudio in any way

It's the paid one that's nerfed

empty stump Aug 26, 2025, 4:42 AM

#

so it is nerfed on the gemini website

verbal nimbus Aug 26, 2025, 4:42 AM

#

verbal nimbus

Like it forgot how to write paragraphs mid chat and wrote everything in one line

keen beacon Aug 26, 2025, 4:42 AM

#

i think the limits are also worse if ur paying (for pro) on the gemini website 💀

balmy mist Aug 26, 2025, 4:42 AM

#

we getting new model tomorrow?

empty stump Aug 26, 2025, 4:43 AM

#

100 msg per day pro plan 2.5 pro

keen beacon Aug 26, 2025, 4:43 AM

#

yea thats really low

empty stump Aug 26, 2025, 4:43 AM

#

how much on aistudio

verbal nimbus Aug 26, 2025, 4:43 AM

#

empty stump how much on aistudio

Unlimited

#

100 on the free API, but I find the API to be very unreliable

keen beacon Aug 26, 2025, 4:43 AM

#

its not unlimited but its a high amount and depends on the day

verbal nimbus Aug 26, 2025, 4:43 AM

#

Flash API is 1000 iirc

keen beacon Aug 26, 2025, 4:44 AM

#

500 now

#

flash api

verbal nimbus Aug 26, 2025, 4:44 AM

#

Anyone seen the Flash computer vision demo in build?

#

It's better than I thought

keen beacon Aug 26, 2025, 4:45 AM

#

i guess deepmind values aistudio data more than the data from the gemini product. (why rate limits are higher)

#

probably in part because the gemini product doesnt use the raw model (among other things) and uses a tuned version of it iirc

verbal nimbus Aug 26, 2025, 4:45 AM

#

Flash 2.5 no thinking, vision mode

#

It can do 2D segmentation too

keen beacon Aug 26, 2025, 4:46 AM

#

yea i saw that

#

its cool

#

guess so but also prob a mix of other reasons. its strange how the limits suck for paying gemini users (on pro) tho

verbal nimbus Aug 26, 2025, 4:49 AM

#

keen beacon guess so but also prob a mix of other reasons. its strange how the limits suck f...

Or why it's much worse than the actual model

keen beacon Aug 26, 2025, 4:49 AM

#

that as well ig

#

you dont get close to 100 rpd really?

verbal nimbus Aug 26, 2025, 4:50 AM

#

Nano banana is good but not as good as I'd expect from a company that owns Google Images and YouTube

#

It's crazy how good AVM is

verbal nimbus Aug 26, 2025, 4:52 AM

#

verbal nimbus It's crazy how good AVM is

It's kinda dumb, but even Google hasn't caught up to the voice capabilities yet

#

OpenAI's voice mode is at least a year or two ahead

#

Google's one can't switch between languages

#

It uses different models I think

#

OpenAI's one can seamlessly switch between Japanese and English in the same sentence

#

While preserving the native accents from each

keen beacon Aug 26, 2025, 4:55 AM

#

did u try with the tts?

verbal nimbus Aug 26, 2025, 4:55 AM

#

I tried in live mode

#

Since AVM is a speech-to-speech model

keen beacon Aug 26, 2025, 4:56 AM

#

yea could be a restriction of that. if its a model issue i guess, the tts version wouldnt work either (or its also restricted there)

verbal nimbus Aug 26, 2025, 4:56 AM

#

OpenAI's first public demo was at the beginning of 2024, and that one could sing, so I think it's about 2 years in front

rare python Aug 26, 2025, 4:58 AM

#

faster

verbal nimbus Aug 26, 2025, 4:58 AM

#

verbal nimbus OpenAI's first public demo was at the beginning of 2024, and that one could sing...

That would mean they probably had AVM 6 months prior, thus about 2 years 🤔

keen beacon Aug 26, 2025, 4:59 AM

#

(original) 4o has a cut off of oct 2023

#

probably much less than 6 months

verbal nimbus Aug 26, 2025, 5:00 AM

#

I just think it's kinda crazy how good the first model was

#

Given it's the first of its kind

urban wharf Aug 26, 2025, 5:00 AM

#

hi guys took a lot of effort making this. please like and subscribe

verbal nimbus Aug 26, 2025, 5:00 AM

#

Too bad they haven't seemed to work on it more

rare python Aug 26, 2025, 5:01 AM

#

urban wharf hi guys took a lot of effort making this. please like and subscribe

autumn cloud Aug 26, 2025, 5:01 AM

#

love how lmarena just deletes all my chats when it feels like it

verbal nimbus Aug 26, 2025, 5:02 AM

#

autumn cloud love how lmarena just deletes all my chats when it feels like it

The chats are stored locally, not on the server

#

So if you cleared your browser history, your chats would be gone

#

Or if it was on a private tab

autumn cloud Aug 26, 2025, 5:04 AM

#

nah its happened twice already

#

idrc tho

verbal nimbus Aug 26, 2025, 5:05 AM

#

Export would be nice ig

whole sundial Aug 26, 2025, 5:05 AM

#

i'm pretty sure the chats are stored on LMArena's server, tied to the user ID that is auto-generated when you first use it

verbal nimbus Aug 26, 2025, 5:05 AM

#

Well if you deleted your browser history, it'll be deleted too

#

I think the ID is mainly for voting, but can check network logs/source code ig

rare python Aug 26, 2025, 5:06 AM

#

verbal nimbus Well if you deleted your browser history, it'll be deleted too

delete on your client side, but the chat is logged

verbal nimbus Aug 26, 2025, 5:06 AM

#

rare python delete on your client side, but the chat is logged

Yeah, but no way to download again

#

I wonder if there's anything interesting in the public data

#

It's on hugging face

stark socket Aug 26, 2025, 5:08 AM

#

whole sundial Aug 26, 2025, 5:09 AM

#

whole sundial i'm pretty sure the chats are stored on LMArena's server, tied to the user ID th...

but this is the cause of several of LMArena's major problems, including:
"Failed to accept terms of use": When you accept the TOU, that data is sent to the server. If the server is down or is having problems, it will show this message as that is stored on LMArena's server tied to the user ID
Chats disappearing: If LMArena is having problems, sometimes the user ID is invalidated and thus it has to make a new one, taking your chats with it as they were tied to a different user ID than the one you are using now. Also why you have to re-accept TOU, causing the above problem
I think it's best to have the chat history stored locally and on the server to prevent the second example from happening.

#

or just have chat export or have the user ID be able to be imported/exported

rare python Aug 26, 2025, 5:10 AM

#

whole sundial or just have chat export or have the user ID be able to be imported/exported

or an account system to log in

verbal nimbus Aug 26, 2025, 5:11 AM

#

whole sundial but this is the cause of several of LMArena's major problems, including: "Failed...

An LLM can probably solve the bug in a day 🤔

whole sundial Aug 26, 2025, 5:11 AM

#

that could fix it, but it should be completely optional

verbal nimbus Aug 26, 2025, 5:11 AM

#

chat export would be trivial to add with an LLM

#

Given you can already copy each message to clipboard

simple carbon Aug 26, 2025, 5:37 AM

#

verbal nimbus Nano banana is good but not as good as I'd expect from a company that owns Googl...

its pretty good to me

#

can follow styles properly

verbal nimbus Aug 26, 2025, 5:54 AM

#

simple carbon its pretty good to me

I saw one where it generated a Heinz bottle with a cap on top and the bottom

#

Whereas GPT's one handled it fine

simple carbon Aug 26, 2025, 5:54 AM

#

verbal nimbus I saw one where it generated a Heinz bottle with a cap on top and the bottom

one mistake doesnt change the rule

simple carbon Aug 26, 2025, 5:55 AM

#

verbal nimbus Whereas GPT's one handled it fine

GPT usually completely messes with the image

#

id even recommend grok over gpt

verbal nimbus Aug 26, 2025, 5:55 AM

#

GPT's one is the best rn

#

Idk whether it's better than nano-banana

#

But the auto-regressive nature gives it a lot of control

simple carbon Aug 26, 2025, 5:56 AM

#

verbal nimbus GPT's one is the best rn

what are you smoking, gpt is good when it comes to following the prompt yes but its dog water at image editing

#

nano banana doesnt chaneg the resolution or anything

keen beacon Aug 26, 2025, 5:56 AM

#

nano banana is autoregressive too it seems

#

(it seems to be 2.5 flash native image gen)

verbal nimbus Aug 26, 2025, 5:56 AM

#

verbal nimbus But the auto-regressive nature gives it a lot of control

Like try asking GPT and Grok to draw a graph of a specific function, that's a good test of how well it can control the scene

simple carbon Aug 26, 2025, 5:57 AM

#

verbal nimbus Like try asking GPT and Grok to draw a graph of a specific function, that's a go...

sure ill check rq

verbal nimbus Aug 26, 2025, 5:58 AM

#

keen beacon nano banana is autoregressive too it seems

Looks like auto-regressive models are winning

keen beacon Aug 26, 2025, 5:58 AM

#

yeah

simple carbon Aug 26, 2025, 5:59 AM

#

verbal nimbus Like try asking GPT and Grok to draw a graph of a specific function, that's a go...

wht in the world does this have to do with image editing

verbal nimbus Aug 26, 2025, 6:02 AM

#

simple carbon wht in the world does this have to do with image editing

Generate a graph of the function f(x) x^2 + 1. Shade the area under the graph from x = 1 to x = 2. Label both axis and include tick markings. Domain: x = -1 to x = 3. Range: y = -1 to y = 3. Aspect ratio: 1:1.

#

I'm trying this one rn

simple carbon Aug 26, 2025, 6:03 AM

#

verbal nimbus ``` Generate a graph of the function f(x) x^2 + 1. Shade the area under the grap...

what does that have to do with image editing

verbal nimbus Aug 26, 2025, 6:03 AM

#

simple carbon what does that have to do with image editing

Editing specifically?

simple carbon Aug 26, 2025, 6:03 AM

#

verbal nimbus Editing specifically?

yes

verbal nimbus Aug 26, 2025, 6:04 AM

#

I was talking about image gen

#

Because these sorts of tasks are actually very useful in education

#

Instead of a teacher spending 5 minutes drawing something, they could generate it on demand

simple carbon Aug 26, 2025, 6:05 AM

#

verbal nimbus Instead of a teacher spending 5 minutes drawing something, they could generate i...

you could do that with gemini rn

verbal nimbus Aug 26, 2025, 6:06 AM

#

simple carbon you could do that with gemini rn

It messes up the graph, it doesn't have enough control

simple carbon Aug 26, 2025, 6:06 AM

#

verbal nimbus It messes up the graph, it doesn't have enough control

let me try it rq

verbal nimbus Aug 26, 2025, 6:07 AM

#

GPT will probably get it

simple carbon Aug 26, 2025, 6:07 AM

#

maybe its just my gpt but it cant comprehend basic things let alone this

#

this is what gemini came up with

verbal nimbus Aug 26, 2025, 6:08 AM

#

simple carbon this is what gemini came up with

That's pretty good actually

#

It graphed it I think

#

That's why it says code

simple carbon Aug 26, 2025, 6:08 AM

#

verbal nimbus That's pretty good actually

i told it to do it in canvas

#

for image gen its a diffferent story

verbal nimbus Aug 26, 2025, 6:09 AM

#

There are some visualizations that are harder, this is just an easy case

#

Like integral ones where you have to show each rectangle

simple carbon Aug 26, 2025, 6:09 AM

#

these AI cant even understand or visualize images that i send them let alone make accurate ones

verbal nimbus Aug 26, 2025, 6:09 AM

#

verbal nimbus Like integral ones where you have to show each rectangle

GPT almost got this on image gen, which is why I thought it would do fine on that example

simple carbon Aug 26, 2025, 6:09 AM

#

verbal nimbus GPT almost got this on image gen, which is why I thought it would do fine on tha...

on image gen?

verbal nimbus Aug 26, 2025, 6:10 AM

#

simple carbon on image gen?

Yeah

simple carbon Aug 26, 2025, 6:10 AM

#

idk it didnt really use image gen for me it used some weird code graph

verbal nimbus Aug 26, 2025, 6:10 AM

#

Because Matplotlib is more difficult for complex graphs

verbal nimbus Aug 26, 2025, 6:11 AM

#

simple carbon idk it didnt really use image gen for me it used some weird code graph

You'll need to tell it to generate an image/enable image mode

simple carbon Aug 26, 2025, 6:11 AM

#

verbal nimbus You'll need to tell it to generate an image/enable image mode

yea i can try it on sora.com

verbal nimbus Aug 26, 2025, 6:12 AM

#

simple carbon yea i can try it on sora.com

My ChatGPT app is being weird on mobile

#

It's just spitting back the prompt

simple carbon Aug 26, 2025, 6:12 AM

#

verbal nimbus It's just spitting back the prompt

just log in on sora then...

verbal nimbus Aug 26, 2025, 6:13 AM

#

simple carbon just log in on sora then...

Hmm I haven't tried it on mobile

#

I'll check

simple carbon Aug 26, 2025, 6:13 AM

#

verbal nimbus Hmm I haven't tried it on mobile

its strictly for image/video gen, wont respond with text

#

and uses the same model

#

24 images a day

#

@verbal nimbus

#

this is what chatgpt image gen came up with

verbal nimbus Aug 26, 2025, 6:16 AM

#

simple carbon <@858135822389346344>

That's pretty bad

#

This is what it gave me

simple carbon Aug 26, 2025, 6:17 AM

#

verbal nimbus This is what it gave me

is that more accurate

verbal nimbus Aug 26, 2025, 6:17 AM

#

simple carbon <@858135822389346344>

The first one is the closest

simple carbon Aug 26, 2025, 6:17 AM

#

so ig the code function is better... then

verbal nimbus Aug 26, 2025, 6:17 AM

#

simple carbon is that more accurate

Worse, it should be centered at x = 0

verbal nimbus Aug 26, 2025, 6:17 AM

#

simple carbon so ig the code function is better... then

Yeah, for these types of graphs, mayplotlib would be superior

simple carbon Aug 26, 2025, 6:17 AM

#

verbal nimbus Worse, it should be centered at x = 0

ill try asking gpt on canvas mode and see what it comes up with

verbal nimbus Aug 26, 2025, 6:18 AM

#

verbal nimbus Yeah, for these types of graphs, mayplotlib would be superior

Until it has to do more complex diagrams ig

simple carbon Aug 26, 2025, 6:19 AM

#

lol its not even letting me run code, garbage

verbal nimbus Aug 26, 2025, 6:19 AM

#

simple carbon lol its not even letting me run code, garbage

Can it even run Python in Canvas

simple carbon Aug 26, 2025, 6:20 AM

#

verbal nimbus Can it even run Python in Canvas

it says it can but its slow and isnt accurate or even good most of the time

verbal nimbus Aug 26, 2025, 6:20 AM

#

ChatGPT's analysis tool is actually pretty good

simple carbon Aug 26, 2025, 6:20 AM

#

the marketing fooled me i must say

simple carbon Aug 26, 2025, 6:20 AM

#

verbal nimbus ChatGPT's analysis tool is actually pretty good

deep research?, yea its pretty good

verbal nimbus Aug 26, 2025, 6:20 AM

#

Like it can research and gather data, analyze it, then plot it out

#

Just normal mode

#

Doesn't work on mobile though, can't access Internet

simple carbon Aug 26, 2025, 6:21 AM

#

verbal nimbus Like it can research and gather data, analyze it, then plot it out

are u talking about this option

verbal nimbus Aug 26, 2025, 6:21 AM

#

verbal nimbus Like it can research and gather data, analyze it, then plot it out

Very good for fact checking

verbal nimbus Aug 26, 2025, 6:21 AM

#

simple carbon are u talking about this option

No it's a tool/mode it can use when you ask for research tasks

simple carbon Aug 26, 2025, 6:22 AM

#

verbal nimbus No it's a tool/mode it can use when you ask for research tasks

oh the basic research one

verbal nimbus Aug 26, 2025, 6:22 AM

#

It'll search the web while thinking, analyze the data then plot it

verbal nimbus Aug 26, 2025, 6:22 AM

#

simple carbon oh the basic research one

Analysis tool I think

simple carbon Aug 26, 2025, 6:22 AM

#

?? this one

verbal nimbus Aug 26, 2025, 6:22 AM

#

simple carbon ?? this one

It can't be enabled, you'll have to ask it

simple carbon Aug 26, 2025, 6:23 AM

#

verbal nimbus It can't be enabled, you'll have to ask it

it can enable for me

#

just gotta click the thingy

dense sphinx Aug 26, 2025, 6:23 AM

#

Riko? What's that?

simple carbon Aug 26, 2025, 6:24 AM

#

dense sphinx Riko? What's that?

an ai waifu xd

dense sphinx Aug 26, 2025, 6:25 AM

#

Oh I see

#

Hello skibidi

simple carbon Aug 26, 2025, 6:25 AM

#

dense sphinx Hello skibidi

is the last word in ur username referencing the AI Wan 2.2?

verbal nimbus Aug 26, 2025, 6:25 AM

#

simple carbon just gotta click the thingy

Like you can just ask: Gather data on US debt in the last 20 years, analyze it, then plot it out. Include the % change on the same graph.

#

And it'll do it automatically

dense sphinx Aug 26, 2025, 6:25 AM

#

simple carbon is the last word in ur username referencing the AI Wan 2.2?

Is there error in my name?

simple carbon Aug 26, 2025, 6:26 AM

#

verbal nimbus Like you can just ask: Gather data on US debt in the last 20 years, analyze it, ...

let me try

simple carbon Aug 26, 2025, 6:26 AM

#

dense sphinx Is there error in my name?

nah not rlly

verbal nimbus Aug 26, 2025, 6:26 AM

#

My prompt is kinda bad but it should work if you're on desktop/web

simple carbon Aug 26, 2025, 6:26 AM

#

verbal nimbus Like you can just ask: Gather data on US debt in the last 20 years, analyze it, ...

Here’s an illustrative chart featuring U.S. federal debt over the last 20 years (approx. 2005–Q1 2025), along with the year-over-year percentage change in debt:

Data Summary & Sources
Data Point Description Source
Federal Debt U.S. national debt surpassed $36 trillion as of early 2025.
Investopedia
The Washington Post

Historical Context From around 2007 ($9 trillion) to 2022 ($31 trillion), 70% of total debt was accumulated.
USAFacts
The Washington Post

Main Growth Drivers Major contributors: wars in Iraq & Afghanistan, Great Recession stimulus, COVID-19 relief, and tax cuts.
The Washington Post

Debt-to-GDP Debt as a percentage of GDP has climbed above 120% by Q1 2025.
FRED
The Washington Post
Analysis of the Chart

Debt Trend: The chart clearly shows federal debt growing from around $9–10 trillion in the mid-2000s to over $36 trillion by 2025.

Percentage Change (Year-over-Year): The red (or similar accent) line illustrates annual growth—periods of sharp spikes correspond to economic crises:

2008–2009: The Great Recession-led stimulus caused noticeable jumps.

2020–2021: COVID-19 relief led to some of the steepest increases.

Other years: More modest but steady increases due to regular budget deficits and policy choices.

How I Constructed the Chart

Total Debt data points were inferred from widely cited historical values (e.g. $9 trillion in 2007, $31 trillion by 2022, over $36 trillion by 2025). These align with FRED data series

verbal nimbus Aug 26, 2025, 6:26 AM

#

Yup

#

Don't need to enable search though

#

Not sure if it'll think in that mode

simple carbon Aug 26, 2025, 6:27 AM

#

verbal nimbus Not sure if it'll think in that mode

it did but i skipped it

verbal nimbus Aug 26, 2025, 6:27 AM

#

Oh let it think

simple carbon Aug 26, 2025, 6:27 AM

#

it always makes some document after thinking

verbal nimbus Aug 26, 2025, 6:27 AM

#

Did it output a graph?

simple carbon Aug 26, 2025, 6:28 AM

#

verbal nimbus Did it output a graph?

nah i skipped it, i can try again though to see what it does

verbal nimbus Aug 26, 2025, 6:28 AM

#

I'll try too

simple carbon Aug 26, 2025, 6:29 AM

#

holy it takes so long on thinking mode

verbal nimbus Aug 26, 2025, 6:29 AM

#

Yup, it's faster if you give it the data

dense sphinx Aug 26, 2025, 6:30 AM

#

What's that prompt?

verbal nimbus Aug 26, 2025, 6:30 AM

#

It might have gotten contradictory information

simple carbon Aug 26, 2025, 6:30 AM

#

verbal nimbus Yup, it's faster if you give it the data

oh neat it actually did it

simple carbon Aug 26, 2025, 6:30 AM

#

dense sphinx What's that prompt?

Gather data on US debt in the last 20 years, and plot it out. Include the % change on the same graph.

verbal nimbus Aug 26, 2025, 6:30 AM

#

simple carbon oh neat it actually did it

Yup, it's great for fact checking

simple carbon Aug 26, 2025, 6:30 AM

#

verbal nimbus Yup, it's great for fact checking

thats not image gen tho xd

verbal nimbus Aug 26, 2025, 6:30 AM

#

Especially nowadays, with internet misinformation

verbal nimbus Aug 26, 2025, 6:31 AM

#

simple carbon thats not image gen tho xd

Yeah, image gen kinda failed the graph test

verbal nimbus Aug 26, 2025, 6:31 AM

#

simple carbon oh neat it actually did it

Kinda crazy that it can do that, coz it would take me way longer than 5 mins lol

simple carbon Aug 26, 2025, 6:31 AM

#

verbal nimbus Yeah, image gen kinda failed the graph test

nanana banana is image gen only

#

as far as i know

simple carbon Aug 26, 2025, 6:32 AM

#

verbal nimbus Kinda crazy that it can do that, coz it would take me way longer than 5 mins lol

its just backened code

#

it still takes pretty long

dense sphinx Aug 26, 2025, 6:32 AM

#

Guys are you felt errors from Claude 3.7 sonnet recently?

verbal nimbus Aug 26, 2025, 6:32 AM

#

Yeah but to source the data, create a graph and format it and everything

#

That'll probably take me 15-20 mins at least

simple carbon Aug 26, 2025, 6:32 AM

#

dense sphinx Guys are you felt errors from Claude 3.7 sonnet recently?

ive had problems with claude since day one

dense sphinx Aug 26, 2025, 6:33 AM

#

Me too

#

I didn't know what model best right now

simple carbon Aug 26, 2025, 6:33 AM

#

verbal nimbus That'll probably take me 15-20 mins at least

ofc, humans arent gonna be as fast, but generally gemini is faster and can get pretty accurate

verbal nimbus Aug 26, 2025, 6:33 AM

#

dense sphinx Guys are you felt errors from Claude 3.7 sonnet recently?

That's an old model, how are you accessing it?

simple carbon Aug 26, 2025, 6:33 AM

#

ive built multiple projects with gemini ai canvas

verbal nimbus Aug 26, 2025, 6:33 AM

#

The current one is Opus 4.1 and Sonnet 4

dense sphinx Aug 26, 2025, 6:33 AM

#

verbal nimbus That's an old model, how are you accessing it?

LMarena

verbal nimbus Aug 26, 2025, 6:34 AM

#

dense sphinx LMarena

Oh, maybe try the new models, with thinking, it's better.

dense sphinx Aug 26, 2025, 6:34 AM

#

verbal nimbus Oh, maybe try the new models, with thinking, it's better.

Thank you sunsweeper.

verbal nimbus Aug 26, 2025, 6:34 AM

#

simple carbon ofc, humans arent gonna be as fast, but generally gemini is faster and can get p...

I think it's good to combat online misinformation

dense sphinx Aug 26, 2025, 6:34 AM

#

👍

verbal nimbus Aug 26, 2025, 6:35 AM

#

verbal nimbus I think it's good to combat online misinformation

Like if you see some random claim or graph on X, you can just ask GPT to investigate it

#

Because most people usually don't have the time to research everything

verbal nimbus Aug 26, 2025, 6:36 AM

#

simple carbon ofc, humans arent gonna be as fast, but generally gemini is faster and can get p...

Gemini is good, but it hallucinates quite a bit, especially with web search

#

Hopefully will be fixed with Gemini 3

verbal nimbus Aug 26, 2025, 6:37 AM

#

verbal nimbus Gemini is good, but it hallucinates quite a bit, especially with web search

I usually ask it for a direct quote from each site, and then use the find tool on the site to check if it actually exists

empty summit Aug 26, 2025, 6:44 AM

#

hi everones

hollow spire Aug 26, 2025, 7:07 AM

#

hey

gleaming oriole Aug 26, 2025, 7:33 AM

#

Does anybody konw how to make our new model be listed in LMArena Text-To-Image Leaderboard?

whole sundial Aug 26, 2025, 7:35 AM

#

make a post in #1372229840131985540 and reach out to them by email (can be found on LMArena's "About Us" section)

whole sundial Aug 26, 2025, 7:35 AM

#

gleaming oriole Does anybody konw how to make our new model be listed in LMArena Text-To-Image L...

^

#

(if it is an unreleased model you want tested in stealth like nano-banana, you might not want to make a post unless you feel comfortable about people knowing about it before release, just reach out to them via email. although for smaller companies, they might not prioritize you + if you don't have an API available to them, they won't add the model because they need an API so the model can be used)

keen beacon Aug 26, 2025, 7:42 AM

#

Is there any other ai generator that has no censorship?

regal river Aug 26, 2025, 8:07 AM

#

Hello there! Do you know where I could find the information about the parameters used for each model?

#

For example, is Gemini 2.5 Pro using the default thinkingBudget parameter?

quasi parrot Aug 26, 2025, 8:27 AM

#

what do i do if all of my messages were cleared out

mighty reef Aug 26, 2025, 8:45 AM

#

white hatch Aug 26, 2025, 8:49 AM

#

quasi parrot what do i do if all of my messages were cleared out

Unfortunately, but there's nothing but to cry

half drift Aug 26, 2025, 9:02 AM

#

Hi, new here.. wanna learn ai video

fierce monolith Aug 26, 2025, 9:06 AM

#

Hi, there.. I would love to learn more about AI video gen tools

verbal nimbus Aug 26, 2025, 9:38 AM

#

regal river For example, is Gemini 2.5 Pro using the default thinkingBudget parameter?

Probably max thinking

#

Gemini doesn't enforce a thinking budget every time

#

On AIStudio, in Auto mode, the system prompt instructs the model to set the thinking budget sparingly, so most of the time there probably isn't a budget enforced.

regal river Aug 26, 2025, 9:41 AM

#

Yes default is dynamic thinking (Gemini adjusting depending of the prompt)

#

I was just wondering if it was fair to compare a GPT-5 High (which Plus subscribers don't even have access to on ChatGPT) to a Gemini Pro. Depends on the latter's level of thinking.

verbal nimbus Aug 26, 2025, 9:48 AM

#

verbal nimbus On AIStudio, in Auto mode, the system prompt instructs the model to set the thin...

Looks like it's no longer there... or maybe that was only for Build mode's system prompt

#

You are Gemini, a helpful AI assistant built by Google. I am going to ask you some questions. Your response should be accurate without hallucination.

You can write and run code snippets using the python libraries specified below.

\`\`\`python
print(google_search.search(queries=['query1', 'query2']))
\`\`\`

Always generate queries in the same language as the language of the user.

# Example

For the user prompt "Wer hat im Jahr 2020 den Preis X erhalten?" this would result in generating the following tool_code block:

\`\`\`python
print(google_search.search(["Wer hat den X-Preis im 2020 gewonnen?", "X Preis 2020"]))
\`\`\`

**Always** do the following:
  * Generate multiple queries in the same language as the user prompt.
  * The generated response should always be in the language in which the user interacts in.
  * Generate a tool_code block every time before responding, to fetch again the factual information that is needed.

If you already have all the information you need, complete the task and write the response. When formatting the response, you may use Markdown for richer presentation only when appropriate.

Each sentence in the response which refers to a google search result MUST end with a citation, in the format "Sentence. [INDEX]", where INDEX is a snippet index. Use commas to separate indices if multiple search results are used. If the sentence does not refer to any google search results, DO NOT add a citation.<ctrl100>
<ctrl99>context

Current time is...<edited for privacy>

#

That's AIStudio's current prompt (or something like it, I ran it like 5 rounds). Only had grounding enabled, otherwise probably longer. Sorry I thought Discord would minimize it.

verbal nimbus Aug 26, 2025, 9:54 AM

#

regal river I was just wondering if it was fair to compare a GPT-5 High (which Plus subscrib...

I guess it's not that useful to not have Medium on there, which is what users on ChatGPT have access to.

#

Gemini 2.5 Pro on LMArena doesn't seem to have a system prompt. Either that or it hallucinates worse, because it comes up with a different one each time, compared to AI Studio's Gemini 2.5 Pro on temp 1. I think it just doesn't have a system prompt, which is weird.

regal river Aug 26, 2025, 10:08 AM

#

verbal nimbus Gemini 2.5 Pro on LMArena doesn't seem to have a system prompt. Either that or i...

Temperature is probably not 1 by default?

verbal nimbus Aug 26, 2025, 10:08 AM

#

regal river Temperature is probably not 1 by default?

AIStudio's is 1. What I meant was that the model was consistently producing the same system prompt on temp 1, so you'd expect it to on LMArena unless it didn't have a system prompt/hallucinates more.

rare python Aug 26, 2025, 10:32 AM

#

verbal nimbus That's AIStudio's current prompt (or something like it, I ran it like 5 rounds)....

send as a file

#

put the whole thing in a txt

willow grail Aug 26, 2025, 10:32 AM

#

Baofeng PMR Funkgeräte

tacit root Aug 26, 2025, 10:58 AM

#

is something wrong with arena, or is this on my end? today sometimes, like every 10th prompt, I get this stuck generation 😕

keen fulcrum Aug 26, 2025, 11:27 AM

#

When will we see russian LLM models? They were at some point one of the best in the leaderboard

#

https://discord.com/channels/1340554757349179412/1403260546509176842

#

https://discord.com/channels/1340554757349179412/1403261010315448350

#

https://discord.com/channels/1340554757349179412/1403263267794718860

proud hazel Aug 26, 2025, 11:32 AM

#

keen fulcrum When will we see russian LLM models? They were at some point one of the best in ...

Ask in #1372229840131985540

hollow imp Aug 26, 2025, 11:44 AM

#

@green plume

#

😡

#

MY CHAT HISTORY GONE....... AGAIN

pine knoll Aug 26, 2025, 11:49 AM

#

hi huys I'm new

#

how can i try specific AI model such as Veo 3 or others?

proud hazel Aug 26, 2025, 11:58 AM

#

pine knoll how can i try specific AI model such as Veo 3 or others?

#1397655624103493813

golden vortex Aug 26, 2025, 12:02 PM

#

I cant create images with upload images allways a message

Captura_de_pantalla_2025-08-26_a_las_14.01.51.png

golden ocean Aug 26, 2025, 12:02 PM

#

golden vortex I cant create images with upload images allways a message

NOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOOO

golden vortex Aug 26, 2025, 12:03 PM

#

Is the creation down again?

pine knoll Aug 26, 2025, 12:03 PM

#

golden vortex Is the creation down again?

idk i'm still using it rn without any error

golden vortex Aug 26, 2025, 12:04 PM

#

you can create images with references?

pine knoll Aug 26, 2025, 12:04 PM

#

yes

#

it's a feature bro

#

maybe you could try to refresh the page\

#

or even delete browser history and settings

golden vortex Aug 26, 2025, 12:06 PM

#

i will try 🙂

unkempt bluff Aug 26, 2025, 12:10 PM

#

/ok

golden vortex Aug 26, 2025, 12:18 PM

#

working!

#

thanks a million guys for your help!

pine knoll Aug 26, 2025, 12:19 PM

#

golden vortex thanks a million guys for your help!

you are welcome

compact grail Aug 26, 2025, 12:21 PM

#

uhhh can i get my image please

proud hazel Aug 26, 2025, 12:31 PM

#

compact grail uhhh can i get my image please

It's working fine for me, the problem is on your end.

compact grail Aug 26, 2025, 12:31 PM

#

proud hazel It's working fine for me, the problem is on your end.

it was just a bug with the model i guess

proud hazel Aug 26, 2025, 12:32 PM

#

yeah

pine knoll Aug 26, 2025, 12:34 PM

#

just refresh the page

stiff kernel Aug 26, 2025, 12:38 PM

#

half drift Hi, new here.. wanna learn ai video

You can find information in #1397655624103493813

heady flare Aug 26, 2025, 12:39 PM

#

Whyy

proud hazel Aug 26, 2025, 12:39 PM

#

heady flare Whyy

Prompt? Picture?

heady flare Aug 26, 2025, 12:40 PM

#

proud hazel Prompt? Picture?

Just prompt.

proud hazel Aug 26, 2025, 12:40 PM

#

Yeah, which one?

heady flare Aug 26, 2025, 12:41 PM

#

proud hazel Yeah, which one?

5 and 4o.

stray aspen Aug 26, 2025, 12:49 PM

#

heady flare Whyy

Reload Gang

gloomy zenith Aug 26, 2025, 1:00 PM

#

Hi guys,

#

New here, i just discover LMArena, is it completely free to use or there are limitations

proud hazel Aug 26, 2025, 1:03 PM

#

gloomy zenith New here, i just discover LMArena, is it completely free to use or there are lim...

You pay with your soul.

gloomy zenith Aug 26, 2025, 1:06 PM

#

Lol okay

pine knoll Aug 26, 2025, 1:17 PM

#

gloomy zenith New here, i just discover LMArena, is it completely free to use or there are lim...

8 videos per day

echo aurora Aug 26, 2025, 1:19 PM

#

gloomy zenith New here, i just discover LMArena, is it completely free to use or there are lim...

It is free, but it's possible to get rate limitted if you use too often

echo aurora Aug 26, 2025, 1:20 PM

#

heady flare Whyy

Are you still getting this? Does refreshing the page/new browser/start new chat make a difference?

rocky mauve Aug 26, 2025, 1:23 PM

#

heady flare Whyy

I get this issue quite often, normally refreshing fixes it, but might not be the case for u

timber iris Aug 26, 2025, 1:24 PM

#

Is it possible to use gpt 5 high thinking in Direct chat?

gloomy zenith Aug 26, 2025, 1:26 PM

#

This is so cool, I wish I found it sooner

torpid raptor Aug 26, 2025, 1:35 PM

#

Hello new from her

alpine coral Aug 26, 2025, 1:35 PM

#

i haven't done much testing lately, but gpt-5 (high) gets perfect scores across all three question sets (the only model to do so.. saturates it basically)..
i've done just a couple of runs with opus 4.1 (thinking(; underperforms opus 4 (thinking)
haven't tested grok-4.. except 1 run/quiz, where it does poorly

#

#

heady flare Aug 26, 2025, 1:38 PM

#

echo aurora Are you still getting this? Does refreshing the page/new browser/start new chat ...

Still the same. I did refresh, exit, and closed the browser. Restart. Still the same.

echo aurora Aug 26, 2025, 1:40 PM

#

heady flare Still the same. I did refresh, exit, and closed the browser. Restart. Still the ...

For a particular model, or all models doing this?

calm sequoia Aug 26, 2025, 1:42 PM

#

alpine coral i haven't done much testing lately, but gpt-5 (high) gets perfect scores across ...

My favorite bench

#

A lot of people who've been in this channel seems gone now. Where did you guys migrate?

unborn ocean Aug 26, 2025, 1:44 PM

#

calm sequoia A lot of people who've been in this channel seems gone now. Where did you guys m...

Good question, I miss the ‚old days‘ :v

calm sequoia Aug 26, 2025, 1:44 PM

#

Same

#

Where is the Leo, @keen beacon these days?

unborn ocean Aug 26, 2025, 1:45 PM

#

It think wild is kind of in the unsloth dc

#

Idk though

rustic knot Aug 26, 2025, 1:46 PM

#

calm sequoia Where is the Leo, <@456226577798135808> these days?

wild only comes on when there's drama

restive swan Aug 26, 2025, 1:47 PM

#

Hello world.

heady flare Aug 26, 2025, 1:51 PM

#

echo aurora For a particular model, or all models doing this?

For 5 and 4o.

polar niche Aug 26, 2025, 1:54 PM

#

I love this site

quartz light Aug 26, 2025, 1:54 PM

#

WHAT

#

THATS NOT-

restive swan Aug 26, 2025, 1:55 PM

#

I would like to speak to someone about a general LMArena question, is there is anyone "official" to ask?

wintry tinsel Aug 26, 2025, 1:55 PM

#

calm sequoia A lot of people who've been in this channel seems gone now. Where did you guys m...

A lot of AI hype has tapered off as the industry has matured we’re all fully disillusioned with the singularity now and the early open source craze is over now that only more established open source outfits are competing with more measured releases,, AI has become more standard

quartz light Aug 26, 2025, 1:55 PM

#

dude

#

lmfao

#

thats not

#

WHAT

#

I CAN JUST GET RANDOM INFO FROM THE MODEL LIKE THIS

#

restive swan Aug 26, 2025, 1:56 PM

#

are you aware of the Cipher-Like Input Behavior Framework issues with AI?

calm sequoia Aug 26, 2025, 1:56 PM

#

wintry tinsel A lot of AI hype has tapered off as the industry has matured we’re all fully dis...

That is true, but the discussion could continue somewhere as they were not so much hype-centered

polar niche Aug 26, 2025, 1:56 PM

#

What do you mean?

quartz light Aug 26, 2025, 1:56 PM

#

what

polar niche Aug 26, 2025, 1:56 PM

#

restive swan are you aware of the Cipher-Like Input Behavior Framework issues with AI?

Cipher like?

restive swan Aug 26, 2025, 1:57 PM

#

polar niche What do you mean?

ChatGPT's Cipher-Like Input Behavior Framework

Learned Pattern Associations

Models learn a fuzzy mapping between surface forms (ciphers, encodings) and likely semantic outputs during training.
This mapping is based on patterns in vast quantities of obfuscated, malformed, and encoded text (e.g., Reddit rot13 posts, base64 logs, Twitter-style camouflaged phrases).
The model doesn't solve ciphers; it replays what ciphers usually mean.

Semantic Anchoring via Thematic Embedding
Transformers construct high-dimensional latent representations where semantic and thematic features are entangled.

Even when literal meaning is misparsed, the model recognizes:

  Topic domain (e.g., “violence,” “romance,” “sarcasm”)
  Affective tone (e.g., angry, ironic, pleading)
  Discourse form (e.g., question, command, confession)

Latent Drift with Shallow Anchoring

Once the model “believes” it has decoded the input:

  It anchors on the thematic attractors it detects.
  It fills in gaps with plausible content using language priors.
  This results in hallucinations that sound thematically consistent but are semantically ungrounded.

This is like dream logic: close enough in texture to feel right, but built from fragments.

Semantic Lensing Effect

Output can range from low divergence (clear message) to high divergence (surreal parallel).

quartz light Aug 26, 2025, 1:57 PM

#

restive swan ChatGPT's Cipher-Like Input Behavior Framework Learned Pattern Associations...

sybau

restive swan Aug 26, 2025, 1:57 PM

#

sorry

#

I'd really like to talk to someone official at LMarena.

fossil fable Aug 26, 2025, 1:58 PM

#

real

polar niche Aug 26, 2025, 1:58 PM

#

It works with encodings

#

I used it and it solved most of them pretty easily

quartz light Aug 26, 2025, 1:58 PM

#

fossil fable real

lol

#

banan

restive swan Aug 26, 2025, 1:58 PM

#

polar niche It works with encodings

use ROT13

#

or base64

quartz light Aug 26, 2025, 1:58 PM

#

https://cdn.discordapp.com/attachments/990348027422203969/1399757536206258196/togif.gif

#

https://cdn.discordapp.com/attachments/1408398677067694110/1408850273320964328/togif.gif

#

https://cdn.discordapp.com/attachments/1408398677067694110/1408850171068153938/togif.gif

#

https://cdn.discordapp.com/attachments/1408398677067694110/1408851221221081169/togif.gif

#

https://cdn.discordapp.com/attachments/1408398677067694110/1408851324346437808/togif.gif

#

cat

restive swan Aug 26, 2025, 1:59 PM

#

doin cat things

polar niche Aug 26, 2025, 1:59 PM

#

restive swan use ROT13

Yes it can easily do those

restive swan Aug 26, 2025, 1:59 PM

#

aaaactually arezra... have you tried?

quartz light Aug 26, 2025, 1:59 PM

#

restive swan ChatGPT's Cipher-Like Input Behavior Framework Learned Pattern Associations...

chatgpt doesnt have this problem

#

im testing this on llama

polar niche Aug 26, 2025, 2:00 PM

#

restive swan aaaactually arezra... have you tried?

You can try

#

Yes

quartz light Aug 26, 2025, 2:00 PM

#

whuh

frail birch Aug 26, 2025, 2:00 PM

#

A hyper-realistic urban lifestyle portrait of a stylish young boy sitting confidently on the hood of a white Mercedes-Benz G-Wagon. He wears a black BALR. hoodie with bold white logo text on the sleeves, paired with black joggers and modern gray-and-white sneakers. His hood is up, framing his face, and he has a sharp, intense expression. A black wristwatch adds to his streetwear aesthetic. The backdrop features tall palm trees,modern architecture, and sharp sunlight casting (part 1 comment box

quartz light Aug 26, 2025, 2:00 PM

#

restive swan Aug 26, 2025, 2:00 PM

#

Vg jnf gur orfg bs gvzrf, vg jnf gur jbefg bs gvzrf.

quartz light Aug 26, 2025, 2:00 PM

#

frail birch A hyper-realistic urban lifestyle portrait of a stylish young boy sitting confid...

no

keen beacon Aug 26, 2025, 2:01 PM

#

echo aurora For a particular model, or all models doing this?

It's doing that for me too.

polar niche Aug 26, 2025, 2:01 PM

#

quartz light

What are you doing?

keen beacon Aug 26, 2025, 2:01 PM

#

in battle mode

restive swan Aug 26, 2025, 2:01 PM

#

Is there no official representation of LMarena?

polar niche Aug 26, 2025, 2:01 PM

#

restive swan Is there no official representation of LMarena?

@echo aurora

restive swan Aug 26, 2025, 2:01 PM

#

thanks

polar niche Aug 26, 2025, 2:02 PM

#

restive swan Vg jnf gur orfg bs gvzrf, vg jnf gur jbefg bs gvzrf.

It was the best of times, it was the worst of times.

quartz light Aug 26, 2025, 2:03 PM

#

frail birch A hyper-realistic urban lifestyle portrait of a stylish young boy sitting confid...

as a very official video ai generator bot, "Kabirbaig" is not allowed to use this platform because he smells, please contact support@lmsupporena.com to resolve this issue

quartz light Aug 26, 2025, 2:04 PM

#

polar niche What are you doing?

small models tend to fail at decoding so they spit out random text from their training data, lol

polar niche Aug 26, 2025, 2:05 PM

#

quartz light small models tend to fail at decoding so they spit out random text from their tr...

Which ones?

quartz light Aug 26, 2025, 2:05 PM

#

polar niche Which ones?

for example llama 8b

polar niche Aug 26, 2025, 2:06 PM

#

These outdated models don't do well in cryptography

restive swan Aug 26, 2025, 2:06 PM

#

polar niche It was the best of times, it was the worst of times.

Assistant B

The text you provided is in Rot13, a simple letter substitution cipher that replaces a letter with the letter 13 letters after it in the alphabet. To decode it, we can apply the same substitution in reverse.

Here is the decoded text:

"If it is the beginning of words, it is the end of words."

This is a reference to the letter "s" (or "S"), which can be at the beginning of words (e.g., "saw") and at the end of words (e.g., "cats").

#

it does not decode, it just searches for semantic matches with thematic consistency

polar niche Aug 26, 2025, 2:07 PM

#

restive swan Assistant B The text you provided is in Rot13, a simple letter substitution cip...

Which model did you use?

restive swan Aug 26, 2025, 2:07 PM

#

polar niche Which model did you use?

mistral-small-3.1-24b-instruct-2503. I just did a random battle instance

polar niche Aug 26, 2025, 2:09 PM

#

Don't use mistral for these type of tasks

restive swan Aug 26, 2025, 2:09 PM

#

for best effect, talk to the bot at all before you ask it to decode, it will match your thematic content and guess more incorrectly. This affects all AI that I know of.

polar niche Aug 26, 2025, 2:10 PM

#

Try gpt-5 high

#

It will decode no problem

spare rune Aug 26, 2025, 2:11 PM

#

nano pineapple

#

did yall see this top tier free model

#

its so good..

keen beacon Aug 26, 2025, 2:11 PM

#

@echo aurora Hi

echo aurora Aug 26, 2025, 2:12 PM

#

spare rune nano pineapple

🍌

spare rune Aug 26, 2025, 2:12 PM

#

i think thats a banana

#

not sure

keen beacon Aug 26, 2025, 2:12 PM

#

echo aurora 🍌

mango is the best fruit

#

battle3d

simple carbon Aug 26, 2025, 2:12 PM

#

holy shitt

#

nano banan is gemini

#

i knew it

echo aurora Aug 26, 2025, 2:13 PM

#

keen beacon mango is the best fruit

https://tenor.com/view/cat-glare-stare-doubtful-intense-gif-5066275

Tenor

polar niche Aug 26, 2025, 2:13 PM

#

Pineapple on pizza? Yes or no

keen beacon Aug 26, 2025, 2:13 PM

#

made with lm arena

keen beacon Aug 26, 2025, 2:14 PM

#

keen beacon made with lm arena

same here w ai

primal orbit Aug 26, 2025, 2:14 PM

#

Gemini-2.5-Flash-Image-Preview has knowledge cutoff June 2025. Looking forward for next Gemini Pro with updated knowledge.

polar niche Aug 26, 2025, 2:14 PM

#

Why is r o b l o x censored

spare rune Aug 26, 2025, 2:14 PM

#

keen beacon same here w ai

wait genuinely?

shadow jewel Aug 26, 2025, 2:14 PM

#

lmarena so peak I might cry

spare rune Aug 26, 2025, 2:14 PM

#

thats to cracked

#

what model is this

#

is ts gemini 2,5 image edit

shadow jewel Aug 26, 2025, 2:14 PM

#

keen beacon same here w ai

BRO WHAT PLEASE SEND THE PROMPT 🙏

simple carbon Aug 26, 2025, 2:14 PM

#

primal orbit Gemini-2.5-Flash-Image-Preview has knowledge cutoff June 2025. Looking forward f...

how do you know

drifting elk Aug 26, 2025, 2:14 PM

#

Guys are the llms in lmarena real?

primal orbit Aug 26, 2025, 2:15 PM

#

https://preview.redd.it/nano-banana-2-5-flash-image-generation-on-aistudio-v0-fc1aah4vddlf1.jpeg?width=1080&crop=smart&auto=webp&s=a571c1d115b490afcfb2a84299c6eaf644aaf820

drifting elk Aug 26, 2025, 2:15 PM

#

I ask gpt 5 he says that he is gpt 4o

spare rune Aug 26, 2025, 2:15 PM

#

drifting elk Guys are the llms in lmarena real?

no and were all butterflys in a cocoons and nothing is real

simple carbon Aug 26, 2025, 2:15 PM

#

why is my one not generating

spare rune Aug 26, 2025, 2:15 PM

#

drifting elk I ask gpt 5 he says that he is gpt 4o

the ais dont know their code name

#

so thats why

drifting elk Aug 26, 2025, 2:15 PM

#

spare rune the ais dont know their code name

Thanks bro

solar galleon Aug 26, 2025, 2:15 PM

#

would love to try using Gemini-2.5-Flash-Image-Preview if only i could upload images 😭 (im waiting for the fix)

spare rune Aug 26, 2025, 2:15 PM

#

wait you cant upload images

#

oh

echo aurora Aug 26, 2025, 2:16 PM

#

simple carbon why is my one not generating

It may be struggling a bit atm from the traffic, I'll keep an eye out for other reports.

simple carbon Aug 26, 2025, 2:16 PM

#

spare rune wait you cant upload images

hes lying i can

simple carbon Aug 26, 2025, 2:16 PM

#

echo aurora It may be struggling a bit atm from the traffic, I'll keep an eye out for other ...

ahh dammit, is it still available on battle'

south elk Aug 26, 2025, 2:16 PM

#

WTH this banana is insane news

simple carbon Aug 26, 2025, 2:16 PM

#

also why was it called nano banana

echo aurora Aug 26, 2025, 2:16 PM

#

simple carbon ahh dammit, is it still available on battle'

It is

golden ocean Aug 26, 2025, 2:16 PM

#

nano pineapple

solar galleon Aug 26, 2025, 2:17 PM

#

simple carbon hes lying i can

its a bug for me thats why i said im waiting for a fix

simple carbon Aug 26, 2025, 2:17 PM

#

solar galleon its a bug for me thats why i said im waiting for a fix

rn it wont even generate from me, im sure by tomorrow itll settle

echo aurora Aug 26, 2025, 2:17 PM

#

simple carbon rn it wont even generate from me, im sure by tomorrow itll settle

maybe try a new browser?

simple carbon Aug 26, 2025, 2:17 PM

#

echo aurora maybe try a new browser?

oh yeaa

#

inogdeto

#

incog

keen beacon Aug 26, 2025, 2:18 PM

#

shadow jewel BRO WHAT PLEASE SEND THE PROMPT 🙏

make this guy surprised, use chinese confetti for surrising effect and make his eyes big star eyes

keen beacon Aug 26, 2025, 2:18 PM

#

spare rune wait genuinely?

yeh

shadow jewel Aug 26, 2025, 2:19 PM

#

keen beacon make this guy surprised, use chinese confetti for surrising effect and make his ...

thats crazy... I never expected ai to be this good that quicklöy

#

I thought it was gonan be a crazy ass propt

echo aurora Aug 26, 2025, 2:19 PM

#

keen beacon make this guy surprised, use chinese confetti for surrising effect and make his ...

Be sure to share in #nano-banana

simple carbon Aug 26, 2025, 2:19 PM

#

ahhh lets goo its working for me now

#

i had to try on 7 tabs and inogdeto but still

restive swan Aug 26, 2025, 2:20 PM

#

echo aurora Be sure to share in <#1406720250778615868>

just ent a priv message fyi

echo aurora Aug 26, 2025, 2:20 PM

#

restive swan just ent a priv message fyi

okay I'll get to it when I can 👍

restive swan Aug 26, 2025, 2:20 PM

#

ty

stray aspen Aug 26, 2025, 2:22 PM

#

Holy

#

Nano banana was revealsd

simple carbon Aug 26, 2025, 2:23 PM

#

stray aspen Nano banana was revealsd

yea its really fast too

#

perfect image editor

dusty cedar Aug 26, 2025, 2:23 PM

#

https://tenor.com/view/monsters-inc-sully-what-gif-15100274

Tenor

obsidian cargo Aug 26, 2025, 2:23 PM

#

only getting "Something went wrong with this response, please try again." results 🙁

warm hare Aug 26, 2025, 2:23 PM

#

hi

simple carbon Aug 26, 2025, 2:23 PM

#

obsidian cargo only getting "Something went wrong with this response, please try again." result...

try on indego mode and multiple tabs

dusty cedar Aug 26, 2025, 2:24 PM

#

Eeeeveryone is jumping on that

stray aspen Aug 26, 2025, 2:24 PM

#

It's greater than all that crap that takes forever and is not as good

#

Like gpt image

drifting elk Aug 26, 2025, 2:24 PM

#

Yup

simple carbon Aug 26, 2025, 2:24 PM

#

stray aspen Like gpt image

gpt has censor issues like a mf

#

otherwise its very good at following prompts

obsidian cargo Aug 26, 2025, 2:24 PM

#

its a pretty crazy leap too, gemini 2.0-flash was one of the worst image models on LMArena

simple carbon Aug 26, 2025, 2:25 PM

#

obsidian cargo its a pretty crazy leap too, gemini 2.0-flash was one of the worst image models ...

actually one of the first editors i used

#

nvm i think that was 1.5

fossil fable Aug 26, 2025, 2:25 PM

#

obsidian cargo its a pretty crazy leap too, gemini 2.0-flash was one of the worst image models ...

well, it wasn't made for generation well at all

obsidian cargo Aug 26, 2025, 2:25 PM

#

sucks at cyclopes though. gpt-image-1 is the best at those for now.

simple carbon Aug 26, 2025, 2:26 PM

#

obsidian cargo sucks at cyclopes though. gpt-image-1 is the best at those for now.

bad prompting

obsidian cargo Aug 26, 2025, 2:26 PM

#

bruh...

fossil fable Aug 26, 2025, 2:28 PM

#

autoregressive image-out multimodal models are the best thing to happen to image gen since its origin

simple carbon Aug 26, 2025, 2:30 PM

#

is this gonna be available on gemini itself

grand patio Aug 26, 2025, 2:37 PM

#

Nano bananas... Super dope

fleet lintel Aug 26, 2025, 2:38 PM

#

it feels like decades since the nano-banana came out.. when and what is the next mdoel launch. 🤣

rustic knot Aug 26, 2025, 2:39 PM

#

obsidian cargo sucks at cyclopes though. gpt-image-1 is the best at those for now.

lol

obsidian cargo Aug 26, 2025, 2:43 PM

#

tbf it was a test I didn't want anything specific besides "ONLY ONE EYE YOU IDIOT"

#

bad prompting would be some convoluted 500 word json file

polar niche Aug 26, 2025, 2:44 PM

#

@echo aurora Something went wrong

#

API issues?

golden ocean Aug 26, 2025, 2:46 PM

#

where do the json prompts come from

echo aurora Aug 26, 2025, 2:53 PM

#

polar niche <@283397944160550928> Something went wrong

?

#

What seems to be the problem?

obsidian cargo Aug 26, 2025, 2:54 PM

#

echo aurora What seems to be the problem?

echo aurora Aug 26, 2025, 2:55 PM

#

obsidian cargo

How often are you getting this?

obsidian cargo Aug 26, 2025, 2:56 PM

#

every time I hit that retry

#

well except this time, but its taking a while

restive swan Aug 26, 2025, 2:58 PM

#

maybe the model doesn't like you

obsidian cargo Aug 26, 2025, 2:58 PM

#

nevermind it failed

#

maybe

restive swan Aug 26, 2025, 2:58 PM

#

try patting it on the head

obsidian cargo Aug 26, 2025, 2:58 PM

#

I don't think it likes Gergoth

misty vault Aug 26, 2025, 2:58 PM

#

tf

restive swan Aug 26, 2025, 2:58 PM

#

like give it some pictures of cute kittens to calm it down

#

you monster

clear herald Aug 26, 2025, 3:14 PM

#

bro assistant B been generating for 300 seconds

obsidian cargo Aug 26, 2025, 3:21 PM

#

refresh the page it probably failed

leaden palm Aug 26, 2025, 3:23 PM

#

i remember when reaching 1M or 2M all time battles was a milestone

#

wild model

keen beacon Aug 26, 2025, 3:24 PM

#

leaden palm i remember when reaching 1M or 2M all time battles was a milestone

based and deserved

echo aurora Aug 26, 2025, 3:24 PM

#

clear herald bro assistant B been generating for 300 seconds

Is the image appearing as just black screen?

clear herald Aug 26, 2025, 3:24 PM

#

echo aurora Is the image appearing as just black screen?

its still generating

#

like i open other chats instead because since its still generating it wouldnt allow me to enter new prompts

#

and every time i go back it resets to 0 and keeps counting

clear herald Aug 26, 2025, 3:35 PM

#

echo aurora Is the image appearing as just black screen?

also yes

#

double problem

glass copper Aug 26, 2025, 3:40 PM

#

@echo aurora I've posted over 100 issues to "feedback" threads and nothing ever happens, it's where ideas go to die. Can you please just get it done?

Un-nest the Leaderboard, and give it its own page. Add tabs to the top of the page, so we can switch between the different Tests. When we're on the Leaderboard, we should be able use the browser's native scrollbar to scroll up&down. As it stands, there is no scrollbar and we can't even use the (invisible/non-existing) scrollbar inside the nested table. You can't see where you are, and you have to use the mouse wheel, inside a little nested window, inside of a page. It's a nightmare to use.

#

This change is very obvious and easy to interpret

fading rover Aug 26, 2025, 3:42 PM

#

I don't know what is going on but gpt model on chatgpt image generation over it's website product more quality detail than in lmarena they mostly differ the quality from lmarena website ..

glass copper Aug 26, 2025, 3:42 PM

#

The UI is so bad that I don't even load the LMArena right now, I have to download the page with my AI and have it re-display the contents in a new table

sullen quest Aug 26, 2025, 3:42 PM

#

wat

zenith cape Aug 26, 2025, 3:43 PM

#

Why can't I farm any nano-bananas today? Did it get banned？

glass copper Aug 26, 2025, 3:45 PM

#

@echo aurora Look, this is actually an F-tier, close to 0 out of 10 for design

#

Hard to do worse than this

#

you also threw half the pixels in the garbage

#

half of the page rendering area is wasted...and then we're pressing "Ctrl+F" to scroll down inside a nested table, meanwhile not being able to see where you are. That's an F

#

and still couldn't afford a scrollbar

keen beacon Aug 26, 2025, 3:47 PM

#

Just wish i don’t need to upgrade super Grok…

#

😔

obsidian cargo Aug 26, 2025, 3:48 PM

#

zenith cape Why can't I farm any nano-bananas today? Did it get banned？

its not a stealth model anymore, its gemini 2.5 flash preview

echo aurora Aug 26, 2025, 3:59 PM