#general


ocean kindle
#

gpt image 1

hard drift
#

Can someone please mute this guy @echo aurora

hard drift
#

I mostly use for making app previews, so need something that can do 9:16 high res

#

The Web and Apps LM Arena is not working; it says: “Failed to create sandbox”

Is anyone else facing the same issue or knows a solution? I’ve been trying for a few hours.

proper spindle
#

is there a way to make images which are not 1:1 format on lmarena?

hard drift
hard drift
proper spindle
#

doesn't really work with seedream unfortunately 🙁

hard drift
#

Yeah it doesn’t work with Seedream and GPT Image

#

I don’t know but I am worried about the lmarena owners 🙂

I don’t know how they gonna monetize it, hope they don’t burn their own money lol 😁

Thanks for the best AI tool, it is going to replace all other tools

potent snow
#

Will we ever be able to upload and edit pictures in side by side mode?

echo aurora
echo aurora
#

It's an easy thing to miss

glossy umbra
#

But also gives tenfold than the side effects

#

chat with anyone at km, play , work,etc

#

even use lmarena

#

😅

neat apex
#

Does the relative success of Gemma Vault mean they will create a 10 trillion parameter model using all YouTube data ever?

whole swallow
#

So do we have any news about sonoma models? Are they from grok or who?

#

And which one do you guys prefer? Sky or Dusk

#

I feel like Dusk got a little smarter lately

glacial dock
#

Today’s word of the day is quack, I will now use it in a sentence.

What duh quack ?

#

thank you

vital lake
#

Do you guys get paid for introducing a code-name LM?

sullen sand
#

So are you saying that LMArena is going to become something you have to pay to use?

#

Or I am crazy right now

torn bison
#

Can we get gemini 3 pro before november?

worn schooner
#

hey people, has anyone here already created a virtual influencer? i would like to know how to create one!

vital lake
#

Tbh

#

All the big jumps come from OpenAI models (o3, o1)

#

The other companies just trying to catch up

eager crag
#

i am sorry for using only direct mode... i kinda feel bad now.

keen beacon
#

I think that you will eventually turn the Arena into a platform where users would be able to use all features of all models they want for free, in exchange for their data used for training, and if they'd want some privacy, they'll have to pay for it themselves.

echo aurora
keen beacon
wicked dome
#

Ai

#

Wsp

proud hazel
keen beacon
neon idol
keen beacon
#

Which I believe is true for 99% of users

proud hazel
neon idol
keen beacon
neon idol
#

I see that they always keep the site up to date with the latest models

keen beacon
proud hazel
echo aurora
keen beacon
proud hazel
neon idol
#

For me it's really good because I use LMArena on the computer, on mobile and on the tablet

proud hazel
proud hazel
#

Well, since they're unlikely to change it back, I'll just have to accept it.

neon idol
#

But what is the problem with just signing in with a Google account?

proud hazel
stiff glacier
#

Why is he lying?

#

Deepscam

proud hazel
stiff glacier
#

And this?

proud hazel
stiff glacier
remote arrow
#

🍌

ocean vortex
remote arrow
ocean vortex
# remote arrow

A platform where you can use battle mode to test the models and then have 5 messages after voting for the actual testing independent of the voting with the models already revealed, or pay a fee to use all models in direct chat mode with only fair-use limits. Just my 2 cents.

#

I think that would actually drive engagement and help everyone long-term.

neon idol
keen beacon
ocean vortex
barren prairie
#

It's not new; it was there a month ago, and there are flashcards and visual things if you noticed, plus some podcasts. Now ChatGPT has the same feature too. But using canvas for those things is better than those quizzes.

hardy lion
vital lake
lone vector
#

do you guys think OceanStone is gemini 3.0

remote arrow
keen beacon
#

I do not believe that they can't reason - rather I believe that they reason in their own, mysterious, non-humanlike ways

calm lagoon
#

Friends, what does everyone think about Reve? The image generator, is it better than Nano Banana?

pulsar walrus
#

hi!!!

true oracle
#
poll_question_text

Was GPT 5 a disappointment

victor_answer_votes

5

total_votes

12

victor_answer_id

1

victor_answer_text

No

echo aurora
vital lake
#

What I'm saying is the weights weren't improved; it was just trained to try and act like, for example, o1, but wasn't actually that good at answering things at its base. This is especially bad if it doesn't have enough fine-tuning data for other topics. The only reason R1 was considered "good" was because it was the second reasoning model ever and it was next to o1 in coding.

eager crag
#

hey pineapple! i want to give my thanks to you for giving free access to models! i will donate when i'll have some money!

vital lake
#
poll_question_text

Is GPT 5 Pro just marketing or actually better

victor_answer_votes

4

total_votes

4

victor_answer_id

2

victor_answer_text

No

toxic egret
#

what's the best model for working with math and LaTeX?

safe sleet
keen beacon
eager crag
#

i'd still wait for innovation to do its thing before i get to use image generation or editing AI seriously.

#

one year ago, i asked flux inpaint to edit out a door from a photo and replace it with a simple white wall, and all it did was put some jumble of black lines in its place.

#

i don't know if it's a workflow i downloaded, or it's the model, but this made me think i should wait for it to be ready.

pine sorrel
#

please bring back image upload for Claude models

vital lake
#

Everyone who said yes never used it

gentle breach
burnt sinew
#

gemini 3 will probably release in around 45 days

#

according to polymarket at least

candid nest
#

when i go to type in a prompt it won't let me type anything, not even /

ocean vortex
#

With their track record you can kinda expect anything lol. They can be doing 3 new mystery models and then silence for weeks

ocean vortex
# gentle breach

If you have spent $200 you are unlikely to say it's not worth it at least for the duration you are paying for it... catgrin

jade egret
#

but it rlly depends

#

ig

remote arrow
#

🍌 + 🥤

supple vector
#

lmarena

keen beacon
#

Probably from a breathing thing at night

burnt sinew
mortal coyote
#

nano banana can't follow instructions - it just returns the same image it created, rather than applying the prompt given to edit the image

#

on LMarena

#

works fine on Ai studio GOOGLE

strong basalt
#

@primal solstice wen

quiet bramble
#

hi all...just joined

echo sinew
remote arrow
#

This server really needs a "welcome" or "intro" channel.. 😏

hard jolt
#

hi ... just joined 🙂

earnest rover
#

Why have to do these. Remember we did it and they gave us a rate limit.

abstract zephyr
#

Seedream-4-high-res is unavailable since yesterday, is it a common maintenance ? Where can we check the status of models when they disappear from the models list ? 🤔

brisk token
#

#make a video

stuck plaza
#

New here

deep nest
#

Hi

storm sphinx
#

hello

vital lake
visual chasm
#

is gemini currently down ?

vital lake
# vital lake

Just so yall know, if it doesn't use Tree-of-Thoughts that GPT 5 Pro uses, then it wont be better. https://arxiv.org/abs/2305.10601

robust yoke
robust yoke
robust yoke
vital lake
keen beacon
#

I need unlimited free or cheap site like google ai studio

sour saffron
#

Guys
Is it possible to share a Convo from lm arena?

robust yoke
robust yoke
robust yoke
vital lake
stiff siren
#

I have a question: how can I use the Seedream 4 model?

robust yoke
#

You can use it on LM Arena.

patent hornet
#

hi there, how can i find out about the latest models?

robust yoke
#

Hmm...

patent hornet
#

i mean, how can i use the Arena Battles App like in this screenshot?

#

is a kind of "alert system" or anything like that, isn't it?

robust yoke
#

I think so.

knotty tiger
#

hello

polar kiln
#

hello

hardy lion
random canyon
#

Does Claude Opus 4.1 have a maximum context of 16,000 on the LM Arena?

vapid hill
#

🦴

alpine elbow
#

hello training to learn

finite verge
#

HELLO

proven lava
#

Hello

proud hazel
#

Heylow

whole furnace
#

helooo

elfin epoch
#

HI

cloud marten
#

what is the best AI tool for making thumbnail

wooden gust
#

⚡ “Endless Waiting in Model Comparison: Still No Results!”

“I have been trying to compare the two models for quite a while, but I still haven’t received any results. This long delay is quite frustrating. Please look into this issue seriously and ensure faster result delivery.”

ocean vortex
verbal nimbus
verbal nimbus
abstract zephyr
#

Can anyone confirm whether you have access to Seedream-4-high-res right now in the models list? 🤔

sly lily
#

Hey everyone, has anyone used AI for digital retouching on products? I'm trying to enhance a flat packshot into something close to a retouched studio photo, but the results aren't getting there. I've been using Nano Banana and fixing small errors in Photoshop.

#

Any suggestions or prompts?

bitter lotus
#

Whenever I generate images of Indian women using the Seedream 4 high-res model, the output always has the same woman's face, even though I haven't uploaded any reference image. How do I fix this?

noble quest
#

Hi, is there a way to use the video arena privately, without others being able to see the results?

abstract zephyr
#

@bitter lotus is seedream 4 high res working for you now ?

stone oxide
#

yea the LMArena personal tracker browser extension turned into a mess of a "project" and the AI can't even fix click-through issues.
On top of that, the extension managed to absolutely wreak havoc on all other extensions, for example my youtube downloader uh... yea the two extensions are having some sort of weird UI battle.

#

Time to laugh about it and come back in another half year when AI is at the point it doesn't do stuff like that anymore xD

fierce leaf
#

Hello

robust yoke
#

Greetings.

robust yoke
#

Maybe Qwen could fix it.

tropic nova
#

Hello

robust yoke
#

Greetings.

abstract zephyr
#

@bitter lotus I don't know why it's not in my list anymore since yesterday. Also, to fix your problem of having the same lady on your picture, click top left icon and click > new chat, if you restart from scratch the AI won't use the old pictures

bitter lotus
abstract zephyr
#

that's creepy, maybe try from a new account if you can switch for just a test ?

robust yoke
#

Some issues are browser-specific. If you switch to a different browser, that should likely resolve the problem you're currently experiencing.

bitter lotus
ocean vortex
sullen quest
#

gpt o5o... plus

upper steeple
#

How to use LMArena to generate videos

echo aurora
tiny crow
#
poll_question_text

Can we trust AI text detectors?

victor_answer_votes

6

total_votes

11

victor_answer_id

2

victor_answer_text

Absolutely not

barren prairie
#

Is there any Gemini3 or Gemma on arena now ??

#

Anonymous

jovial mulch
#

Sup

hot anvil
#

how many videos can i generate in a day?

echo aurora
echo aurora
barren prairie
echo aurora
barren prairie
#

Logan now said " I am a large language model trained by Google" so something must be dropped into arena... I HOPE it is not the useless Gemma

light stump
#

🔥

prisma girder
#

hello

dense dew
#

hii guys, does anyone know how to create a prompt for generating an image like this?

mellow frigate
#

Here you go:

close-up of a model wearing a colorful galaxy mask and crystal helmet, with metallic pink and pastel glitter in the style of haute couture fashion. photographed for the cover of vogue magazine, featuring gold chain necklaces, with a futuristic vintage aesthetic, 2057.

terse dagger
#

Hello

brave orbit
dense dew
tiny crow
#
poll_question_text

Is AGI possible by using current LLMs architecture?

victor_answer_votes

7

total_votes

10

victor_answer_id

3

victor_answer_text

Maybe

keen beacon
# brave orbit

Codex for the price is a much better tradeoff than Opus

#

@echo aurora do you remove stealth models off the arena sometimes?

#

Can't meet Raptor again, met it thrice yesterday!

echo aurora
keen beacon
hot anvil
#

how can I generate vertical images on lmarena website?

echo aurora
echo aurora
hot anvil
#

so its always 1:1?

mellow frigate
robust yoke
sudden socket
#

hi

molten solar
#

hii

keen beacon
#

Are multiple unrelated problems in one prompt more difficult for an LLM than one problem per prompt?

robust yoke
robust yoke
robust yoke
#

That is, unless the LLM is insanely talented and is able to handle multiple problems, like Claude, for example.

robust yoke
keen beacon
#

yes, that claude that is insanely talented at being bad at everything except coding where it exceeds gpt 5 a microstep for 100x increase in cost.

prime mulch
#

I would say Seedream 4 high-res first and Gemini second

robust yoke
#

True.

winged mauve
prime mulch
#

Does it look AI-generated?

robust yoke
#

For editing, it is on par with ChatGPT.

surreal creek
#

Just realized I’ve been reading it this whole time as Seed Ream and not See Dream lol

robust yoke
#

For actual generating, it's on par with ChatGPT.

mellow frigate
prime mulch
prime mulch
winged mauve
robust yoke
#

Somewhat.

prime mulch
mellow frigate
#

Looks pretty nice. i didn't know motorcycles can have chains

winged mauve
prime mulch
#

A chain is a must

#

Without a chain, how would it run?

robust yoke
#

The motor. 🤓

mellow frigate
#

Yeah im searching it now, looks like they actually do have chains. never knew

prime mulch
robust yoke
prime mulch
#

All hail seedream 4 highres

neon idol
#

Hello

remote arrow
#

Oh soda dream.. 🥤

ocean vortex
tribal wedge
leaden sun
robust yoke
#

Does that mean Otaku is correct, or am I correct?

#

Or are we both correct?

glossy umbra
astral meadow
#

yo guys new here
wanted to know why is Midjourney not ranked at a good spot in the leaderboard, have heard that its one of the best text to image models out there.

astral meadow
novel nimbus
#

hello!

brazen coral
#

Guys is there a buh

#

G

rancid orbit
#

what do people use to upscale and unblur messy images

echo aurora
brazen coral
#

Stay loading

echo aurora
frosty berry
#

Does anyone have a prompt to do a face swap with nano banana?

sterile isle
proud hazel
willow grail
#

did i just read on reddit that gemini 3 is on lmarena?

neon idol
willow grail
#

OceanStone, NightRideOn, NightRideOnV2, SkyTrail

neon idol
#

I'll dream about this tonight

willow grail
#

possible gemini 3 names on lmarena

exotic tartan
#

Is it only me or is 4o better than GPT-5? Just today I had 2 examples where 5 would go on and on and miss the point entirely, while 4o got the right answer while being concise and not responding in 6 bullet points every time. I even double-checked the same prompt and got the same results to make sure it's not a fluke. The f is going on?

willow grail
#

actually gpt 3.5 is much better than 4o

#

i swear by my...

exotic tartan
#

No, it was really bad and 4o is an obviously better model

willow grail
#

no gpt 2 is better

exotic tartan
#

With 5 it's not that obvious in many cases

vital lake
#

Ragebait

vital lake
exotic tartan
#

I'm using the normal desktop agent, Pro tier with GPT-5 on Auto

#

Switching between that and 4o. 4o kicked its ass multiple times today.

#

I have 0 reason to simp for 4o, I'm not one of those weirdos who hate the change in character or whatever. I just want good answers

#

I just felt like 5 is being a dumbass lately, so decided to switch back to verify since I didn't remember it being so off

sick sedge
#

hi every body

empty stump
robust yoke
#

Claude can also give good answers, I believe. It's also pretty good at natural creative writing and helping with getting accurate information.

#

But to each their own, I suppose.

remote arrow
# remote arrow
poll_question_text

Future of LMArena

victor_answer_votes

4

total_votes

5

victor_answer_id

1

victor_answer_text

Nightcafe

barren prairie
neon idol
robust yoke
neon idol
robust yoke
#

Doing good, and you?

neon idol
robust yoke
#

My pleasure.

neon idol
#

:)

#

Waiting for some memes on nano banana or soda dream 😞

robust yoke
#

That's fair.

#

In that case, I could create a naner-bananer meme.

neon idol
robust yoke
neon idol
robust yoke
#

It's a way of saying “Okay”.

neon idol
#

Thx

remote arrow
#

Memes are forbidden in this channel, you'll get a hot slap..

neon idol
#

Pineapple with us 😞🥀

remote arrow
#

🤣 🤣 🤣

neon idol
#

Omg these emojis remind me of George Orwell's Big Brother, idk why 😂😂

remote arrow
#

I only post on #ai-creations when I have time and idea. Only come here to read random HI and HELLO..

remote arrow
#

I keep wondering why the name is GENERAL, while it's not allowed for general talks. Said it must be "AI related only," yet it's mixed with so many HI, HELLO, and I AM NEW. Or "Can you help me to make my photo smile?"

remote arrow
robust yoke
neon idol
#

Always pineapple for revenge

remote arrow
#

Hi. I am not new.. 👋

neon idol
robust yoke
#

Hi, I like to mew.

#

In fact, I'm mewing right now.

remote arrow
robust yoke
#

🤫🧏‍♂️

remote arrow
#

Just exploring. And I am wondering why there are so many people posting prompts. In almost every channel. Even the leaderboard channel..

neon idol
remote arrow
neon idol
remote arrow
#

... ... ... ... but ... why?

thick moss
#

Hello world! I'm new to the internet, what's there to do around here?

neon idol
#

...

#

........

#

@remote arrow no sooner said than done

remote arrow
#

...My biggest accomplishment today: I can show my family that I can type on a keyboard...

remote arrow
robust yoke
#

Heh.

toxic egret
#

what's the difference between gpt-5-high-new-system-prompts and gpt-5-high? or more precisely, what does "new system prompts" mean?

remote arrow
#

There: your banani and dreamy..

remote arrow
robust yoke
#

And “sycophantic” means to be overly supportive.

remote arrow
robust yoke
vital lake
# vital lake
poll_question_text

Was o1 Pro way better than regular o1?

victor_answer_votes

6

total_votes

10

victor_answer_id

2

victor_answer_text

Yes

hollow stag
#

hey

gentle breach
# gentle breach
poll_question_text

Do you think the "ChatGPT Pro" subscription is worth the investment?

victor_answer_votes

9

total_votes

19

victor_answer_id

4

victor_answer_text

I don't have it and I don't want it

polar niche
#

@echo aurora What happened to my bugs thread?

echo aurora
polar niche
#

No, because they are still not generating

#

I thought it was a generation issue.

echo aurora
still spire
#

hello

echo aurora
mortal star
#

😎

rocky stream
#

Hi

#

I am here to try out video creation

echo aurora
elfin sundial
#

hey good ppl first here

alpine star
#

Hello mate

stuck drum
#

Hello. Looking forward to trying new AI platforms.

hot pelican
#

Hey guys, why has the leaderboard stalled, especially since August? updates every 2 weeks?

#

is lmarena intentionally slowing down

dusty needle
#

Hi, here to try different LLMs.

surreal creek
polar marlin
#

Image creation seems slower than before.

hot pelican
#

@surreal creek Yeah, I have almost lost the habit of visiting the leaderboard, as it is always the same.

#

I hope there is a way to remind the team of the bad effects of this

#

@tiny palm are updates planned to be slower and slower?

surreal creek
#

I don’t think it’s a net-negative thing. Batch estimation of Bradley-Terry scores with millions of votes is pretty computationally intensive, so it’s not really viable to have a real-time leaderboard; the point is to have it up to date with the latest models
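
For anyone curious what "batch estimation of Bradley-Terry scores" roughly involves, here is a minimal sketch. It assumes votes arrive as (winner, loser) pairs with no ties, and it uses the classic minorization-maximization update; this is not necessarily what LMArena actually runs, and `fit_bradley_terry` plus the model names are made up for illustration.

```python
from collections import defaultdict


def fit_bradley_terry(votes, iterations=100):
    """Estimate Bradley-Terry strengths from (winner, loser) pairs.

    Hypothetical helper: returns a dict mapping each model to a
    strength, normalized so all strengths sum to 1.
    """
    wins = defaultdict(float)                        # total wins per model
    games = defaultdict(lambda: defaultdict(float))  # games[i][j] = matches between i and j
    for winner, loser in votes:
        wins[winner] += 1.0
        games[winner][loser] += 1.0
        games[loser][winner] += 1.0

    models = list(games)
    strength = {m: 1.0 / len(models) for m in models}

    for _ in range(iterations):
        updated = {}
        for i in models:
            # MM update: wins_i / sum_j n_ij / (p_i + p_j)
            denom = sum(n / (strength[i] + strength[j]) for j, n in games[i].items())
            updated[i] = wins[i] / denom
        total = sum(updated.values())
        strength = {m: s / total for m, s in updated.items()}
    return strength


if __name__ == "__main__":
    # Toy data: model "A" beats model "B" in 7 of 10 battles.
    votes = [("A", "B")] * 7 + [("B", "A")] * 3
    print(fit_bradley_terry(votes))
```

The whole vote set has to be refit in one go whenever new battles come in, which is the "batch" part. Real leaderboards typically add tie handling and confidence intervals on top, which this sketch leaves out.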

hot pelican
#

Yeah, I was hoping it would be feasible to update at least every week

#

It also makes news and publicity for the leaderboard when models that overtake the top spot get announced by users and shared around, instead of missing such events and dumping everything in one long-delayed update along with new models.

#

I am sure a change was there with this week's update. But if the update is going to be combined with Gemini 3, it's another top position lockup by google

robust yoke
robust yoke
robust yoke
robust yoke
#

Very interesting.

hard quiver
#

Seedream 4 still has the restriction where the output is always a square image, which makes image editing, especially tasks where I want it to fill in a blank part of the image, turn out quite poorly. Are they going to change that restriction at some point?

robust yoke
#

Well actually, it seems like the aspect ratio is built into the actual model itself, not the site. So in order to change the aspect ratio, the people who made Seedream 4 would have to manually change what kind of aspect ratio it can output.

#

At least I think that's how that works anyway. I think you can specify what kind of output ratio it sets by creating a very detailed prompt as to what kind of aspect ratio you want coming out of it. I remember I made a prompt that was very detailed and specified a pretty wide aspect ratio, and it gave me a pretty wide aspect ratio upon generating the image.

hard quiver
robust yoke
remote arrow
robust yoke
#

Perhaps it simply might just be that the version they used has a default ratio of 1:1.

remote arrow
#

LMArena might have adjusted it for a lighter load. And it's fine to keep it the way it is now, because the main purpose is to test the quality.

robust yoke
#

Yeah, exactly. The model doesn't have to output a perfect thing; it's solely just for testing.

remote arrow
#

It's just like a bakery promoting a new snack and handing out free tester packs.

#

People who ask for the full pack for free might get marked with a big tattoo on their forehead.

robust yoke
#

True.

#

A big tattoo that says 'Freeloader' on their forehead.

remote arrow
remote arrow
unborn lantern
#

that is seedream 4 fal?

robust yoke
remote arrow
#

🤔 Who? Where? What?

robust yoke
#

Who, where, when, what, why, was?

remote arrow
#

How?

robust yoke
#

How, who, when, what, where, why?

remote arrow
#

And this makes me want to send another AI creation or a new meme.. 🤣

robust yoke
#

Heh.

spiral zephyr
#

Hello

remote arrow
robust yoke
robust yoke
remote arrow
robust yoke
chilly granite
#

hello

robust yoke
#

Greetings.

#

I hate to break it to you, Safeer, but I don't think this is the place for that.

obsidian shell
#

is the site down ?

robust yoke
#

Lemme check...

empty stump
#

it loads fine

robust yoke
#

Nah.

fiery leaf
#

Hi

remote arrow
#

I'm down. The site's up.

robust yoke
#

I'm up, the site's down.

remote arrow
#

ALERT!

dawn mortar
#

hi

jovial nova
#

hey friends i m new here kindly accept my greetings

dreamy lily
#

hello

vital lake
# vital lake
poll_question_text

Will Gemini Deepthink 3.0 be better than GPT-5-Pro?

victor_answer_votes

29

total_votes

33

victor_answer_id

1

victor_answer_text

Yes

tribal wedge
echo aurora
robust yoke
remote arrow
#

What's SodaDream 4 fal? Something modded? LORA embedded?

jagged ferry
#

He

slow beacon
#

Hlo

strange yacht
#

Hi everyone

astral meadow
#

yo guys new here
wanted to know why is Midjourney not ranked at a good spot in the leaderboard, have heard that its one of the best text to image models out there.

arctic gazelle
#

Hi everyone, where is gemini3?

crisp brook
#

hi

tacit wave
#

Hello. I want to create video for my educational youtube channel

echo aurora
dense fox
#

How do I use it? Two platforms like Veo3 and Seedream at a time?

vital lake
# vital lake
poll_question_text

Would you buy an add-on to something like the GPT Plus plan that gives you extra requests for a model like GPT 5 Pro without having to pay for Pro/Team plans? Like 5 per week for GPT 5 Pro if you buy the add-on.

victor_answer_votes

10

total_votes

18

victor_answer_id

2

victor_answer_text

No

leaden sun
valid flume
#

hi i want image to video

robust yoke
#

Neat.

robust yoke
robust yoke
#

Greetings, newcomers.

tulip grove
#

Some time ago, before logging in, all chats disappeared from my history. A few days ago, one chat “froze” during generation. I reported the bug, but it wasn’t fixed. While browsing the options, I accidentally deleted that thread. Is it possible to recover them? At least the one associated with my account?

vernal meadow
#

Why can you choose between Seedream 4 and Seedream-4-fal ? Are they the same?

fresh flint
#

Guys, send me some portrait prompts...

dapper nexus
#

Greetings from Kenya

fiery lichen
#

hi

brisk path
#

hello

honest ore
#

hello

worn folio
prime mulch
worn folio
#

Prompt content

tame pine
heady kernel
#

I have my logged-in chats on my PC. On my phone I have different chats and I'm not logged in. If I now log in to the same account from my phone, will the chats merge into the same account so that I can access them from both my phone and my PC?

limber wind
#

hi, i am new here, loving this platform. amazing work, thanks team for making this tool

fresh basin
#

hello everybody it's a fantastic tool here to create magic video

quasi sleet
#

Hey there its good to be here lads!!!

bitter lotus
#

What is the difference between seedream-4 & seedream-4-fal ?

limber saffron
patent sandal
#

Hello !!! I'm here to research video generators .... hope you're fine and happy

coarse geode
#

hello, came in to explore the great tools available here

lavish stratus
#

Hello guys Its a pleasure!!!

night fern
#

Hi! Hoping to find a suitable LLM we can host to replace our CAT tool

teal lion
#

hello everyone... i'm here to explore the best tools in the world

barren prairie
#

Not here

hollow imp
#

I didn't understand

chrome swallow
#

hello guys

sturdy mica
#

hello guys

lucid flint
#

I have just heard of LMArena! I should have known about it a long time ago

sturdy mica
acoustic remnant
#

Hello

keen beacon
#

Hi, does anybody know what seedream-4-fal version is compared to the high res and normal version?

naive ravine
#

Hello Gs!

opal stream
#

hey guys new to the community here to explore

keen beacon
rustic phoenix
#

heloo

verbal nimbus
#

Nightride-on is very likely a Gemini 2.5 Pro checkpoint. Their responses are almost verbatim (top is 2.5 pro).

#

I think it uses the Internet though, isn't that kind of cheating?

#

I think nightride-on = nightride but online.

atomic stream
#

Lmarena died?

#

I'm always getting errors. It worked just one time, I don't know how.

nimble vale
#

10-second vertical animation. A large bag of Kuriyama 54 Stars Rice Crackers opens in the center, releasing floating star-shaped crackers and mini packs. Koshio (a cheerful young boy in flight attendant-style outfit) happily makes a star gesture, while Mumu (a cute girl mascot with big round eyes) hugs a rice cracker and jumps with joy. Background with dreamy starry sky and soft pastel glow, family-friendly, heartwarming, shareable mood. No text.

solid beacon
#

#command

sterile sequoia
#

/videos

tame kiln
#

the person in the image as a reporter in a report about a tsunami

exotic gust
#

hi

prime isle
#

hello, i came here to create advertising videos.

willow grail
#

lets not forget google is the one who has money. not anthropic

#

@glossy umbra

glossy umbra
#

anthropic works harder, not smarter

#

because their infrastructure is good, yes, but it costs too much to maintain

willow grail
willow grail
remote idol
# willow grail

Well, for creativity and design I choose Opus, but Gemini 3 is good as well.

willow grail
#

thats why u cant just look gem 2 to 2.5

remote idol
#

Is there a possibility that it might be better at design?

fallen hemlock
#

Helloooo

remote idol
willow grail
stuck storm
#

hmm does gpt-5-high or gpt-5-high-new-system-prompt error out for some more intense questions (that it must spend more time to think over)

#

take this one for example, a random slightly less solved project euler question:

#

A triplet of positive integers $(a, b, c)$ is called a Cardano Triplet if it satisfies the condition:
$$
\sqrt[3]{a+b \sqrt{c}}+\sqrt[3]{a-b \sqrt{c}}=1
$$

For example, $(2,1,5)$ is a Cardano Triplet.
There exist 149 Cardano Triplets for which $a+b+c \leq 1000$.

Find how many Cardano Triplets exist such that $a+b+c \leq 110000000$. Your solution must be in Python.
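
For reference, a minimal brute-force sketch that only checks the small bound; it is not a solution to the full problem. It assumes the usual algebraic reduction (cube both sides and simplify), which turns the condition into $(a+1)^2(8a-1) = 27b^2c$ and so forces $a = 3k+2$ with $b^2c = (k+1)^2(8k+5)$. The function name `count_cardano_triplets` is made up for illustration.

```python
def count_cardano_triplets(limit):
    """Count positive-integer triplets (a, b, c) with a + b + c <= limit
    that satisfy (a + 1)^2 * (8a - 1) = 27 * b^2 * c (assumed reduction)."""
    count = 0
    # Only a = 3k + 2 makes the left-hand side divisible by 27.
    for a in range(2, limit, 3):
        k = (a - 2) // 3
        target = (k + 1) ** 2 * (8 * k + 5)  # this must equal b^2 * c
        # Since c >= 1, b can be at most limit - a - 1.
        for b in range(1, limit - a):
            square = b * b
            if square > target:
                break
            if target % square == 0:
                c = target // square
                if a + b + c <= limit:
                    count += 1
    return count


if __name__ == "__main__":
    # Small bound from the problem statement.
    print(count_cardano_triplets(1000))
```

For the bound 1000 this should reproduce the 149 quoted above. The double loop is far too slow for 110000000, so the full problem needs a smarter enumeration of the factorizations of $b^2c$.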

remote idol
stuck storm
remote idol
#

If only they made an operator that has an agent that could work on a VM and execute instructions like ChatGPT Operator then that would be amazing

#

Could Gemini 3 have been capable of that?

#

If so, then Gemini 3 is the best AI in the history of the world, even better than local models that run up to 500B parameters.

stark hearth
#

9:16 format. Animated infographic video style. A cartoon electricity meter spinning rapidly, exaggerated motion. Smooth glowing energy lines flow outward from the meter into the air, symbolizing wasted electricity. Continuous dynamic motion with gentle pulsing glow effects. Eco-friendly colors: greens, blues, soft yellows. Minimal clean background, modern and uncluttered. Duration: 6 seconds, seamless loop, smooth camera zoom-in toward the spinning meter.

remote idol
quasi atlas
quasi atlas
remote arrow
#

🤣

remote idol
remote arrow
#

It's always hilarious to me, any time I check this server and find all the channels flooded with prompts..

ocean vortex
# willow grail

They probably figured they gonna push 2.5ultra forward into 3.0 version stage and release them all at once 🧐

remote idol
ocean vortex
#

I suppose it makes sense for naming. Would have been weird to have 2.5ultra with no 2.0ultra ever being there

remote idol
dusky pier
remote idol
#

Windows 12?

remote arrow
tight spire
#

Hello, I am here to compare ai tools

leaden sun
# willow grail

I guess Anthropic has completely lost the race now, hasn't it? time for them to consider a holistic restructuring if they want to survive...

steel pecan
#

Hello, I am here to compare ai tools

leaden sun
neon idol
#

What is the difference between the normal seedream 4 and seedream 4 fal?

loud junco
#

Hello. Here to compare ai tools

ocean vortex
#

gpt5-mini is not really better than o3 etc

barren prairie
pine knoll
#

hi

barren prairie
ocean vortex
barren prairie
ocean vortex
#

Hence why I said it may compete on most benchmarks, but can't realistically replace one gen older bigger model

weary cedar
#

Hi

robust yoke
echo aurora
#

Good timezones ablobwave

wintry tinsel
wintry tinsel
sinful ember
#

Hi all

tall summit
#

Hello, I am here to compare ai tools

polar niche
#

Just like all of us lmao

floral comet
#

Gpt 5 high new system prompt? Is it considered to be better than the original gpt 5 high?

remote idol
ocean vortex
viral notch
#

gemini 3 is out? is it on lmarena now?

remote arrow
remote idol
ocean vortex
#

The Flash?

#

🤣

leaden sun
maiden dune
#

hi

elfin oak
elder burrow
ocean kindle
elder burrow
ocean kindle
#

?

elder burrow
#

what phantom ai

ocean kindle
#

phantom ai

elder burrow
#

what is it

#

??

ocean kindle
#

it is phantom ai

elder burrow
elfin oak
#

it seems from Apple

elder burrow
elfin oak
echo aurora
slow hare
#

Hello all, fairly new to AI. In a night class and looking to work with midjourney, discord, Higgsfield and LMarena to create. Looking forward to all the knowledge here.

echo aurora
wicked sage
#

hello!! i found lmarena.ai because one of the members from a server called "websim" said something about it!

#

safe to say, its amazing

solid brook
elder burrow
#

ok well

#

lol 😭

#

DUDE!!!

#

found a way to differentiate between flash and pro, I guess 🤣

#

WHAAAAAHAAHAHAHAHAHWHAHWHA

#

mb for choosing google, I'll press tie for my next tests

prime mulch
elder burrow
elder burrow
elder burrow
elder burrow
vital lake
#

A

#

nvm

#

LOL

#

I'M STUPID

#

😭 🙏

elder burrow
#

😭

#

safe to say, oceanreef is gemini 3 pro - responds with "Google" consistently, flash models never respond properly.. lol

#

actually.. both oceanreef and oceanstone

wicked sage
#

anyways whats going on rn

elder burrow
wicked sage
#

evil...

#

wait question

remote arrow
wicked sage
#

are there actually like released unreleased models that arent listed

elder burrow
#

so, lmarena gets exclusive access to "stealth" models from ai companies so they get feedback before releasing it

wicked sage
#

wtf is oceanstone

elder burrow
kind lava
#

Hello

wicked sage
elder burrow
wicked sage
#

thats sad

elder burrow
#

its ok

wicked sage
#

but also funny at the same time

prime mulch
wicked sage
#

whats the best ai for coding web games?

#

i tried out making a tycoon game with chatgpt 5 high and it turned out great

#

i liked the ui and it included dark mode only

#

and the game was interesting

#

oh thats why

#

text wall

surreal creek
elder burrow
remote arrow
wicked sage
#

another question

whats the most up to date bot

#

i mostly used chatgpt 5 chat and i recently found out that its from 2024, im kinda stupid how i found out this late ngl

prime mulch
wicked sage
prime mulch
remote arrow
wicked sage
#

i honestly couldnt care really

#

icl

remote arrow
prime mulch
remote arrow
tranquil anvil
#

guys so i just created an image to video prompt but it doesnt have the sound, why is that?

viral notch
#

but does oceanstone code well?

prime mulch
wicked sage
#

we dont know yet since its basically battle mode only

viral notch
#

gemini 2.5 pro is decent but 4.1 opus and gpt5 are better

prime mulch
wicked sage
remote arrow
tranquil anvil
tranquil anvil
#

is there any way to force it tho or you cant?

prime mulch
viral notch
wicked sage
#

is there a way i can make a .lrc creator

prime mulch
tranquil anvil
#

so its basically just luck

#

right?

viral notch
#

like gpt5 and 4.1 opus nailed the tasks

remote arrow
# tranquil anvil oh ok

No. It depends on the generator. Veo3 can make sound, but others like RealMotion can't. The best workaround: manual dub.

static jolt
#

/video At one edge of the village lived a little boy named Arav. He lived with his grandparents. Arav was very restless and curious. Every day, while playing in the fields and gardens, he would discover something new.

One afternoon Arav was sitting under a mango tree. Suddenly, out of nowhere, a little monkey appeared. At first Arav was scared, thinking the monkey might attack him. But to his surprise, the monkey picked up a mango lying on the ground and held it out to him.

Arav laughed in amazement. Then he took a biscuit out of his pocket and gave it to the monkey. From that day, their unusual friendship began.

Arav named the monkey "Monty". The two met every day. Monty taught him to climb trees and sometimes plucked coconuts for him to eat. Arav in turn brought him fruit and bread. Seeing this unusual friendship, the whole village was abuzz. Everyone wondered: "How did a little boy and a wild monkey become such good friends?"

One day there was a fair in the village. Arav went to the fair and got lost in the crowd. He began to cry in fear. Just then Monty leapt down from a tree branch. He took Arav by the hand and led him safely back to his grandparents.

That day the villagers understood that this friendship was not just play, but was built on real trust and love.

The unique friendship of Arav and Monty taught everyone:
Love and understanding know no boundaries of language or species. Humans and animals can be true friends to each other too.

prime mulch
wicked sage
#

DUDE

wicked sage
#

stop it with the textwalls please

wicked sage
tranquil anvil
#

also thanks @remote arrow

remote arrow
wicked sage
#

or more

#

idrk

#

ill brb

remote arrow
elder burrow
#

??? how

tranquil anvil
#

Thanks a lot

elder burrow
#

I didn't give it today's date btw

wicked sage
#

what is nightride v2

elder burrow
#

it responded almost instantly

wicked sage
#

how are you guys finding this

wraith cliff
#

guys is anyone good at prompts 4 veo 3

wicked sage
#

thats MY question

echo aurora
#

Hey checking in - are others seeing model response seem very slow right now just in Battle? Eventually they'll respond, but taking noticeably longer.

elder burrow
#

its inconsistent

wicked sage
wicked sage
#

while claude opus 4 finished

elder burrow
elder burrow
elder burrow
wicked sage
#

check image

#

..im not

#

wait am i stupid

#

i swear i was

#

no i did not

elder burrow
remote arrow
#

Only Geppeto-1-Image is slow, as usual. It has constant brain damage. The others are just fine.

wicked sage
#

oh you said 4.1

#

sorry im blind

#

im legally blind*

#

i swear

spare rune
#

Ni hao chat

#

Anyonghaseyo chat

#

Hello chat

elder burrow
#

anthropic stinky as usual

wicked sage
#

retrying ...

elder burrow
#

holy locked down

elder burrow
wicked sage
wicked sage
elder burrow
wicked sage
#

this guy might expose lmarena

#

this guy might be the owner i cant prove it

versed nova
#

what is seedream-4-fal?

spare rune
wicked sage
#

@elder burrow from what i see claude opus 4.1 thinking is cooking

spare rune
#

It’s cooler

wicked sage
#

ty

spare rune
#

If you use it you become cool

#

Not cool as me tho

wicked sage
#

gpt 5 high already started

#

great

verbal nimbus
#

It has factual inaccuracies

wicked sage
#

this is cool

candid storm
#

@echo aurora Its been a while since the leaderboard got an update?

leaden laurel
#

i think i did guess stealth model to be seedream 4

#

before

#

but its easier on images

verbal nimbus
#

GLM 4.5 > Longcat

verbal nimbus
ornate kayak
#

Hi there, I'd like to try the video generation bot

verbal nimbus
#

By checking if Pro makes the same factual error as reef. If not, then oceanreef is Flash

elder burrow
#

yo... i figured out which one is truly gemini 3 flash

gleaming kestrel
#

Hi! As a new user, I just wanted to say that I've been working with AI for quite some time, and I discovered how helpful LMArena is for testing models early and comparing them side by side, which is what brought me here.

verbal nimbus
wicked sage
#

<-- gpt 5 high claude 4.1 opus thinking 16k -->

verbal nimbus
#

Flash actually got it right

wicked sage
#

lmk which looks better

#

personally i think gpt 5 high looks better

#

@leaden laurel jumpscare

elder burrow
#

oceanstone is gemini 3 pro

wicked sage
verbal nimbus
wicked sage
verbal nimbus
#

If it is, then I'd be quite disappointed

elder burrow
elder burrow
verbal nimbus
#

Even GLM 4.5 beats it

wicked sage
#

how do i feed claude opus 4.1, gpt 5 high .css

elder burrow
#

btw look, they behave very differently now and i like it

verbal nimbus
elder burrow
verbal nimbus
wicked sage
# elder burrow

wait did you make it so its automatically sending oceanstone responses

#

or smth

verbal nimbus
#

Which is... Odd

echo aurora
verbal nimbus
#

So quite a big difference

wicked sage
#

hey pineapple i got a question

leaden laurel
#

asked some mystery ai to make bytebeat song and it made this: (t>>10^((t>>14|t>>10)&(t>>15&31)))(t>>7&1?127:64) + ((t5&t>>11|t3&t>>13|t7&t>>17)^t*((t>>14&3)+(t>>12&1)))>>2

#

run it

elder burrow
#

OCEANREEF IS WILDLY DIFFERENT

leaden laurel
#

(its not good)
oops bad copy paste

wicked sage
#

t=0 error: ((t >> 10) ^ (((t >> 14) | (t >> 10)) & ((t >> 15) & 31))) is not a function

leaden laurel
elder burrow
leaden laurel
#

atleast i got phoenix

#

onc

#

e

wicked sage
#

yeah true

#

WHAT IS PHOENIX

leaden laurel
#

idk

verbal nimbus
elder burrow
wicked sage
elder burrow
#

lmfao

verbal nimbus
#

Oceanreef sucks tbh

elder burrow
verbal nimbus
#

Like even GLM beats it

leaden laurel
#

i mean phantom

verbal nimbus
elder burrow
wicked sage
#

how are you finding ts

verbal nimbus
#

I haven't seen phantom either

#

Wasn't that an anonymous model way back

#

It's so easy to distinguish Qwen 3

#

“It likes to write like this” ✅

#

In quotes

verbal nimbus
#

It's not really a blind battle if I can identify Qwen 3 every time, lol

wicked sage
#

overall, i think opus claude 4.1 thinking 16k should be top 1 in webdev

elder burrow
#

oh god

wicked sage
#

even tho the css is... meh its still fast and gets the job done

elder burrow
#

i just got the whole system prompt of oceanstone

leaden laurel
#

i told that in other servers but

verbal nimbus
wicked sage
leaden laurel
#

i think mystery "carrot" model in anycoder is sonnet 4.1

elder burrow
verbal nimbus
#

How are you guys encountering it so often 😂

leaden laurel
#

what is this

wicked sage
#

hey omnificat

#

what prompt do you use for finding stuff like this

#

its kinda cool ngl

wintry tinsel
wicked sage
wintry tinsel
#

I did notice the quality didn’t feel quite right but I wasn’t sure if it was my system prompt or what

verbal nimbus
echo aurora
# verbal nimbus > “It likes to write like this” ✅

I wasn't able to replicate this, do you have an example prompt you'd be able to share? Ultimately, we do care if there are ways users are able to skew votes in a certain way that'd create inaccurate leaderboards. We're constantly on the hunt for preventing votes that aren't authentic human preference.

verbal nimbus
#

Usually programming questions

strong basalt
leaden laurel
#

ok

pallid raptor
#

flux-1-kontext-dev to noSomething went wrong with this response, please try again.

leaden laurel
#

mouse

verbal nimbus
# echo aurora I wasn't able to replicate this, do you have an example prompt you'd be able to ...

Here's a dumb one I cooked up (dumb since it doesn't really make sense):

What does the receiver do when a duplicate TCP packet is received? Analyze which approach is best regarded in terms of performance and congestion control.

Got Qwen 3 Max to write in quotes again. The more complex the question, the more likely it is for quotes to appear, I think.

It should be easy to find in the dataset. Filter/grep for > “

(Edit) See this message for a better example + screenshot: #general message

gleaming kettle
#

i still believe nano is better than seedream 4

wicked sage
#

i love lmarena

glossy umbra
#

it is not good with retaining objects while editing their attributes

#

sure it can take an object and add it into an image or remove one, but it cant work with them that well

obsidian current
#

how to make video

strong basalt
#

does someone know what is the "lmarena-internal-test-only" ?

verbal nimbus
#

😃 Summary

“As seen, Qwen 3 models are very recognizable because they write like this.” ✅

teal mantle
#

Is it just me or gpt 5 instant feels like they backtrained with 4o synthetic data?

strong basalt
#

they're on gpt-5-nano lvl

wicked sage
#

android or iphone

#

im deadass r

#

rn

glossy umbra
wicked sage
#

thank you for telling me this dude

#

youre a life saver

glossy umbra
#

I know you're sarcastic but it's really weird to see. Even gpt 5 mini is better than chat.

verbal nimbus
#

I still haven't gotten oceanstone 🤪

wicked sage
#

i mostly use gpt 5 chat, i didnt know it was this bad

#

it was either gpt 5 high or chat

#

again, ty

glossy umbra
# wicked sage again, ty

No worries man
GPT-5-HIGH is very good, but slow. If you need quick answers but not on the joke level of 5-chat, I suggest using Sonnet 3.5.

wicked sage
#

ty!

verbal nimbus
#

Oh just got it. Oceanstone > oceanreef

#

But about on par with Qwen 3 🤷

#

I feel like it's a Flash checkpoint

#

Doesn't feel like Gemini 3 to me

verbal nimbus
wicked sage
#

✌️

burnt sinew
wicked sage
#

also

claude sonnet 4 thinking 32k
or
claude opus 4.1 thinking 16k

glossy umbra
limber marlin
#

Hello

wicked sage
#

hi

burnt sinew
#

1368 vs 1430

wicked sage
#

so sonnet 4 thinking 32k is better?

#

atp i might short it to sonnet 4 32k

burnt sinew
#

if you need speed

empty stump
#

is gpt 5 chat the instant model in chatgpt?

wicked sage
#

oh

#

so thinking is just more time but better responses
non-thinking is less time but.. not sure if they are bad responses

#

if someone can help me, which is better opus or sonnet?

sullen quest
#

Opus

wicked sage
#

idk which is better

wicked sage
verbal nimbus
#

It just have used Qwen data

wicked sage
#

what does this string of numbers mean

empty stump
#

8/5/2025

#

?

wicked sage
#

oh

#

im stupid

burnt sinew
wicked sage
#

ty

wicked sage
verbal nimbus
#

why is it putting everything in quotes, lol

wicked sage
#

my question is where you found this

verbal nimbus
#

Random battle

wicked sage
#

ok

#

ty

verbal nimbus
#

Should be on direct mode too ig, since it's not anonymous

#

But the other Chinese models are better

verbal nimbus
#

Lol

#

Logcat

wicked sage
#

first try non-direct chat

#

nice

wicked sage
#

nvm i see why opus is good

#

back to back

elder burrow
#

its pretty bad though

wicked sage
verbal nimbus
wicked sage
#

its funny seeing random ass models i have never seen

elder burrow
#

they only have a flash model and they trained on other models

#

i hoped longcat was a good model

#

cz u know