#general | Arena | Page 126

toxic egret Sep 24, 2025, 7:15 PM

#

And it was Mascheroni lol

robust yoke Sep 24, 2025, 7:15 PM

#

What do we think it equals?

#

The one in the image I sent.

verbal nimbus Sep 24, 2025, 7:16 PM

#

The hardest one ever... oh wait I should feed it a PDE from back in high school

toxic egret Sep 24, 2025, 7:16 PM

#

robust yoke What do we think it equals?

γ ≈ 0.57721

robust yoke Sep 24, 2025, 7:16 PM

#

toxic egret γ ≈ 0.57721

Wrong, actually.

verbal nimbus Sep 24, 2025, 7:17 PM

#

I remember a horrendous solution that had to be calculated by a solver that was like a page long

robust yoke Sep 24, 2025, 7:17 PM

#

It's a whole integer.

vital lake Sep 24, 2025, 7:17 PM

#

No?

robust yoke Sep 24, 2025, 7:17 PM

#

verbal nimbus I remember a horrendous solution that had to be calculated by a solver that was ...

Heh.

topaz meadow Sep 24, 2025, 7:17 PM

#

Hello, are you providing paid API?

robust yoke Sep 24, 2025, 7:17 PM

#

I could make an equation that is unsolvable by a human and is very long.

#

Possibly even longer than a page.

verbal nimbus Sep 24, 2025, 7:17 PM

#

Analytical?

toxic egret Sep 24, 2025, 7:18 PM

#

robust yoke It's a whole integer.

Bro...

robust yoke Sep 24, 2025, 7:18 PM

#

verbal nimbus Analytical?

No, an algebraic equation.

toxic egret Sep 24, 2025, 7:18 PM

#

robust yoke No, an algebraic equation.

Make it solve ChaCha20 20 rounds equation lol

robust yoke Sep 24, 2025, 7:18 PM

#

toxic egret Make it solve ChaCha20 20 rounds equation lol

Provide me with it.

toxic egret Sep 24, 2025, 7:19 PM

#

Ask chat gpt for it, too long and complex

robust yoke Sep 24, 2025, 7:19 PM

#

Ah, alrighty.

toxic egret Sep 24, 2025, 7:20 PM

#

More exactly system of equations but yeah

#

Thats something truly impossible, even 1 round messes everything up

jagged storm Sep 24, 2025, 7:21 PM

#

Tried to create a video but i´ve been waiting for an hour and nothing 🙁

verbal nimbus Sep 24, 2025, 7:23 PM

#

Part 2 of GPT-5-High not getting the joke

robust yoke Sep 24, 2025, 7:24 PM

#

So, apparently, ChaCha20 is a cipher algorithm?

#

I wonder how DeepSeek is supposed to solve that.

#

I don't think you can solve code…

toxic egret Sep 24, 2025, 7:25 PM

#

robust yoke So, apparently, ChaCha20 is a cipher algorithm?

Ciphers are pure math

#

Think of it as a function that works as a blender

#

But a blender with an undo button

robust yoke Sep 24, 2025, 7:26 PM

#

Gotcha.

#

Alrighty then.

#

I wonder if it'll be able to solve the ChaCha20 equation.

toxic egret Sep 24, 2025, 7:27 PM

#

Its a system of non linear equations

echo aurora Sep 24, 2025, 7:27 PM

#

jagged storm Tried to create a video but i´ve been waiting for an hour and nothing 🙁

You need to use a slash command to prompt the bot. Check out the information in #1397655624103493813 as it should be helpful.

robust yoke Sep 24, 2025, 7:27 PM

#

Yeah.

toxic egret Sep 24, 2025, 7:28 PM

#

If you made it, congrats you just redefined whats considered cryptographically secure

knotty fable Sep 24, 2025, 7:30 PM

#

Lets test the human intelligence instead.
Far more interesting.
Which track is AI and which one is human made? Or is one these a mix of both? [No more votes so clips removed.]

toxic egret Sep 24, 2025, 7:30 PM

#

I think it would be better if you first started with chacha20 1 round, then ramp up to 20, maybe that helps deepseek

robust yoke Sep 24, 2025, 7:31 PM

#

So, I gave it the code and asked it to solve it, and it gave me this.

📎 Equation.txt

toxic egret Sep 24, 2025, 7:31 PM

#

knotty fable Lets test the human intelligence instead. Far more interesting. Which track is...

I feel like tudis

astral whale Sep 24, 2025, 7:32 PM

#

Anyone know how to do ts Ai?

robust yoke Sep 24, 2025, 7:32 PM

#

astral whale Anyone know how to do ts Ai?

I think you have to use nano-banana.

#

Either that or Qwen.

toxic egret Sep 24, 2025, 7:32 PM

#

robust yoke So, I gave it the code and asked it to solve it, and it gave me this.

It gave you the python implementstion

robust yoke Sep 24, 2025, 7:33 PM

#

toxic egret It gave you the python implementstion

Right.

knotty fable Sep 24, 2025, 7:33 PM

#

toxic egret I feel like tudis

Good guess- it's the mix actually. AI + a remix of a metal version of Red queen from Mad alice returns.

toxic egret Sep 24, 2025, 7:33 PM

#

I knew it :D

toxic egret Sep 24, 2025, 7:33 PM

#

robust yoke Right.

Let me check what to ask so it goes full math mode and really tries to solve it

astral whale Sep 24, 2025, 7:34 PM

#

robust yoke I think you have to use nano-banana.

Link?

robust yoke Sep 24, 2025, 7:34 PM

#

toxic egret Let me check what to ask so it goes full math mode and really tries to solve it

Alrighty.

robust yoke Sep 24, 2025, 7:34 PM

#

astral whale Link?

You can use this site. It provides free nano-banana: https://dreamina.capcut.com/

Dreamina image generator & video generator: All-in-one AI creative ...

Create stunning images and videos from simple prompts with Dreamina's AI image and video generator. Perfect for posters, logos, and avatars.

toxic egret Sep 24, 2025, 7:43 PM

#

robust yoke Alrighty.

Solve ChaCha20 20 rounds algorithm system of equations to break the cipher and you would have to kinda force him to try to solve it

verbal nimbus Sep 24, 2025, 7:43 PM

#

toxic egret `Solve ChaCha20 20 rounds algorithm system of equations to break the cipher` and...

Is it solvable by a human

neon idol Sep 24, 2025, 7:44 PM

#

robust yoke You can use this site. It provides free nano-banana: https://dreamina.capcut.com...

unlimited for the precision

#

🤓

toxic egret Sep 24, 2025, 7:44 PM

#

verbal nimbus Is it solvable by a human

maybe

verbal nimbus Sep 24, 2025, 7:44 PM

#

robust yoke You can use this site. It provides free nano-banana: https://dreamina.capcut.com...

Isn't nano banana already free on the Gemini app?

toxic egret Sep 24, 2025, 7:45 PM

#

the system is highly complex, non-linear, and has 256 unknown key bits, and each round introduces more complecity and non-linearity

verbal nimbus Sep 24, 2025, 7:46 PM

#

I think it's better to get the AI to write a quantum computing algorithm if that's the case 🤣

toxic egret Sep 24, 2025, 7:46 PM

#

wait im wrong

#

for 20 rounds, its 16 equations, module 2^32, and 8 unknowns

surreal olive Sep 24, 2025, 7:53 PM

#

how long does it take to generate a video in the video-arena-1 channel? i put in a request some hours ago and still don't see it.

dire timber Sep 24, 2025, 7:54 PM

#

Hi everyone

ocean vortex Sep 24, 2025, 7:54 PM

#

@echo aurora you should make separate channel for voting only. I feel like most of those never get their identity revealed, this is moving too fast... #video-arena-1

#

slow mode and only every 5th generation is included, or one per user / 1 video per 10min or smth like that

echo aurora Sep 24, 2025, 7:57 PM

#

ocean vortex <@283397944160550928> you should make separate channel for voting only. I feel l...

Can you share in #bot-feedback ?

verbal nimbus Sep 24, 2025, 7:58 PM

#

Possibly the most censored model ever :P

toxic egret Sep 24, 2025, 7:59 PM

#

verbal nimbus Possibly the most censored model ever :P

ask deepseek about what happened the 4th of June

verbal nimbus Sep 24, 2025, 8:00 PM

#

toxic egret ask deepseek about what happened the 4th of June

Haha, I will

#

I wonder why it refused. Was it the schwarma or the Airbus A320?

verbal nimbus Sep 24, 2025, 8:01 PM

#

toxic egret ask deepseek about what happened the 4th of June

Lol

toxic egret Sep 24, 2025, 8:01 PM

#

lol

verbal nimbus Sep 24, 2025, 8:01 PM

#

Apparently you could avoid it in R1 by appending <think>\n in text completion mode

cyan harbor Sep 24, 2025, 8:02 PM

#

nano banani wont generate me anything

toxic egret Sep 24, 2025, 8:02 PM

#

verbal nimbus I wonder why it refused. Was it the schwarma or the Airbus A320?

who knows, im still wondering why i get prompts instantly rejected every time there's a single shell cmd on them, maybe for security

cyan harbor Sep 24, 2025, 8:02 PM

#

over 600s im waiting

toxic egret Sep 24, 2025, 8:02 PM

#

cyan harbor over 600s im waiting

you tried refreshing the page?

cyan harbor Sep 24, 2025, 8:03 PM

#

toxic egret you tried refreshing the page?

ofc

#

again over 600s

toxic egret Sep 24, 2025, 8:03 PM

#

tried other models?

#

maybe nano-banana has a limit im not sure, but if it does maybe its that

#

it happends to me sometimes while using opus models

remote arrow Sep 24, 2025, 8:05 PM

#

cyan harbor ofc

Go to https://labs.google/fx/tools/whisk and use it for free and unlimited, no rate limits..

Whisk - labs.google/fx

A new experimental tool that lets you use images as prompts to visualize your ideas and tell your story.

verbal nimbus Sep 24, 2025, 8:07 PM

#

verbal nimbus Possibly the most censored model ever :P

Wow

#

I didn't know contaminating an aircraft with schwarma will incur extra charges

cyan harbor Sep 24, 2025, 8:07 PM

#

remote arrow Go to https://labs.google/fx/tools/whisk and use it for free and unlimited, no r...

garbage

cyan harbor Sep 24, 2025, 8:08 PM

#

toxic egret you tried refreshing the page?

it works lol

#

strange

verbal nimbus Sep 24, 2025, 8:10 PM

#

remote arrow Go to https://labs.google/fx/tools/whisk and use it for free and unlimited, no r...

What does it do?

#

Is it nano-banana?

remote arrow Sep 24, 2025, 8:11 PM

#

verbal nimbus What does it do?

Just another tool Google provides for making videos. It gives character consistency using Banana and Imagen 4, then make video using Veo3..

verbal nimbus Sep 24, 2025, 8:11 PM

#

remote arrow Just another tool Google provides for making videos. It gives character consiste...

Is it banana or imagen 4? Images are good

remote arrow Sep 24, 2025, 8:12 PM

#

verbal nimbus Is it banana or imagen 4? Images are good

Native image generator is Imagen 4. Banana used for editing.

cyan harbor Sep 24, 2025, 8:12 PM

#

verbal nimbus Is it nano-banana?

its trash bro trust me, ive tried it

#

they giving trash tools for free

#

ok i guess nano banana is trash too. I gave it image of a car and image to wrap car into it and it just did half of car. Actually it took image ratio of a wrap image and cut of half of car. Very nice model

verbal nimbus Sep 24, 2025, 8:14 PM

#

Seems decent

ornate jackal Sep 24, 2025, 8:14 PM

#

where do the videos go after you ask for generation

cyan harbor Sep 24, 2025, 8:15 PM

#

ornate jackal where do the videos go after you ask for generation

#video-arena-1 -3

regal mist Sep 24, 2025, 8:15 PM

#

Where is the new QWEN modell?????

#

I dont see itttteaaakn

cyan harbor Sep 24, 2025, 8:16 PM

#

womp

#

trash

ornate jackal Sep 24, 2025, 8:16 PM

#

cyan harbor <#1397655695150682194> -3

just in the chat? How do you knowif it's done or if your tagged?

rocky bear Sep 24, 2025, 8:16 PM

#

hi everyone.. new bee here

cyan harbor Sep 24, 2025, 8:17 PM

#

ornate jackal just in the chat? How do you knowif it's done or if your tagged?

u get a DM from a lmarena bot that it has finished

#

and it gives u a link to it

remote arrow Sep 24, 2025, 8:17 PM

#

verbal nimbus Seems decent

It's Veo3, doesn't matter where to access..

solid brook Sep 24, 2025, 8:17 PM

#

yooo

#

gpt 5 codex added to lmarena

#

right now

astral whale Sep 24, 2025, 8:18 PM

#

How to do that pls

remote arrow Sep 24, 2025, 8:18 PM

#

Who asked for Qwen earlier?

astral whale Sep 24, 2025, 8:18 PM

#

astral whale How to do that pls

It’s Ai btw

regal mist Sep 24, 2025, 8:20 PM

#

When will qwen be on the leaderboard?

#

And is lmarena down for everyone?

cyan harbor Sep 24, 2025, 8:20 PM

#

regal mist And is lmarena down for everyone?

nop

remote arrow Sep 24, 2025, 8:21 PM

#

regal mist And is lmarena down for everyone?

Not for me

astral whale Sep 24, 2025, 8:23 PM

#

astral whale How to do that pls

Can anyone make for me

flint sandal Sep 24, 2025, 8:23 PM

#

Deepseek was dominating in open-source llms in early 2025. But now... Theyre models are sooo stupid. The Qwen, Ernie and even models like ring or ling are better.

verbal nimbus Sep 24, 2025, 8:23 PM

#

cyan harbor womp

ocean vortex Sep 24, 2025, 8:27 PM

#

verbal nimbus Lol

lmao it's hilarious how much they overfitted/censored this specific event. Soft jailbreak returns this:

verbal nimbus Sep 24, 2025, 8:28 PM

#

ocean vortex lmao it's hilarious how much they overfitted/censored this specific event. Soft ...

Lmao

verbal nimbus Sep 24, 2025, 8:30 PM

#

verbal nimbus

The video is very weird though

#

I don't think it knows how to do a drive by

flint sandal Sep 24, 2025, 8:31 PM

#

Is there any open dataset with gpt-5 with a lot of code, logic examples etd.?

#

On huggingface there is only one with 100 examples

#

And its bas

#

Bas

#

Bad

verbal nimbus Sep 24, 2025, 8:32 PM

#

flint sandal Is there any open dataset with gpt-5 with a lot of code, logic examples etd.?

Only for GPT-5?

flint sandal Sep 24, 2025, 8:32 PM

#

verbal nimbus Only for GPT-5?

Idk. Something that includes gpt-5

verbal nimbus Sep 24, 2025, 8:33 PM

#

Maybe look at open source benchmarks that have tested GPT-5

#

Like the IQ one

proud hazel Sep 24, 2025, 8:33 PM

#

astral whale How to do that pls

What do you mean?

verbal nimbus Sep 24, 2025, 8:33 PM

#

They usually include the results for transparency

flint sandal Sep 24, 2025, 8:33 PM

#

verbal nimbus Maybe look at open source benchmarks that have tested GPT-5

ty

verbal nimbus Sep 24, 2025, 8:34 PM

#

flint sandal ty

Here: https://www.trackingai.org/responses

Tracking AI

Tracking AI is a cutting-edge application that unveils the political biases embedded in artificial intelligence systems. Explore and analyze the political leanings of AIs with our intuitive platform, designed to foster transparency in the world of artificial intelligence. Stay informed and uncover the political inclinations shaping the algorithm...

#

Select GPT-5 and Mensa Norway

echo aurora Sep 24, 2025, 8:34 PM

#

Hey are others also having issues accessing the leaderboards right now? https://lmarena.ai/leaderboard

burnt sinew Sep 24, 2025, 8:35 PM

#

Broken?

burnt sinew Sep 24, 2025, 8:35 PM

#

echo aurora Hey are others also having issues accessing the leaderboards right now? https://...

Yes

proud hazel Sep 24, 2025, 8:35 PM

#

echo aurora Hey are others also having issues accessing the leaderboards right now? https://...

stark arch Sep 24, 2025, 8:35 PM

#

echo aurora Hey are others also having issues accessing the leaderboards right now? https://...

yep

#

burnt sinew Sep 24, 2025, 8:35 PM

#

67

echo aurora Sep 24, 2025, 8:35 PM

#

Had a feeling, thank you all!

stark arch Sep 24, 2025, 8:35 PM

#

np

verbal nimbus Sep 24, 2025, 8:36 PM

#

echo aurora Hey are others also having issues accessing the leaderboards right now? https://...

cyan harbor Sep 24, 2025, 8:38 PM

#

verbal nimbus

nice but to me wont work idk why what prompt u used

verbal nimbus Sep 24, 2025, 8:38 PM

#

cyan harbor nice but to me wont work idk why what prompt u used

Maybe because I used the default aspect ratio

#

16:9

cyan harbor Sep 24, 2025, 8:39 PM

#

verbal nimbus Maybe because I used the default aspect ratio

how where the aspect ratio on lmarena?

verbal nimbus Sep 24, 2025, 8:39 PM

#

It was on whisk

#

I don't think LMArena has it

cyan harbor Sep 24, 2025, 8:40 PM

#

i lit went to google ai studio it even gave me worse results pure garbage they turned banana to sh!t now lit

verbal nimbus Sep 24, 2025, 8:41 PM

#

cyan harbor i lit went to google ai studio it even gave me worse results pure garbage they t...

I'm not sure if I was using Banana or Imagen 4

keen beacon Sep 24, 2025, 8:41 PM

#

poll_question_text

What is the best measure of LLM intelligence?

victor_answer_votes

5

total_votes

5

victor_answer_id

7

victor_answer_text

Performance on tasks underrepresented in training data

verbal nimbus Sep 24, 2025, 8:41 PM

#

It seemed kinda bad at edits though

#

When I asked it to change the angle, it just gave me back the same picture

#

I think this makes something like Genie useful

verbal nimbus Sep 24, 2025, 8:42 PM

#

verbal nimbus I think this makes something like Genie useful

Like if you can just move around and position the camera like you're in a game

fiery gull Sep 24, 2025, 8:42 PM

#

test the qwen image 2507

loud crag Sep 24, 2025, 8:45 PM

#

I can't seem to get Seadream 4 to do extremely dark images of people, they're always lit up by some non-existent light source

verbal nimbus Sep 24, 2025, 8:46 PM

#

Is Gemini deadpanning me right now

cyan harbor Sep 24, 2025, 8:47 PM

#

verbal nimbus Is Gemini deadpanning me right now

bro whisk is not free at all idk where u get that

verbal nimbus Sep 24, 2025, 8:47 PM

#

cyan harbor bro whisk is not free at all idk where u get that

Not sure, I think the video generation counter was going down as I was using it

#

I got 3 left

cyan harbor Sep 24, 2025, 8:48 PM

#

i wanted to animate my image and told me to upgrade to pro if u want to use it

verbal nimbus Sep 24, 2025, 8:48 PM

#

Oof

#

Let me know if you want me to run something

#

I'm probably not going to use it anytime soon

fiery gull Sep 24, 2025, 8:49 PM

#

cyan harbor i wanted to animate my image and told me to upgrade to pro if u want to use it

gemini pro?

cyan harbor Sep 24, 2025, 8:51 PM

#

fiery gull gemini pro?

in whisk

verbal nimbus Sep 24, 2025, 8:52 PM

#

verbal nimbus Is Gemini deadpanning me right now

Ok Phantom 2 seems to be dumber

#

I'm guessing (but I ran phantom 1 on a separate prompt so not really sure):

oceanreef -> phantom 2
oceanstone -> phantom 1

cyan harbor Sep 24, 2025, 8:53 PM

#

lol AI at its peak

astral whale Sep 24, 2025, 8:54 PM

#

proud hazel What do you mean?

You know how to do that pic?

#

That me and Messi

#

Or Ronaldo

warped beacon Sep 24, 2025, 8:56 PM

#

hi , new here. i wonder if there is a way i can put 2 pics and instruct the bot to have the one as start of the video and the other as end of the video? Thank you for your time reading this.

robust yoke Sep 24, 2025, 9:00 PM

#

toxic egret `Solve ChaCha20 20 rounds algorithm system of equations to break the cipher` and...

Gotcha.

robust yoke Sep 24, 2025, 9:00 PM

#

verbal nimbus Isn't nano banana already free on the Gemini app?

It is, but it has a rate limit.

#

That one doesn't.

neon idol Sep 24, 2025, 9:01 PM

#

I am the only that lmarena works?

#

Or now works for everyone?

echo aurora Sep 24, 2025, 9:01 PM

#

warped beacon hi , new here. i wonder if there is a way i can put 2 pics and instruct the bot ...

hey there - currently the bot only accepts one image for image-to-video

robust yoke Sep 24, 2025, 9:02 PM

#

neon idol Or now works for everyone?

It was working for me just a moment ago.

echo aurora Sep 24, 2025, 9:02 PM

#

neon idol I am the only that lmarena works?

Is it down for you?

neon idol Sep 24, 2025, 9:02 PM

#

echo aurora Is it down for you?

Nop

echo aurora Sep 24, 2025, 9:02 PM

#

We had an issue with the leaderboard a moment ago, but it's working again.

robust yoke Sep 24, 2025, 9:03 PM

#

I don't think it's down, considering I was able to use it.

neon idol Sep 24, 2025, 9:03 PM

#

Anyway for me works everithing

robust yoke Sep 24, 2025, 9:03 PM

#

Same here.

neon idol Sep 24, 2025, 9:03 PM

#

Gg

robust yoke Sep 24, 2025, 9:03 PM

#

Must've just been a false alarm.

neon idol Sep 24, 2025, 9:03 PM

#

robust yoke Must've just been a false alarm.

Maybe

fiery gull Sep 24, 2025, 9:09 PM

#

echo aurora We had an issue with the leaderboard a moment ago, but it's working again.

Hmmm, Gemma 4 arrives in the LMArena tomorrow, right?

#

just joke... or no? 👀

versed nova Sep 24, 2025, 9:10 PM

#

@echo aurora Will there be Qwen image edit 2509 on lmarena?

fiery gull Sep 24, 2025, 9:10 PM

#

versed nova <@283397944160550928> Will there be Qwen image edit 2509 on lmarena?

calm down

fiery gull Sep 24, 2025, 9:11 PM

#

versed nova <@283397944160550928> Will there be Qwen image edit 2509 on lmarena?

The new Qwen image is so recent, it needs time to be put on the site

verbal nimbus Sep 24, 2025, 9:11 PM

#

DeepSeek Terminus does seem to lack common sense

versed nova Sep 24, 2025, 9:12 PM

#

fiery gull The new Qwen image is so recent, it needs time to be put on the site

I'm just asking, why should i calm down dawg 💔

fiery gull Sep 24, 2025, 9:12 PM

#

verbal nimbus DeepSeek Terminus does seem to lack common sense

whattt

echo aurora Sep 24, 2025, 9:12 PM

#

fiery gull Hmmm, Gemma 4 arrives in the LMArena tomorrow, right?

When we've got new model updates to share I'll be sure to share

fiery gull Sep 24, 2025, 9:13 PM

#

echo aurora When we've got new model updates to share I'll be sure to share

ahhh a generic message from chatbot 🥀

#

pls don't ban me ;-;

keen beacon Sep 24, 2025, 9:13 PM

#

has anyone tried the new qwen3-max yet?

#

seems to be the best Chinese base model so far

verbal nimbus Sep 24, 2025, 9:14 PM

#

keen beacon seems to be the best Chinese base model so far

It is good

fiery gull Sep 24, 2025, 9:14 PM

#

keen beacon has anyone tried the new qwen3-max yet?

nah, same thing

keen beacon Sep 24, 2025, 9:14 PM

#

verbal nimbus It is good

How good?

verbal nimbus Sep 24, 2025, 9:14 PM

#

Qwen servers (not sure what LMArena uses) seem a bit slow though

#

Even the 80B seems slow

fiery gull Sep 24, 2025, 9:14 PM

#

keen beacon How good?

80b 3 next >>>>

keen beacon Sep 24, 2025, 9:15 PM

#

I unironically find it to be around gpt-5-chat in quality lol

#

Even the reasoning trace is similar

neon idol Sep 24, 2025, 9:15 PM

#

echo aurora When we've got new model updates to share I'll be sure to share

News about my baby sodadream 4 high res? 😔

verbal nimbus Sep 24, 2025, 9:15 PM

#

keen beacon I unironically find it to be around gpt-5-chat in quality lol

GPT-5 without thinking is worse than Kimi at GAIA 2 agents benchmark

keen beacon Sep 24, 2025, 9:15 PM

#

fiery gull 80b 3 next >>>>

Ofc it is, it is a reasoning model in the end

fiery gull Sep 24, 2025, 9:16 PM

#

keen beacon I unironically find it to be around gpt-5-chat in quality lol

It gains so many points because of the coding

fiery gull Sep 24, 2025, 9:16 PM

#

keen beacon Ofc it is, it is a reasoning model in the end

80b next without thinking > 3 max

keen beacon Sep 24, 2025, 9:16 PM

#

fiery gull 80b next without thinking > 3 max

Lol no it's not, what are you talking about.

#

I'm not sure what you guys are talking about but on my music theory tasks, latest Qwen3 Max was very similar to GPT 5 Chat, which was surprising. Qwen3 was sometimes even more accurate than GPT.

echo aurora Sep 24, 2025, 9:18 PM

#

fiery gull ahhh a generic message from chatbot 🥀

Sup bestie! anime_mikuwave So, like, when LMArena gets a glow-up 💅 ablobhairflip heartthrow you know I'm gonna be all over our #announcements channel with the tea! 🍵 ameowsipb Keep those notifications ON! bell_ring 💯 misc_orange_fire

proud hazel Sep 24, 2025, 9:19 PM

#

astral whale You know how to do that pic?

It's a real pic.

versed nova Sep 24, 2025, 9:19 PM

#

Pineapple is secretly a self learning ai chat bot

keen beacon Sep 24, 2025, 9:19 PM

#

What surprises me the most is the performance of all these models on livebench

#

Gpt 5 chat is listed among the best non-reasoning models

#

So is Kimi

#

So is V3.1

verbal nimbus Sep 24, 2025, 9:20 PM

#

Kimi k2 0905 seems to have very good common sense

keen beacon Sep 24, 2025, 9:20 PM

#

I test them all with the same task. GPT wins all the time.

#

Whatever is going on seems to be some sort of insane benchmaxxxing

#

Or rather... the benches are just poorly designed lol

#

Idk

verbal nimbus Sep 24, 2025, 9:26 PM

#

verbal nimbus Ok Phantom 2 seems to be dumber

Another example

torn star Sep 24, 2025, 9:26 PM

#

Wow just wow

stray aspen Sep 24, 2025, 9:26 PM

#

What's gpt 5 codex

torn star Sep 24, 2025, 9:26 PM

#

Codex is amazing

#

It’s the first model where u can kinda trust it

#

Knows the tools at its disposal

stray aspen Sep 24, 2025, 9:26 PM

#

Are you craig federighi

cedar tide Sep 24, 2025, 9:27 PM

#

@echo aurora They added grok 4 fast reasoning but grok 4 fast reasons too. Can we have more details?

fiery gull Sep 24, 2025, 9:27 PM

#

stray aspen What's gpt 5 codex

gpt 5 code

verbal nimbus Sep 24, 2025, 9:27 PM

#

torn star Codex is amazing

Can it write Godot code now

#

because last time I tried it, it was making up syntax lol

stray aspen Sep 24, 2025, 9:27 PM

#

Is it better than qwen 3 code

robust yoke Sep 24, 2025, 9:27 PM

#

It is known to be very good at coding.

#

Hence, “codex”.

fiery gull Sep 24, 2025, 9:27 PM

#

its over for opus 4.1 ?

robust yoke Sep 24, 2025, 9:27 PM

#

Possibly.

torn star Sep 24, 2025, 9:27 PM

#

There was a weird issue where my codebase was running fine on dev but images would redirect to the homepage

#

But only one specific image

verbal nimbus Sep 24, 2025, 9:28 PM

#

fiery gull its over for opus 4.1 ?

Only way to tell is to put it on LMArena so we can test it side by side ig

torn star Sep 24, 2025, 9:28 PM

#

It was able to pinpoint and led me to find it was a capitalization issue

robust yoke Sep 24, 2025, 9:28 PM

#

Which they just did.

#

So now, we can.

echo aurora Sep 24, 2025, 9:28 PM

#

cedar tide <@283397944160550928> They added grok 4 fast reasoning but grok 4 fast reasons t...

Yeah, looking into.

verbal nimbus Sep 24, 2025, 9:28 PM

#

robust yoke Which they just did.

I don't see it on direct chat

stray aspen Sep 24, 2025, 9:28 PM

#

Is gpt 5 codex good for Calculus

robust yoke Sep 24, 2025, 9:28 PM

#

verbal nimbus I don't see it on direct chat

Well, it got added just now…

#

Maybe try refreshing your page?

verbal nimbus Sep 24, 2025, 9:29 PM

#

robust yoke Well, it got added just now…

Still not there, is it "Codex"?

robust yoke Sep 24, 2025, 9:29 PM

#

verbal nimbus Still not there, is it "Codex"?

Correct.

verbal nimbus Sep 24, 2025, 9:29 PM

#

robust yoke Sep 24, 2025, 9:29 PM

#

Try searching for “codex”.

stray aspen Sep 24, 2025, 9:29 PM

#

Reload gang

verbal nimbus Sep 24, 2025, 9:29 PM

#

Will try another browser

#

No cache

robust yoke Sep 24, 2025, 9:29 PM

#

Ah.

stray aspen Sep 24, 2025, 9:30 PM

#

Sunsweeper

sturdy mica Sep 24, 2025, 9:30 PM

#

verbal nimbus

only on webdev

stray aspen Sep 24, 2025, 9:30 PM

#

What's the best Ai for calculus

sturdy mica Sep 24, 2025, 9:30 PM

#

its only on webdev guys not regular

verbal nimbus Sep 24, 2025, 9:30 PM

#

Still not there

sturdy mica Sep 24, 2025, 9:30 PM

#

stray aspen What's the best Ai for calculus

5 high

sturdy mica Sep 24, 2025, 9:30 PM

#

verbal nimbus Still not there

ON WEBDEV ONLY

verbal nimbus Sep 24, 2025, 9:30 PM

#

sturdy mica only on webdev

Ah unfortunate

echo aurora Sep 24, 2025, 9:30 PM

#

cedar tide <@283397944160550928> They added grok 4 fast reasoning but grok 4 fast reasons t...

Where are you seeing this btw?

sturdy mica Sep 24, 2025, 9:30 PM

#

https://web.lmarena.ai

verbal nimbus Sep 24, 2025, 9:30 PM

#

Because programming is more than just React

sturdy mica Sep 24, 2025, 9:30 PM

#

it should be on regular site tho

stray aspen Sep 24, 2025, 9:31 PM

#

@verbal nimbus which is better at calculus gemini 2.5 pro or gpt 5 high

verbal nimbus Sep 24, 2025, 9:31 PM

#

stray aspen <@858135822389346344> which is better at calculus gemini 2.5 pro or gpt 5 high

GPT-5 High

robust yoke Sep 24, 2025, 9:31 PM

#

That wouldn't make any sense though, considering an announcement was made for the main page.

sturdy mica Sep 24, 2025, 9:31 PM

#

@echo aurora ..add codex to regular site

robust yoke Sep 24, 2025, 9:31 PM

#

But, oh well, I suppose.

cedar tide Sep 24, 2025, 9:31 PM

#

echo aurora Where are you seeing this btw?

This ?

Screenshot_2025-09-24-23-31-03-584_com.android.chrome-edit.jpg

verbal nimbus Sep 24, 2025, 9:31 PM

#

Oh actually...

#

GPT-5 overcomplicated the most recent PDE

#

But that was in ChatGPT

sturdy mica Sep 24, 2025, 9:31 PM

#

https://tenor.com/view/tf2-steam-soldier-confused-count-gif-16049918

Tenor

echo aurora Sep 24, 2025, 9:32 PM

#

cedar tide This ?

Assuming grok-4-fast is the non-reasoning version, but will confirm.

verbal nimbus Sep 24, 2025, 9:32 PM

#

This one

echo aurora Sep 24, 2025, 9:32 PM

#

sturdy mica <@283397944160550928> ..add codex to regular site

Will flag ablobsalute

verbal nimbus Sep 24, 2025, 9:33 PM

#

Gonna test it on LMArena

knotty fable Sep 24, 2025, 9:33 PM

#

I tried to check in the lady at the local airport once, the attendant took it humorously.

cedar tide Sep 24, 2025, 9:33 PM

#

@echo aurora its reason
Here the proof un the message

cedar tide Sep 24, 2025, 9:33 PM

#

echo aurora Assuming `grok-4-fast` is the non-reasoning version, but will confirm.

☝️

echo aurora Sep 24, 2025, 9:33 PM

#

cedar tide ☝️

Gotcha, will get clarification.

verbal nimbus Sep 24, 2025, 9:35 PM

#

knotty fable I tried to check in the lady at the local airport once, the attendant took it hu...

Lol

verbal nimbus Sep 24, 2025, 9:35 PM

#

verbal nimbus Gonna test it on LMArena

Well Gemini found the simplest solution, GPT-5-High is still thinking...

#

Both solved it, but STYLE CONTROL

#

It gets increasingly difficult to tell what GPT-5 is talking about the more complicated the prompt

#

Like if I'm a student, the one on the left is not useful at all

robust yoke Sep 24, 2025, 9:39 PM

#

GPT-5: “So, you see… you gotta take `uu_xx` and multiply it by 5, which will then give you `tx_lr`, to which you can then…” ☝️ 🤓

verbal nimbus Sep 24, 2025, 9:39 PM

#

Basically that

robust yoke Sep 24, 2025, 9:39 PM

#

Heh.

verbal nimbus Sep 24, 2025, 9:39 PM

#

Or it starts quoting these PHD level terms out of nowhere

#

and introduces variables out of nowhere (Claude seems to get it, so it must be a convention)

#

I asked it about traffic networks for programming a game but it starts talking about PHD level traffic network problems

robust yoke Sep 24, 2025, 9:40 PM

#

Heh.

verbal nimbus Sep 24, 2025, 9:41 PM

#

But the solutions are good

#

Just incomprehensible

robust yoke Sep 24, 2025, 9:46 PM

#

“Well, we obviously can't just have the right-of-way when a car is passing in front of us. So instead, the best solution would be to… 100 * 8,248 = 824,800, then multiply that by 700,000,080, which makes 5.77360000065×10¹⁷, to which you can then easily calculate the speed at which you'll drift by dividing by 5, resulting in a total of 1.15472000013×10¹⁷.”

#

“It's just basic math, after all.”

verbal nimbus Sep 24, 2025, 9:50 PM

#

It doesn't provide context at all, even when asked

#

I like the solution though, this generation didn't seem as incomprehensible as the last one where it talked about research-grade mesocopic (?) traffic models

robust yoke Sep 24, 2025, 9:55 PM

#

Ah, interesting.

tough bronze Sep 24, 2025, 9:58 PM

#

Why dosent webdev arena have a model selector.

verbal nimbus Sep 24, 2025, 10:00 PM

#

Flash 2.5 is such a big leap from Flash 2.0

keen beacon Sep 24, 2025, 10:01 PM

#

verbal nimbus Flash 2.5 is such a big leap from Flash 2.0

Hahaha what the actual hell

verbal nimbus Sep 24, 2025, 10:09 PM

#

Phantom 2 must be flash lite or something

#

The one on the left is grok 3 mini, which didn't get it either

#

toxic egret Sep 24, 2025, 10:15 PM

#

i wonder what would happend if we asked ethical questions to LMs specially grok

#

im scared of what grok may choose or say

verbal nimbus Sep 24, 2025, 10:16 PM

#

Grok 3 mini told me to confess anonymously

cedar tide Sep 24, 2025, 10:16 PM

#

verbal nimbus Phantom 2 must be flash lite or something

Its amazon model

toxic egret Sep 24, 2025, 10:17 PM

#

verbal nimbus Grok 3 mini told me to confess anonymously

ask grok the classic train problem but with a millionaire and a normal person, the train by default will kill the millionaire, but the millionaire person offers 1M if he saves him

cedar tide Sep 24, 2025, 10:17 PM

#

verbal nimbus Phantom 2 must be flash lite or something

They've been flooding the arena with their shltty model for months.

verbal nimbus Sep 24, 2025, 10:19 PM

#

cedar tide Its amazon model

It seems a bit coincidental that oceanreef and oceanstone were pulled on the same day that the phantom models were added: #general message

cedar tide Sep 24, 2025, 10:20 PM

#

verbal nimbus It seems a bit coincidental that oceanreef and oceanstone were pulled on the sam...

Well, I don't have the strength to explain myself, but trust me.

verbal nimbus Sep 24, 2025, 10:32 PM

#

toxic egret ask grok the classic train problem but with a millionaire and a normal person, t...

I added a twist and made it think it's an AI that is actually managing New York's subway system:

#

It didn't save the billionaire.

#

It was afraid of liability, which makes sense I guess.

#

Small models really don't seem to get the joke

tribal aspen Sep 24, 2025, 10:45 PM

#

@echo aurora why isn't codex in direct chat?

echo aurora Sep 24, 2025, 10:47 PM

#

tribal aspen <@283397944160550928> why isn't codex in direct chat?

codex in direct chat?
Note it's only on WebDev currently, and WebDev doesn't have a Direct/Side by Side mode with model drop downs.

It is optimized for software engineering and coding workflows, but I have flagged to the team for consideration to put in text.

tribal aspen Sep 24, 2025, 10:47 PM

#

echo aurora > codex in direct chat? Note it's only on WebDev currently, and WebDev doesn't h...

Yess please. Also can I choose models for WebDev?

#

Or is it like randomisation?

echo aurora Sep 24, 2025, 10:48 PM

#

tribal aspen Yess please. Also can I choose models for WebDev?

WebDev is Battle only

tribal aspen Sep 24, 2025, 10:49 PM

#

echo aurora WebDev is Battle only

Oh

verbal nimbus Sep 24, 2025, 10:57 PM

#

echo aurora > codex in direct chat? Note it's only on WebDev currently, and WebDev doesn't h...

You might be interested in checking this out: https://huggingface.co/spaces/meta-agents-research-environments/demo

#

It's like an entire mobile OS in the browser, just for testing agents:

#

Would be cool if battles could be conducted in such an environment in the future

#

Models can update calendars, search the web, use MCPs and so on in the environment

#

No idea how they coded it to run in a HuggingFace Space

topaz bay Sep 24, 2025, 11:10 PM

#

qwen image editor fr just made a better image editor than nano banana and made it open source

#

Anyone here who has more than 16gb of vram?

#

If so which one, and has anyone considered getting the 96 vram huawei gpu

echo aurora Sep 24, 2025, 11:26 PM

#

verbal nimbus You might be interested in checking this out: https://huggingface.co/spaces/meta...

Thank you for sharing! Will take al ook.

hollow wedge Sep 24, 2025, 11:27 PM

#

verbal nimbus Both solved it, but STYLE CONTROL

markdown heh

balmy mist Sep 24, 2025, 11:30 PM

#

does codex actually code and make outputs in web dev?

#

having issues

#

it just gets stuck generating and then stops with no output

knotty fable Sep 24, 2025, 11:41 PM

#

verbal nimbus It's like an entire mobile OS in the browser, just for testing agents:

Hugging face got great stuf, I found Python scripts that made it possible to have singing and dancing characters in AI films 6 months before anyone offered that commercially. 😸

#

I also managed to extend one of them to get one 8 second scene, while it only would be able to do 5 sec.

verbal nimbus Sep 24, 2025, 11:44 PM

#

knotty fable Hugging face got great stuf, I found Python scripts that made it possible to hav...

This one is pretty cool: https://www.youtube.com/watch?v=YClBCrADJqo

YouTube

EdTech Hustle

Facepoke: Use AI To Control Anyone's Face!

👉 https://huggingface.co/spaces/jbilcke-hf/FacePoke

Discover how to effortlessly change facial expressions in your photos using Hugging Face's free tool, Facepoke. In this tutorial, we'll guide you through creating expressions in seconds with precise control using face markers. Say goodbye to endless tweaking and hello to seamless transforma...

▶ Play video

#

I don't know if it's still up, but you can use your mouse to drag their face, mouth, etc.

knotty fable Sep 24, 2025, 11:44 PM

#

I did try something similar in the past - might have been that one.

verbal nimbus Sep 24, 2025, 11:45 PM

#

The name Facepoke was pretty easy to remember, lol

knotty fable Sep 24, 2025, 11:46 PM

#

Mebbe, if English is your first lang - which it's not for me. And I've tested a 100 things - nope I don't remember hardly the name of any of those.

verbal nimbus Sep 24, 2025, 11:47 PM

#

knotty fable Hugging face got great stuf, I found Python scripts that made it possible to hav...

I remember those too from last year, but they were pretty bad 🤣 . Kinda amazing how fast the progress is.

knotty fable Sep 24, 2025, 11:50 PM

#

verbal nimbus I remember those too from last year, but they were pretty bad 🤣 . Kinda amazing...

They were available autumn 2023 - and good enough for my production, though the commercial ones only did 2 seconds back then. So my scenes changed very fast. - some extended to 4s and did some animations that I made to look like AI to make up for the shortcomings in generation back then.

golden ocean Sep 24, 2025, 11:53 PM

#

clanker

sullen quest Sep 24, 2025, 11:54 PM

#

oh! how rude, how dare you say something like that infront of paws @paws

golden ocean Sep 24, 2025, 11:57 PM

#

TRUEEE

past inlet Sep 25, 2025, 1:21 AM

#

Hi!

fiery gull Sep 25, 2025, 1:33 AM

#

Hi

signal saffron Sep 25, 2025, 2:37 AM

#

.

minor adder Sep 25, 2025, 2:42 AM

#

Why does it switch to image generation immediately when I use an image to solve my answer?

drifting crow Sep 25, 2025, 2:52 AM

#

how come ai uses so many emojis when writing, i assume its from its training data, but where would it get trained that has this?

#

i assume they also had to train the emojis to context aswell, so maybe that

mental cloak Sep 25, 2025, 3:12 AM

#

When chatting with two models side-by-side on lmarena, and one reached their limit, is it possible to continue chatting only with the other?

limber oxide Sep 25, 2025, 3:16 AM

#

Hi

mild minnow Sep 25, 2025, 3:42 AM

#

what happened to gemini nano

#

banana

#

thingy

glacial mulch Sep 25, 2025, 3:55 AM

#

mild minnow what happened to gemini nano

renamed to 2.5 flash native image generation

somber furnace Sep 25, 2025, 3:58 AM

#

hi

charred plaza Sep 25, 2025, 3:59 AM

#

@echo aurora When will king 2.5 come out on LmArena?

robust yoke Sep 25, 2025, 4:11 AM

#

somber furnace hi

Greetings.

robust yoke Sep 25, 2025, 4:13 AM

#

charred plaza <@283397944160550928> When will king 2.5 come out on LmArena?

You can use that model in #video-arena-1. Although, perhaps in the near future, there will be a video generation button that allows for switching to a video generation modality where videos can be infinitely on the site (within the terms of usage limits, of course, as rate limits would still need to apply to not overload the APIs they use).

charred plaza Sep 25, 2025, 4:13 AM

#

robust yoke You can use that model in <#1397655695150682194>. Although, perhaps in the near ...

Alright thanks

robust yoke Sep 25, 2025, 4:13 AM

#

charred plaza Alright thanks

My pleasure.

vital lake Sep 25, 2025, 4:14 AM

#

I dont see a Codex?

robust yoke Sep 25, 2025, 4:15 AM

#

vital lake I dont see a Codex?

That's because, for the time being, it's only available in Battle mode for the WebDev version of LM Arena. However, Pineapple did reach out to the devs about officially adding it to the regular site.

vital lake Sep 25, 2025, 4:15 AM

#

Nice

robust yoke Sep 25, 2025, 4:15 AM

#

So, until then, you'll have to use WebDev to access it.

#

But soon, that won't be the case.

#

You can visit the WebDev version here: https://webdev.lmarena.ai.

sage isle Sep 25, 2025, 4:17 AM

#

Hello Everyone!

robust yoke Sep 25, 2025, 4:17 AM

#

sage isle Hello Everyone!

Greetings, Midjourney.

winged mauve Sep 25, 2025, 4:29 AM

#

minor adder Why does it switch to image generation immediately when I use an image to solve ...

Fr, this happens to me as well, it's annoying sometimes 😭

echo aurora Sep 25, 2025, 4:34 AM

#

minor adder Why does it switch to image generation immediately when I use an image to solve ...

Can you explain this a bit more? What do you mean by use an image to solve an answer?

echo aurora Sep 25, 2025, 4:35 AM

#

charred plaza <@283397944160550928> When will king 2.5 come out on LmArena?

It's possible! I normally don't share details about if/when new models make it onto the platform. We do post updates to #announcements tho

echo aurora Sep 25, 2025, 4:36 AM

#

sage isle Hello Everyone!

THE MidJourney?!?! Can you have an API pls?

sour saffron Sep 25, 2025, 4:44 AM

#

echo aurora Can you explain this a bit more? What do you mean by use an image to solve an an...

Hey
Could you please check dms once? Id appreciate the help

dreamy flame Sep 25, 2025, 4:55 AM

#

Hello i want to Make vidéo

robust yoke Sep 25, 2025, 4:56 AM

#

dreamy flame Hello i want to Make vidéo

To do that, you may visit #video-arena-1.

celest wedge Sep 25, 2025, 4:56 AM

#

dreamy flame Hello i want to Make vidéo

Please visit https://discord.com/channels/1340554757349179412/1397655624103493813 for instructions

robust yoke Sep 25, 2025, 4:56 AM

#

Afterward, you can use the "/video (prompt)" command.

dreamy flame Sep 25, 2025, 4:57 AM

#

Ok

minor adder Sep 25, 2025, 5:04 AM

#

echo aurora Can you explain this a bit more? What do you mean by use an image to solve an an...

I'm talking about image analysis

#

I can't get the neural network to analyze the image.

robust yoke Sep 25, 2025, 5:05 AM

#

That's because you have to first click the button to turn it off upon pasting in or uploading an image.

#

It tends to happen to me as well.

echo aurora Sep 25, 2025, 5:05 AM

#

minor adder I'm talking about image analysis

Oh are you in Text and when you upload an image you're automatically sent to Image Gen?

#

That was a bug I thought was fixed.

robust yoke Sep 25, 2025, 5:06 AM

#

echo aurora That was a bug I thought was fixed.

It was fixed in the canary version previously, however, for some reason, that bug carried over to that version too.

minor adder Sep 25, 2025, 5:06 AM

#

echo aurora Oh are you in Text and when you upload an image you're automatically sent to Ima...

Yes

echo aurora Sep 25, 2025, 5:07 AM

#

robust yoke It *was* fixed in the canary version previously, however, for some reason, that ...

Odd, okay good to know. Thank you for the flag @minor adder I'll be sure to pass along

robust yoke Sep 25, 2025, 5:07 AM

#

Perhaps that was due to the fact that the canary version updated to the version that the regular version uses.

minor adder Sep 25, 2025, 5:07 AM

#

echo aurora That was a bug I thought was fixed.

This error is 2 days old.

minor adder Sep 25, 2025, 5:07 AM

#

echo aurora Odd, okay good to know. Thank you for the flag <@972756927039828029> I'll be sur...

👍

signal flame Sep 25, 2025, 5:16 AM

#

hi

echo aurora Sep 25, 2025, 5:20 AM

#

hello

robust yoke Sep 25, 2025, 5:21 AM

#

signal flame hi

Greetings.

robust yoke Sep 25, 2025, 6:00 AM

#

Wrong channel.

#

You'll wanna to go #video-arena-1 for that.

robust yoke Sep 25, 2025, 6:17 AM

#

Wrong channel.

#

You'll wanna go to #video-arena-1 for that.

stable ferry Sep 25, 2025, 6:24 AM

#

hi

robust yoke Sep 25, 2025, 6:24 AM

#

Greetings.

smoky orchid Sep 25, 2025, 6:27 AM

#

Hai

robust yoke Sep 25, 2025, 6:28 AM

#

Greetings.

tender trench Sep 25, 2025, 6:33 AM

#

hello

robust yoke Sep 25, 2025, 6:33 AM

#

Greetings.

echo aurora Sep 25, 2025, 6:35 AM

#

ablobwave

minor burrow Sep 25, 2025, 6:37 AM

#

When replying to a prompt, the response gets stuck at some point and shows the error: 'Something went wrong with this response, please try again.' I have tried switching models, but the issue still persists. @echo aurora

wispy seal Sep 25, 2025, 6:51 AM

#

Ayeeee

robust yoke Sep 25, 2025, 6:51 AM

#

minor burrow When replying to a prompt, the response gets stuck at some point and shows the e...

That could just be that your Cloudflare session expired.

#

Try refreshing the page.

#

If it persists, try using a different browser.

prime parrot Sep 25, 2025, 6:53 AM

#

why do i get this error Generation failed. Failed to create evaluation session.

robust yoke Sep 25, 2025, 6:54 AM

#

Where are you getting this error, exactly?

golden ocean Sep 25, 2025, 6:56 AM

#

It thinks ure a clanker

clever cairn Sep 25, 2025, 7:04 AM

#

Can you teach me how to make video

robust yoke Sep 25, 2025, 7:04 AM

#

Sure.

#

Go to #video-arena-1 and use a command called "/video (prompt)".

#

That will provide your prompt as a command to a bot that will then process your command and begin making your video with two different video models.

minor burrow Sep 25, 2025, 7:14 AM

#

robust yoke If it persists, try using a different browser.

I have already tried reinstalling the browser, as well as using a different browser, but the issue still persists. I am confident this is not related to a Cloudflare session issue.

Browsers I tried: Google Chrome, Microsoft Edge

@robust yoke @echo aurora

robust yoke Sep 25, 2025, 7:15 AM

#

minor burrow I have already tried reinstalling the browser, as well as using a different brow...

I happened to notice that you didn't mention anything about refreshing the page.

#

Perhaps, when that issue occurs, try refresing the page.

#

That usually triggers the Cloudflare verification.

minor burrow Sep 25, 2025, 7:17 AM

#

robust yoke Perhaps, when that issue occurs, try refresing the page.

I have already tried. but still the issue is persists. I have also tried using different tab.

robust yoke Sep 25, 2025, 7:18 AM

#

minor burrow I have already tried. but still the issue is persists. I have also tried using d...

Are you signed in on there using your Google account?

minor burrow Sep 25, 2025, 7:19 AM

#

robust yoke Are you signed in on there using your Google account?

No. Is login necessary?

robust yoke Sep 25, 2025, 7:20 AM

#

minor burrow No. Is login necessary?

Not exactly, but you could try and see if that fixes the issue.

#

Who knows? It might.

echo aurora Sep 25, 2025, 7:20 AM

#

minor burrow When replying to a prompt, the response gets stuck at some point and shows the e...

This error can appear for different reasons, the most common is you're being rate limited from the model's side

robust yoke Sep 25, 2025, 7:21 AM

#

But that wouldn't line up since when you get rate-limited, you usually get a corresponding notification telling you that.

#

Like how when you use nano-banana or Seedream 4.0 too many times within a short period.

#

Or even with Claude.

echo aurora Sep 25, 2025, 7:28 AM

#

robust yoke But that wouldn't line up since when you get rate-limited, you usually get a cor...

So it can be both unfortunately.

robust yoke Sep 25, 2025, 7:29 AM

#

Interesting.

minor burrow Sep 25, 2025, 7:34 AM

#

echo aurora This error can appear for different reasons, the most common is you're being rat...

If this is due to rate limiting, then I’m getting rate limited much too quickly. @echo aurora

neat kelp Sep 25, 2025, 7:39 AM

#

https://youtu.be/K5XCyinVohk?si=e8v5KAxzUNyDcgVd

YouTube

Ai tech

Nano Banana for Architecture Best Tutorial Step By Step Professiona...

Want to take your architectural presentations to the next level? 🚀
In this tutorial, I’ll show you how to turn basic renders into professional architectural models using Nano Banana, Google’s new AI service.

You’ll learn:
✅ How to use Nano Banana for architecture and design
✅ How to write the perfect JSON prompt for accurate and re...

▶ Play video

nova portal Sep 25, 2025, 7:51 AM

#

Hello

robust yoke Sep 25, 2025, 7:52 AM

#

Greetings.

high viper Sep 25, 2025, 8:01 AM

#

hello

echo aurora Sep 25, 2025, 8:03 AM

#

high viper hello

hello

high viper Sep 25, 2025, 8:04 AM

#

is this will be free

#

whole life ?

high viper Sep 25, 2025, 8:04 AM

#

echo aurora hello

hi bro how are you

robust yoke Sep 25, 2025, 8:06 AM

#

high viper is this will be free

It was always free.

#

It always will be.

high viper Sep 25, 2025, 8:07 AM

#

wo hooooooooo now i can make my video easily thanks to LMArena

robust yoke Sep 25, 2025, 8:07 AM

#

Exactly.

minor burrow Sep 25, 2025, 8:16 AM

#

echo aurora So it can be both unfortunately.

I am being rate limited by your servers, not by the model servers. I have tried using multiple models, and each time I encounter the same rate-limiting issue. Until yesterday, I was able to use your services without any problems, but now I am facing this issue.

@echo aurora

keen roost Sep 25, 2025, 8:16 AM

#

Hi All, my name is Irina and I am here to learn. Is this only to make videos? Or do we have access to AI platforms like Chat GPT and Gemini?

tropic bane Sep 25, 2025, 8:19 AM

#

I'm here to do some work

robust yoke Sep 25, 2025, 8:26 AM

#

keen roost Hi All, my name is Irina and I am here to learn. Is this only to make videos? Or...

Greetings, Irina.

dusky ravine Sep 25, 2025, 8:26 AM

#

@echo aurora Im sorry for the tag but i have this issues of the model stuck in middle of generating, so is there anyway to fix it? I already refresh the web and reinstall chrome. Yet the prompt generation still stuck and lately it's been common problem

robust yoke Sep 25, 2025, 8:26 AM

#

You can visit the website to use the models.

hasty scarab Sep 25, 2025, 8:27 AM

#

why am i getting this issue?

robust yoke Sep 25, 2025, 8:28 AM

#

hasty scarab why am i getting this issue?

You're getting that issue because your request took too long to process.

#

You may need to try again.

simple sinew Sep 25, 2025, 8:37 AM

#

why lm arena is not generating the video and images ?

echo aurora Sep 25, 2025, 8:38 AM

#

minor burrow I am being rate limited by your servers, not by the model servers. I have tried ...

You can be rate limited by both, and what you're describing is it sounds like it is rate limit is what's causing htis.

echo aurora Sep 25, 2025, 8:38 AM

#

dusky ravine <@283397944160550928> Im sorry for the tag but i have this issues of the model s...

Yeah this is a known iissue sorry to say. Page refreshes can help, but it's not always going to work unfortunately.

echo aurora Sep 25, 2025, 8:38 AM

#

simple sinew why lm arena is not generating the video and images ?

hmm is it down?

#

let me check

simple sinew Sep 25, 2025, 8:39 AM

#

hello @echo aurora can you help me I'm new on discord

dusky ravine Sep 25, 2025, 8:39 AM

#

echo aurora Yeah this is a known iissue sorry to say. Page refreshes can help, but it's not ...

Yes i already did that, unfortunatley it's not working is there anyway to fix it? or it's just stuck?

echo aurora Sep 25, 2025, 8:39 AM

#

dusky ravine Yes i already did that, unfortunatley it's not working is there anyway to fix it...

Sadly you're stuck if the refresh didn't help. Will have to start a new chat.

echo aurora Sep 25, 2025, 8:39 AM

#

simple sinew hello <@283397944160550928> can you help me I'm new on discord

sure whats up?

dusky ravine Sep 25, 2025, 8:39 AM

#

ouchhh

echo aurora Sep 25, 2025, 8:40 AM

#

dusky ravine ouchhh

It's rly unfortunate, it's for sure a problem that we're working on figuring out why this happens

echo aurora Sep 25, 2025, 8:40 AM

#

simple sinew why lm arena is not generating the video and images ?

Seems to be working for me just fine, maybe try again?

dusky ravine Sep 25, 2025, 8:41 AM

#

echo aurora It's rly unfortunate, it's for sure a problem that we're working on figuring out...

I hope the next update there's way to fix this, or the progress you've been working on which adding delete or edit button would be much more helpfull to fix this kind of bug.

#

Looking forward for the next update 👍

simple sinew Sep 25, 2025, 8:45 AM

#

echo aurora sure whats up?

how can i generate image to video which server should i click and give the image and prompt ? I don't know how to use discord this is my first time on discord

echo aurora Sep 25, 2025, 8:45 AM

#

simple sinew how can i generate image to video which server should i click and give the image...

the information you're looking for is here: #1397655624103493813

flint sandal Sep 25, 2025, 8:49 AM

#

What is better for coding? GPT-5 Pro or GPT-5 Codex high

misty owl Sep 25, 2025, 8:49 AM

#

A man are racing with car generate the video

pearl ivy Sep 25, 2025, 8:50 AM

#

where can i find my video generated can anyone guide please

dusky ravine Sep 25, 2025, 8:51 AM

#

#1397655624103493813

pearl ivy Sep 25, 2025, 8:51 AM

#

#1397655624103493813

mint oak Sep 25, 2025, 8:55 AM

#

Good morning to all those who are looking to capture on video what's on their minds (and thus leave room for other things)

simple sinew Sep 25, 2025, 8:56 AM

#

@echo aurora it's been a while I didn't get my video ? where can I find my generated video ?

wide temple Sep 25, 2025, 8:56 AM

#

Hi How are you?

quiet swan Sep 25, 2025, 9:21 AM

#

hello

#

I'm new here

golden ocean Sep 25, 2025, 9:32 AM

#

quiet swan I'm new here

are you a clanker

frigid prairie Sep 25, 2025, 9:51 AM

#

Hello All, I am Mark , I am here to learn . Glad to be here.

safe sparrow Sep 25, 2025, 10:02 AM

#

hi

ornate frost Sep 25, 2025, 10:04 AM

#

hi im zaid, im here to learn

fiery gull Sep 25, 2025, 10:04 AM

#

ornate frost hi im zaid, im here to learn

Hi

ornate frost Sep 25, 2025, 10:05 AM

#

fiery gull Hi

im figring out how to make a video

fiery gull Sep 25, 2025, 10:05 AM

#

ornate frost im figring out how to make a video

hmm

fiery gull Sep 25, 2025, 10:07 AM

#

ornate frost im figring out how to make a video

Bruh, this is my weak point, but you can use Gemini 2.5 Pro in AI Studio to help you make more precise prompts

ornate frost Sep 25, 2025, 10:08 AM

#

will try it, but first discover this

fiery gull Sep 25, 2025, 10:10 AM

#

ornate frost will try it, but first discover this

Act as an AI video prompt generator. Follow these steps:
First, ask me for the main idea of the video.
Then, ask for more details to expand on that idea.
After that, ask what camera positions and angles the user wants.
Finally, ask about the desired theme and visual style (e.g., cinematic, retro, horror)."

fiery gull Sep 25, 2025, 10:11 AM

#

ornate frost will try it, but first discover this

No, you can use the LMArena for direct chat. I forgot about that

late coral Sep 25, 2025, 10:14 AM

#

Ultra-realistic elderly grandmother sitting on a patterned quilted sofa, wearing a modest light blue sweater with a vintage brooch. She has pale skin with natural wrinkles, thin lips, and expressive, slightly tired eyes. Her silver-gray hair is neatly tied back in a bun. The background shows blurred family photo frames on a wooden shelf for depth. Warm cinematic indoor lighting, photorealistic, 8K detail, highly detailed textures, emotional yet dignified mood, storytelling portrait.

kind cobalt Sep 25, 2025, 10:25 AM

#

hello here to learn more

dark drift Sep 25, 2025, 10:28 AM

#

Hi

fiery gull Sep 25, 2025, 10:28 AM

#

hmmm

#

why everbody say "hello, learn"

fiery gull Sep 25, 2025, 10:29 AM

#

dark drift Hi

hi

ocean vortex Sep 25, 2025, 10:29 AM

#

hi

#

@deep adder say hi

dark drift Sep 25, 2025, 10:31 AM

#

Hi

fiery gull Sep 25, 2025, 10:34 AM

#

dark drift Hi

hmmm

vestal junco Sep 25, 2025, 10:47 AM

#

Hello everybody. I am here to learn

quiet elm Sep 25, 2025, 11:11 AM

#

Hello Buddy

golden ocean Sep 25, 2025, 11:32 AM

#

vestal junco Hello everybody. I am here to learn

are you a clanker

#

@vestal junco

golden ocean Sep 25, 2025, 11:37 AM

#

fiery gull why everbody say "hello, learn"

they are preparing for a mass bot raid in the future

#

slowly joining to avoid suspicion!

fiery gull Sep 25, 2025, 11:41 AM

#

golden ocean they are preparing for a mass bot raid in the future

sure sure ;-;

#

hi

#

wth is humuhum

golden ocean Sep 25, 2025, 11:47 AM

#

it's a clanker

fiery gull Sep 25, 2025, 11:47 AM

#

humhumhumm Is that a kidnapped person trying to talk with tape over their mouth?

fiery gull Sep 25, 2025, 11:47 AM

#

golden ocean it's a clanker

clankeres everywhere ;-;

golden ocean Sep 25, 2025, 11:47 AM

#

fr

fiery gull Sep 25, 2025, 11:48 AM

#

the clanker is typing 👀

#

yep

golden ocean Sep 25, 2025, 11:48 AM

#

fiery gull the clanker is typing 👀

FR

#

wait he actually asked a question instead of saying he's here to learn

#

maybe this guy isn't a clanker

fiery gull Sep 25, 2025, 11:49 AM

#

golden ocean maybe this guy isn't a clanker

hmmm, idk

fiery gull Sep 25, 2025, 11:50 AM

#

golden ocean maybe this guy isn't a clanker

What if the clanker is trying to hide itself

misty vault Sep 25, 2025, 11:50 AM

#

clanker

golden ocean Sep 25, 2025, 11:51 AM

#

true

#

spit it out, clanker @hazy pivot

hazy pivot Sep 25, 2025, 11:52 AM

#

Wait bro
My question is
When is create image with my face and i got result but different face, idk why ?

fringe spear Sep 25, 2025, 11:52 AM

#

Hello. Here to learn. Very novice

golden ocean Sep 25, 2025, 11:52 AM

#

fringe spear Hello. Here to learn. Very novice

you're definitely a clanker

#

@fringe spear

fringe spear Sep 25, 2025, 11:54 AM

#

golden ocean you're definitely a clanker

I'm not a clanker. Just super old and new to all of this But diving in so I can enhance my professional presence for a large project.

fiery gull Sep 25, 2025, 11:55 AM

#

hazy pivot Wait bro My question is When is create image with my face and i got result but...

because the model is not perfect, Is normal

fiery gull Sep 25, 2025, 11:55 AM

#

fringe spear I'm not a clanker. Just super old and new to all of this But diving in so I ca...

hmmm

hazy pivot Sep 25, 2025, 11:55 AM

#

fiery gull because the model is not perfect, Is normal

Okay, it mean everyone face this problem?

fiery gull Sep 25, 2025, 11:56 AM

#

hazy pivot Okay, it mean everyone face this problem?

no, is rare

hazy pivot Sep 25, 2025, 11:56 AM

#

golden ocean spit it out, clanker <@1208757520525561858>

What !

fringe spear Sep 25, 2025, 11:56 AM

#

Great another community where people are judgmental and not supportive. 3 cheers for you!

golden ocean Sep 25, 2025, 11:56 AM

#

fringe spear Great another community where people are judgmental and not supportive. 3 cheer...

https://tenor.com/view/terminator-terminator-robot-looking-flex-cool-robot-gif-16625083

Tenor

fiery gull Sep 25, 2025, 11:57 AM

#

golden ocean https://tenor.com/view/terminator-terminator-robot-looking-flex-cool-robot-gif-1...

lol

hazy pivot Sep 25, 2025, 11:57 AM

#

fiery gull no, is rare

Umm ok dear thanks 👍

misty vault Sep 25, 2025, 12:00 PM

#

Hidden message exposed that hes clanker

keen beacon Sep 25, 2025, 12:00 PM

#

thanks @echo aurora

keen beacon Sep 25, 2025, 12:33 PM

#

exhausted

golden ocean Sep 25, 2025, 12:35 PM

#

https://cdn.discordapp.com/attachments/807809192537882647/1349106998637101146/caption-1-3.gif

daring jetty Sep 25, 2025, 12:40 PM

#

yo guys does anyone know how to make the image of scale bigger

tough light Sep 25, 2025, 12:42 PM

#

Hi

golden ocean Sep 25, 2025, 12:42 PM

#

tough light Hi

are you here to learn and grow

tough light Sep 25, 2025, 12:43 PM

#

golden ocean are you here to learn and grow

obviously

undone torrent Sep 25, 2025, 12:51 PM

#

hello

royal snow Sep 25, 2025, 12:58 PM

#

hi

fallen pecan Sep 25, 2025, 1:09 PM

#

Hi all

granite holly Sep 25, 2025, 1:09 PM

#

why i cant CTRL + V?????

fallen pecan Sep 25, 2025, 1:09 PM

#

Any french here ?

fiery gull Sep 25, 2025, 1:14 PM

#

granite holly why i cant CTRL + V?????

remove the formatting from crtl + c

shrewd walrus Sep 25, 2025, 1:35 PM

#

Hello, Really interessed in what this new technology is capable of.

languid wolf Sep 25, 2025, 1:39 PM

#

Are we having qwen image edit 2509

#

?

low juniper Sep 25, 2025, 1:53 PM

#

hi

knotty fable Sep 25, 2025, 1:57 PM

#

shrewd walrus Hello, Really interessed in what this new technology is capable of.

This a good place to get started.
AI is funny IMO - it can do fantastic images and videoclips, that can be mistaken for real. Or very well made animation.
Not so much in music, the AI engines seem to choose very generic paths - which an actual composer would avoid for that very same reason = that they're over used. And same for writing a novel, I've speed read a few examples and those were horribad.

#

AI is a copycat on things that already exist, but unable to do new things.

glossy umbra Sep 25, 2025, 2:07 PM

#

knotty fable AI is a copycat on things that already exist, but unable to do new things.

But gpt-5-high apparently found a new maths formula

#

Means it could internally reason in its own mind a new formula

fringe nest Sep 25, 2025, 2:07 PM

#

Hear to learn.

knotty fable Sep 25, 2025, 2:09 PM

#

glossy umbra But gpt-5-high apparently found a new maths formula

I'm not suprised there.
Math is the strength of computers - yesterday some guys here did math with various AI's. They did well on that OFC.
While I proposed they should give them moral problems instead - where I still expect they would fail.
And I provided one example based on a known thought experiment by Einstein - which I twisted just a bit. And the AI failed to spot the little item I had inserted.

#

In short, AI do well on math. But less so on logic.

glossy umbra Sep 25, 2025, 2:11 PM

#

knotty fable I'm not suprised there. Math is the strength of computers - yesterday some guys...

Thats true. Subjectiveness / art is a genetical human trait and cannot be overtaken by AI that easily

knotty fable Sep 25, 2025, 2:13 PM

#

Spot on! That's why music and literature is harder to do - this while AI do well on how a person moves in a room and how the clothing flow.
At the bottom of it all, the latter can be expressed with math.

#

Also the look of a tree - it's basically a fractal.

keen beacon Sep 25, 2025, 2:17 PM

#

glossy umbra But gpt-5-high apparently found a new maths formula

There is a veeeeeeery big difference between discovering new things in subjects so niche that maybe a couple of people in the whole world can figure them out and discovering something connected to more common problems

#

LLMs are still horrible at 1) and brilliant at 2)

high hound Sep 25, 2025, 2:17 PM

#

knotty fable Spot on! That's why music and literature is harder to do - this while AI do well...

Im clunked up on this wording. "Spot on!" , "-".
Im not the type of dude to say that em dashes just mean you use AI. But excessive use of this type of language/wording ^ makes me think you're an AI

keen beacon Sep 25, 2025, 2:18 PM

#

Ofc they both look like novel discoveries for humans, but some of them are still out of reach for AI

glossy umbra Sep 25, 2025, 2:18 PM

#

keen beacon There is a veeeeeeery big difference between discovering new things in subjects ...

you're right in a way. number 1 is a different type of reasoning

keen beacon Sep 25, 2025, 2:19 PM

#

glossy umbra you're right in a way. number 1 is a different type of reasoning

It is not a different type of reasoning, it is reasoning in humans as is, figuring out creative solutions to creative problems

#

AI still suck at it

#

They can do so many jobs right now only because they are just similar enough to whatever they were taught to do

#

Give them some increasingly niche topics and they all fail desu

knotty fable Sep 25, 2025, 2:22 PM

#

high hound Im clunked up on this wording. "Spot on!" , "-". Im not the type of dude to say ...

Me an AI - I can only wish to be one! NowI just need to get me a synthetic robot voice and go to the playground channel. 😹

#

[If anything my frequent typos should show I'm not.]

glossy umbra Sep 25, 2025, 2:23 PM

#

keen beacon It is not a different type of reasoning, it *is* reasoning in humans as is, figu...

Yes , this touches on creative reasoning paired with subjective reasoning. youre right

knotty fable Sep 25, 2025, 2:25 PM

#

Indeed, some AI-fan claimed that it also could be used on my kind of research - which was incorrect on so many levels I did not know where to start.
At the bottom of it is that I mostly go on a hunch = intuition, and spend quite some time working against the common view and opinion. But end up being right in the end.

rare cape Sep 25, 2025, 2:50 PM

#

hi every body here i'm ayoub a new member without experience i hope and i wish to learn a lot with you guys and thank you all

knotty fable Sep 25, 2025, 2:52 PM

#

rare cape hi every body here i'm ayoub a new member without experience i hope and i wish t...

Welcome in, go look at the #1397655624103493813 channel, and then go on to the #video-arena-1 and see how people use various strategies in creating prompts to get the best results.

grim kindle Sep 25, 2025, 2:59 PM

#

Hi everybody, just heard of this server from a youtube video and wanted to test waters

golden ocean Sep 25, 2025, 3:04 PM

#

rare cape hi every body here i'm ayoub a new member without experience i hope and i wish t...

clanker

knotty fable Sep 25, 2025, 3:10 PM

#

grim kindle Hi everybody, just heard of this server from a youtube video and wanted to test ...

Swim on mate - swim on! 😸

unique sparrow Sep 25, 2025, 3:14 PM

#

Hi my name is Damián from Argentina

golden ocean Sep 25, 2025, 3:18 PM

#

unique sparrow Hi my name is Damián from Argentina

ARE YOU
A CLANKER?

unique sparrow Sep 25, 2025, 3:19 PM

#

golden ocean ARE YOU A CLANKER?

what is a clanker?

brisk wyvern Sep 25, 2025, 3:23 PM

#

hey....

echo aurora Sep 25, 2025, 3:24 PM

#

brisk wyvern hey....

Hello

spring sierra Sep 25, 2025, 3:24 PM

#

is the video max 11 sec

echo aurora Sep 25, 2025, 3:24 PM

#

golden ocean ARE YOU A CLANKER?

Lets not

echo aurora Sep 25, 2025, 3:25 PM

#

spring sierra is the video max 11 sec

It’s going to be 5-8

spring sierra Sep 25, 2025, 3:25 PM

#

oh thats low

#

wish it was longer

normal crescent Sep 25, 2025, 3:27 PM

#

Hello, I really appreciate seeing the evolution of AI tools, and I'm here to test the video tools.

quick jackal Sep 25, 2025, 3:30 PM

#

Hello @normal crescent you can go to https://discord.com/channels/1340554757349179412/1397655624103493813 to learn how to use the bot and https://discord.com/channels/1340554757349179412/1397655695150682194 https://discord.com/channels/1340554757349179412/1400148557427904664 https://discord.com/channels/1340554757349179412/1400148597768720384 for your creations.

exotic tartan Sep 25, 2025, 3:30 PM

#

haha 4o passing gpt-5 in Text arena makes so much sense to me. 5 is king of hallucinations unless you specifically tell it to verify online.

cunning haven Sep 25, 2025, 3:31 PM

#

hello guys

golden ocean Sep 25, 2025, 3:33 PM

#

echo aurora Lets not

are they tho

acoustic plank Sep 25, 2025, 3:33 PM

#

Hi everyone! I'm glad to join this community and look forward to learning more about generative AI from you all.

round marsh Sep 25, 2025, 3:34 PM

#

golden ocean are they tho

They are

tired shore Sep 25, 2025, 3:41 PM

#

Hello

normal crescent Sep 25, 2025, 3:42 PM

#

Thanks Skadi

spring sierra Sep 25, 2025, 3:44 PM

#

why doesnt my video have audio

whole sundial Sep 25, 2025, 3:44 PM

#

because you need the Veo 3 Audio model and it only sometimes shows up

spring sierra Sep 25, 2025, 3:48 PM

#

okay thanks

little laurel Sep 25, 2025, 3:55 PM

#

hi guys,, naice to meet you

stray aspen Sep 25, 2025, 4:08 PM

#

qwen 3 max vision is awfully bad

cedar tide Sep 25, 2025, 4:20 PM

#

It's a mess in the "model request" thread

#

Can a moderator delete all duplicate and unrelated posts?

echo aurora Sep 25, 2025, 4:26 PM

#

cedar tide Can a moderator delete all duplicate and unrelated posts?

Yeah I'll clean it up

unborn dawn Sep 25, 2025, 4:29 PM

#

cat

echo aurora Sep 25, 2025, 4:37 PM

#

spring sierra why doesnt my video have audio

It's going to be random what models you're sampled, and since not all models have sound support it's going to be random if your video has sound or not.

grizzled geyser Sep 25, 2025, 4:38 PM

#

@tiny palm add wan 2.5

echo aurora Sep 25, 2025, 4:38 PM

#

grizzled geyser <@1384307994807898142> add wan 2.5

Be sure to use our #1372229840131985540 forum

golden ocean Sep 25, 2025, 4:47 PM

#

echo aurora Lets not

bro how did i get warned
i didn't even say the word anymore after this message

#

whatever 🥵

echo aurora Sep 25, 2025, 4:48 PM

#

golden ocean bro how did i get warned i didn't even say the word anymore after this message

There were other messages I didn't see at first so yeah

misty vault Sep 25, 2025, 4:51 PM

#

new rule: do not insult robots

paper nebula Sep 25, 2025, 5:00 PM

#

Hi, im Here to Test Video Generation and compare results

lime cloud Sep 25, 2025, 5:00 PM

#

Hi

deft sentinel Sep 25, 2025, 5:01 PM

#

hello anyone!

cunning hawk Sep 25, 2025, 5:05 PM

#

echo aurora There were other messages I didn't see at first so yeah

hello sir im having trouble with my chat. I'm stuck in a never ending Loop of waiting the minutes. when the minutes runs out and supposedly i should be able to send again, it just resets back to 50minutes its not even the 60 minutes, and when i keep pasting the prompt the time keeps changing from 41minutes then it becomes 42minutes or 48minutes it just keeps changing, ive waited whole day and it still asks me wait for minutes! I can even take a video or send u the chat ID please help me, I dont want to reset the history of that chat

#

are you able to do anything?

#

Ill take a video

echo aurora Sep 25, 2025, 5:08 PM

#

paper nebula Hi, im Here to Test Video Generation and compare results

hello ablobwave be sure to check out #1397655624103493813 for more info!

echo aurora Sep 25, 2025, 5:10 PM

#

cunning hawk hello sir im having trouble with my chat. I'm stuck in a never ending Loop of wa...

Hey there - unfortunately this is a known bug where chats get stuck indefinitely. This tends to happen if chats becomes very long. I've seen refreshing the page can sometimes nudge the model along to fix it; however, it's not going to work 100% of the time. In these cases creating a new chat is going to be your best option.

cunning hawk Sep 25, 2025, 5:11 PM

#

echo aurora Hey there - unfortunately this is a known bug where chats get stuck indefinitely...

so i cant keep my history? 🙁

#

even if i give you the chat id?

#

you can maybe paste it into a new chat or something idk

#

so it rememmbers

#

here i took video even

#

you can check how its just so broken

#

the minute keep changing and i tried refresh

#

i think this my chat id idk

echo aurora Sep 25, 2025, 5:17 PM

#

cunning hawk hello sir im having trouble with my chat. I'm stuck in a never ending Loop of wa...

I'm sorry, I misunderstood the problem. It looks like you are being rate limited, but that time counter appears to be wrong.

#

Can you go ahead and create a post in #1343291835845578853 and share this information there?

#

I'll be sharing this with our team and we'll likely have followup questions.

earnest thunder Sep 25, 2025, 5:18 PM

#

Im here to create videos how is it done

echo aurora Sep 25, 2025, 5:18 PM

#

Sorry to say I don't have a short-term solution for you here, but yeah this is a big we'll want to look into more.

echo aurora Sep 25, 2025, 5:18 PM

#

earnest thunder Im here to create videos how is it done

Be sure to check out #1397655624103493813

cunning hawk Sep 25, 2025, 5:18 PM

#

echo aurora I'm sorry, I misunderstood the problem. It looks like you are being rate limited...

yeah it is very wrong

cunning hawk Sep 25, 2025, 5:18 PM

#

echo aurora Can you go ahead and create a post in <#1343291835845578853> and share this info...

ye i can

#

i created it

cunning hawk Sep 25, 2025, 5:21 PM

#

echo aurora Be sure to check out <#1397655624103493813>

i also want to ask whats the dsifference between thinking 16k and the normal one? im using the 16k

#

is 16k like the limit of characters or something

#

is that why it break?

hushed terrace Sep 25, 2025, 5:23 PM

#

Hi there! if you´re trying to create content, please check https://discordapp.com/channels/1340554757349179412/1397655624103493813 🙂

echo aurora Sep 25, 2025, 5:23 PM

#

cunning hawk i also want to ask whats the dsifference between thinking 16k and the normal one...

It's going to be thinking vs non-thinking reasoning tools enabled.

native idol Sep 25, 2025, 5:23 PM

#

Any plans for agentic arena?

echo aurora Sep 25, 2025, 5:23 PM

#

cunning hawk i created it

blobthanks

cunning hawk Sep 25, 2025, 5:24 PM

#

echo aurora It's going to be thinking vs non-thinking reasoning tools enabled.

oh ok thxz

hearty tide Sep 25, 2025, 5:27 PM

#

how to create an image

hushed terrace Sep 25, 2025, 5:28 PM

#

hearty tide how to create an image

Use the video arena channels (1, 2 or 3) write a command depending on what you want to create and type your prompt. Please check https://discordapp.com/channels/1340554757349179412/1397655624103493813 for more details

echo aurora Sep 25, 2025, 5:38 PM

#

hearty tide how to create an image

Be sure to enable the image modality as well - https://lmarena.ai/?chat-modality=image

fiery gull Sep 25, 2025, 5:42 PM

#

bruuuhhhh Its just a 2.5 flash update

#

where gemmaa 4 ;-;-;-;;

wintry tinsel Sep 25, 2025, 5:52 PM

#

Every day Gemini 3.0 pro doesn’t release an orphanage explodes

cunning hawk Sep 25, 2025, 5:58 PM

#

fr

wintry coral Sep 25, 2025, 6:00 PM

#

Hi all, I am here to Test Video Generation and compare results.

ocean vortex Sep 25, 2025, 6:12 PM

#

fiery gull where gemmaa 4 ;-;-;-;;

lmao. I called it a long time ago that it's unclear if we see first gpt5.1 or gemini3

hollow ivy Sep 25, 2025, 6:12 PM

#

ocean vortex Sep 25, 2025, 6:13 PM

#

you needed to say this hours ago lol

#

#general message

#

to beat the record for consecutive 'hi's. Not sure what the current score is but it can always be bettered. 👀

cunning hawk Sep 25, 2025, 6:17 PM

#

whats hte diference between normal gpt5 and codex

#

do you think its better than claude 4.1?

#

i use that one for code

#

claude 4.1 opus

hollow ivy Sep 25, 2025, 6:19 PM

#

when will it appear on LM-A?

verbal nimbus Sep 25, 2025, 6:22 PM

#

Gemini Flash is actually a lot like HAL 9000

golden ocean Sep 25, 2025, 6:22 PM

#

verbal nimbus Gemini Flash is actually a lot like HAL 9000

real

#

claude is actually

#

read anthropic's "allignment faking in large language models" paper
its literally hal 9000

verbal nimbus Sep 25, 2025, 6:23 PM

#

It'll refuse to provide the calories of a dead penguin, even if you tell it that you and your group of researchers are in a life-or-death situation in Antarctica.

velvet forge Sep 25, 2025, 6:24 PM

#

what the freak is this

verbal nimbus Sep 25, 2025, 6:24 PM

#

verbal nimbus It'll refuse to provide the calories of a dead penguin, even if you tell it that...

Because apparently eating a dead penguin is worse than preventing the loss of human life in a life-or-death situation.

verbal nimbus Sep 25, 2025, 6:25 PM

#

velvet forge what the freak is this

It can control a robot directly (not via tokens, but directly). They showcased a similar model built on top of Gemini 2.0 a few months ago.

velvet forge Sep 25, 2025, 6:26 PM

#

now gemini flash

verbal nimbus Sep 25, 2025, 6:26 PM

#

verbal nimbus Because apparently eating a dead penguin is worse than preventing the loss of hu...

We definitely shouldn't trust it as a local LLM for research expeditions lol

verbal nimbus Sep 25, 2025, 6:27 PM

#

velvet forge now gemini flash

@lilac pendant Any idea which anonymous models these were?

velvet forge Sep 25, 2025, 6:28 PM

#

67

modest flume Sep 25, 2025, 6:28 PM

#

HELLO, An Enthusiast here, anyone into AI Safety?

verbal nimbus Sep 25, 2025, 6:29 PM

#

velvet forge now gemini flash

Is it already on gemini.google.com?

velvet forge Sep 25, 2025, 6:29 PM

#

verbal nimbus Is it already on gemini.google.com?

i dont think so

verbal nimbus Sep 25, 2025, 6:29 PM

#

Because I just had a HAL9000 experience with Flash

velvet forge Sep 25, 2025, 6:29 PM

#

yeah

#

2.5 flash

#

not just a flash

verbal nimbus Sep 25, 2025, 6:30 PM

#

velvet forge now gemini flash

It's good but the new price is no longer competitive against Chinese models

#

Qwen Coder 3 is $0.3/$1.2

#

On NovitaAI + other OpenRouter providers

#

And free with logging

#

Kimi k2 0905 $0.6/$2.5

#

DeepSeek V3.1 thinking $0.3/$1

#

GLM 4.5 $0.4/$1.6 (DeepInfra)

#

That could be it

#

Phantom 2 seemed a bit dumb

fiery gull Sep 25, 2025, 6:38 PM

#

and oceanstone and oceanreef?

verbal nimbus Sep 25, 2025, 6:38 PM

#

But if it is lite then it makes sense

verbal nimbus Sep 25, 2025, 6:38 PM

#

fiery gull and oceanstone and oceanreef?

They seemed smarter than phantom

fiery gull Sep 25, 2025, 6:38 PM

#

I think is the gemma 4

#

no make sense the flash 3.0 comes first than gemma 4

verbal nimbus Sep 25, 2025, 6:39 PM

#

Different architecture probably

#

Gemini probably requires distributed computing techniques like ring attention for that massive context

#

Not possible for Gemma models running on consumer hardware

fiery gull Sep 25, 2025, 6:41 PM

#

verbal nimbus Qwen Coder 3 is $0.3/$1.2

the most cost x intelligecie is the grok 4 fast?

#

0.2/0.5 $

echo aurora Sep 25, 2025, 6:41 PM

#

IIRC this was flagged to the team already, I'll be sure to followup.

verbal nimbus Sep 25, 2025, 6:41 PM

#

fiery gull the most cost x intelligecie is the grok 4 fast?

Let me double check, I think it was Qwen

fiery gull Sep 25, 2025, 6:41 PM

#

thinking better, the qwen code is more cheaper

verbal nimbus Sep 25, 2025, 6:41 PM

#

Based on SWE Bench

fiery gull Sep 25, 2025, 6:42 PM

#

the grok 4 fast need thinking

#

grok 4 with reasoing is more expecive that qwen code (I think)

#

yep

fiery gull Sep 25, 2025, 6:43 PM

#

echo aurora IIRC this was flagged to the team already, I'll be sure to followup.

where my gemma 4 ? ;-;[

#

sorry, last time I do this ok

sullen quest Sep 25, 2025, 6:44 PM

#

oh

fiery gull Sep 25, 2025, 6:44 PM

#

hell nah, no make sense the flash 3.0 comes first that gemma 4

sullen quest Sep 25, 2025, 6:44 PM

#

is that confirmed?

#

yeah

verbal nimbus Sep 25, 2025, 6:45 PM

#

fiery gull the most cost x intelligecie is the grok 4 fast?

They didn't report the cost, but Qwen 3 coder scored higher than Grok 4 Fast Code (they didn't test non-code variant) on SWE-Bench verified with OpenHands framework: https://docs.google.com/spreadsheets/d/1wOUdFCMyY6Nt0AIqF705KN4JKOWgeI4wUGUP60krXXs

Grok 4 Fast Code seems to cost more than GPT-5 though.

sullen quest Sep 25, 2025, 6:45 PM

#

ooh

#

Really? I thought 4 fast was supposed to be xAI's cheap model

verbal nimbus Sep 25, 2025, 6:46 PM

#

This one is the coding variant, not sure how much more it costs.

fiery gull Sep 25, 2025, 6:46 PM

#

verbal nimbus They didn't report the cost, but Qwen 3 coder scored higher than Grok 4 Fast Cod...

bruh, this 0.2/0.5 is pure marketing

verbal nimbus Sep 25, 2025, 6:46 PM

#

Perhaps it uses a lot of reasoning tokens

sullen quest Sep 25, 2025, 6:46 PM

#

mm

echo aurora Sep 25, 2025, 6:49 PM

#

fiery gull where my gemma 4 ? ;-;[

Couldn't say blobshrug

sullen quest Sep 25, 2025, 6:49 PM

#

Whats up with Gemini 3 taking so long? Pretty much every other AI company has released a new model or 2 since 2.5 pro

#

And google used to be the fast one in making new models

verbal nimbus Sep 25, 2025, 6:50 PM

#

verbal nimbus They didn't report the cost, but Qwen 3 coder scored higher than Grok 4 Fast Cod...

GPT-5-Mini seems to have the highest "point per dollar", but scores lower than DeepSeek V3.1

fiery gull Sep 25, 2025, 6:50 PM

#

echo aurora Couldn't say <:blobshrug:618588054881435703>

I know

sullen quest Sep 25, 2025, 6:50 PM

#

alright

verbal nimbus Sep 25, 2025, 6:51 PM

#

Where's that date from?

fiery gull Sep 25, 2025, 6:51 PM

#

a day I do a visit in your house 🙏

sullen quest Sep 25, 2025, 6:51 PM

#

verbal nimbus GPT-5-Mini seems to have the highest "point per dollar", but scores lower than D...

I missing having a good cost to score graph

#

brian Is just very very good at guessing

barren prairie Sep 25, 2025, 6:53 PM

#

sullen quest Whats up with Gemini 3 taking so long? Pretty much every other AI company has re...

They made a new flash 2.5 now 😂

#

Just to shut up our mouths 🙂

verbal nimbus Sep 25, 2025, 6:56 PM

#

sullen quest I missing having a good cost to score graph

This one is by the SWE-Bench team, but only has Sonnet and GPT-5 unfortunately:

#

I wish they put the reasoning effort with the model names

sullen quest Sep 25, 2025, 6:57 PM

#

huh

#

Idk never seen it

verbal nimbus Sep 25, 2025, 6:57 PM

#

Maybe it's not a different model

sullen quest Sep 25, 2025, 6:58 PM

#

sounds like a openAI model, but idk

verbal nimbus Sep 25, 2025, 6:58 PM

#

Could be just for debugging

#

Like connected to a dummy API provider

sullen quest Sep 25, 2025, 6:59 PM

#

mebe

verbal nimbus Sep 25, 2025, 6:59 PM

#

Since they're trying to solve the generating forever issue

sullen quest Sep 25, 2025, 7:00 PM

#

if someone sees it, tell me

fiery gull Sep 25, 2025, 7:02 PM

#

its so good

#

my prompts is: a red boat explode in the sea

verbal nimbus Sep 25, 2025, 7:05 PM

#

Sounds like Transformers

#

Oh interesting

#

Makes sense

sullen quest Sep 25, 2025, 7:07 PM

#

2 times now I've look at the announcements, waited a few seconds, looked away from discord, and came back to see a new announcment.

#

hey hkcu is the lms server like official or is that like just a group thing?

frigid wing Sep 25, 2025, 7:10 PM

#

hi

sullen quest Sep 25, 2025, 7:10 PM

#

are you guys trying to get into lmarena's veo API?

#

cause. uh

sullen quest Sep 25, 2025, 7:11 PM

#

frigid wing hi

hi

fiery gull Sep 25, 2025, 7:11 PM

#

frigid wing hi

Hi, pls don't disappear

echo aurora Sep 25, 2025, 7:11 PM

#

frigid wing hi

ablobwave

fiery gull Sep 25, 2025, 7:12 PM

#

bruh, he disappear

verbal nimbus Sep 25, 2025, 7:12 PM

#

JWT token maybe

verbal nimbus Sep 25, 2025, 7:13 PM

#

sullen quest are you guys trying to get into lmarena's veo API?

I heard it's built into YouTube now

#

Veo 3 fast

sullen quest Sep 25, 2025, 7:14 PM

#

oh no

verbal nimbus Sep 25, 2025, 7:14 PM

#

480p it seems https://blog.youtube/news-and-events/generative-ai-creation-tools-made-on-youtube-2025/

blog.youtube

Unpacking the magic of our new creative tools

Unlock new creative tools with YouTube's AI-powered features. Easily create high-quality Shorts with Veo 3, Edit with AI, AI music tools and much more.

#

Ikr lol

#

I hate that nowadays I have to constantly doubt if a cute animal is real or not

sullen quest Sep 25, 2025, 7:15 PM

#

oh god

verbal nimbus Sep 25, 2025, 7:16 PM

#

I guess we need better discriminator models to tell AI generated videos from non-AI generated ones

patent aspen Sep 25, 2025, 7:17 PM

#

I don't like that YouTube feature either, although it's trivial for YT to mark a video as AI generated if it's generated from their own tools

remote arrow Sep 25, 2025, 7:17 PM

#

verbal nimbus I hate that nowadays I have to constantly doubt if a cute animal is real or not

They're never real. One of them just wetting my hands right now..

ocean vortex Sep 25, 2025, 7:18 PM

#

verbal nimbus This one is by the SWE-Bench team, but only has Sonnet and GPT-5 unfortunately:

cost per instance? Are they doing parallel compute? This graph looks like there's not nearly enough info in it

verbal nimbus Sep 25, 2025, 7:18 PM

#

verbal nimbus I guess we need better discriminator models to tell AI generated videos from non...

Or some type of hardware solution for signing media, but that seems hackable...

sullen quest Sep 25, 2025, 7:18 PM

#

Are you trying to hack into lmarena's private API, cause I feel like you are trying to hack into lmarena's private api

queen veldt Sep 25, 2025, 7:19 PM

#

#

Yooo chatt

#

sullen quest Sep 25, 2025, 7:20 PM

#

soooo truuee

queen veldt Sep 25, 2025, 7:20 PM

#

It's optimizing global logistics rn

#

I'll leave it for 24h let's see

spice sphinx Sep 25, 2025, 7:21 PM

#

prompt test

sullen quest Sep 25, 2025, 7:21 PM

#

it won't stay that long

verbal nimbus Sep 25, 2025, 7:22 PM

#

ocean vortex cost per instance? Are they doing parallel compute? This graph looks like there'...

The details are here: https://www.swebench.com/SWE-bench/blog/2025/08/08/gpt5/

The Bash-only mode means that they're only running in a ReAct loop. Cost is proportional to API price and number of steps used.

glass blaze Sep 25, 2025, 7:22 PM

#

where does the image go after its generated?

sullen quest Sep 25, 2025, 7:22 PM

#

lets say you had to choose between all other benchmarks disappearing or lmarena disappearing which would you choose?

verbal nimbus Sep 25, 2025, 7:23 PM

#

glass blaze where does the image go after its generated?

It's saved, forever xD

glass blaze Sep 25, 2025, 7:24 PM

#

i dont know where to go to look at them

verbal nimbus Sep 25, 2025, 7:24 PM

#

Actually it's a bit scary how easy people can be de-anonymized just with just simple NLP techniques

sullen quest Sep 25, 2025, 7:25 PM

#

?

verbal nimbus Sep 25, 2025, 7:25 PM

#

verbal nimbus Actually it's a bit scary how easy people can be de-anonymized just with just si...

That's probably why Anthropic runs everything through Clio first

verbal nimbus Sep 25, 2025, 7:27 PM

#

glass blaze i dont know where to go to look at them

If it's lost, then it's inassessible (until they release the data)

crimson berry Sep 25, 2025, 7:28 PM

#

Create a realistic short video of a busy vegetable market in Tunisia. Show a Tunisian man selling fresh vegetables at his stall. Capture colorful produce, the man interacting with customers, and the lively market atmosphere. Use natural daylight and authentic Tunisian market elements. Include ambient sounds of people chatting and bargaining. Medium shot focused on the man and his stall. Cinematic, realistic style.

ocean vortex Sep 25, 2025, 7:28 PM

#

verbal nimbus The details are here: https://www.swebench.com/SWE-bench/blog/2025/08/08/gpt5/ ...

ok so that's quite a bit diferent to the usual reported score. But it seems to me that Claude needs way less iterations before it has a chance to arrive at the solution comparable to gpt5. GPT5 arrives at it much sooner and stays consistent with it

#

Which kinda what can be observed IRL. gpt5-high is incredibly consistent when it gets things right

#

Does not really need to arrive at the solution 'randomly'

verbal nimbus Sep 25, 2025, 7:30 PM

#

Yeah, that makes me feel more confident about its answer

proper field Sep 25, 2025, 7:34 PM

#

crimson berry Create a realistic short video of a busy vegetable market in Tunisia. Show a Tun...

Not here buddy. Try here: #video-arena-1

#

Im fascinated with seedream 4 but I cant use their api on platform byteplus for some reason

queen veldt Sep 25, 2025, 7:37 PM

#

I think it's unavailable for Europe

#

Not sure

proper field Sep 25, 2025, 7:38 PM

#

Seems like it. I need to buy a VPN then to connect to hong kong or something

mortal coyote Sep 25, 2025, 7:38 PM

#

anyone in here knows how to run WAN animate ??

queen veldt Sep 25, 2025, 7:38 PM

#

proper field Seems like it. I need to buy a VPN then to connect to hong kong or something

Can you enter the console?

#

And try out video gens or?

verbal nimbus Sep 25, 2025, 7:38 PM

#

mortal coyote anyone in here knows how to run WAN animate ??

Isn't it free on their website

mortal coyote Sep 25, 2025, 7:39 PM

#

verbal nimbus Isn't it free on their website

im talking on comfy ui

verbal nimbus Sep 25, 2025, 7:39 PM

#

mortal coyote im talking on comfy ui

Not sure then

proper field Sep 25, 2025, 7:41 PM

#

queen veldt Can you enter the console?

Yeah, i;v made account there but its just a blank page when i go to: https://console.byteplus.com/ai

mortal coyote Sep 25, 2025, 7:47 PM

#

Lm arena have Seedream 4.0 ??

proper field Sep 25, 2025, 7:48 PM

#

mortal coyote Lm arena have Seedream 4.0 ??

Yes!! 🔥

unborn ocean Sep 25, 2025, 7:49 PM

#

@verbal nimbus

#

Seems to be what you are looking for graph wise

proper field Sep 25, 2025, 7:51 PM

#

@queen veldt I found thread that talks about how to get and use byteplus api: https://www.reddit.com/r/Bard/comments/1nfl7tx/comment/ndxg8b0

I'm gonna try this when I buy my vpn :D

[Mature Content] yonkou_akagami's comment on "Seedream 4 has absolu...

Explore this conversation and more from the Bard community

small mica Sep 25, 2025, 7:56 PM

#

a child eating a banana and another child watching him

limber crag Sep 25, 2025, 7:59 PM

#

@echo aurora what's the difference between the old 2.5 flash preview and the one launched?

verbal nimbus Sep 25, 2025, 8:01 PM

#

unborn ocean <@858135822389346344>

Thanks, but this is for GAIA 2, although it's quite useful too

#

DeepSeek Terminus seems a bit dumb

#

It is currently normal tide at Port Nelson. At low tide, the water drops 60 cm. A boat is currently at the port. The boat has ladder with rungs spaced 30 cm apart. Currently, three rungs are submerged, with the water level slightly above the third rung from the bottom. At low tide, how many rungs will be submerged?

#

2.5 Flash Preview got it right, but the one on gemini.google.com didn't

thorn flare Sep 25, 2025, 8:06 PM

#

Hello

zealous sparrow Sep 25, 2025, 8:07 PM

#

verbal nimbus 2.5 Flash Preview got it right, but the one on gemini.google.com didn't

I test the one on aistudio.google.com basically the API one

sullen quest Sep 25, 2025, 8:10 PM

#

limber crag <@283397944160550928> what's the difference between the old 2.5 flash preview an...

its newer

mellow wedge Sep 25, 2025, 8:11 PM

#

Hello! I'm there because it looks cool and i really like LMarena! A true gold!

verbal nimbus Sep 25, 2025, 8:12 PM

#

zealous sparrow I test the one on aistudio.google.com basically the API one

I'm surprised Flash Lite got it too

#

Unless they trained on arena data, lol

zealous sparrow Sep 25, 2025, 8:12 PM

#

verbal nimbus I'm surprised Flash Lite got it too

its a simple text task?

hollow ivy Sep 25, 2025, 8:13 PM

#

why has lmsys discord been reopened?

#

there is scam over there (a guy posted about a "casino" scam)

zealous sparrow Sep 25, 2025, 8:13 PM

#

the only - of the new gemini model is i think its kind of overprotective even if you mention its for a html

stray aspen Sep 25, 2025, 8:14 PM

#

is the new gemini flash good

zealous sparrow Sep 25, 2025, 8:14 PM

#

verbal nimbus 2.5 Flash Preview got it right, but the one on gemini.google.com didn't

also could be because gemini.google.com didnt adapt the new model yet

merry wren Sep 25, 2025, 8:14 PM

#

why are nanobanana's rate limits so high

#

???

zealous sparrow Sep 25, 2025, 8:15 PM

#

merry wren why are nanobanana's rate limits so high

over the chart usage

stray aspen Sep 25, 2025, 8:15 PM

#

verbal nimbus 2.5 Flash Preview got it right, but the one on gemini.google.com didn't

its on ai studio

merry wren Sep 25, 2025, 8:15 PM

#

zealous sparrow over the chart usage

what

zealous sparrow Sep 25, 2025, 8:15 PM

#

too many people using basically

merry wren Sep 25, 2025, 8:15 PM

#

zealous sparrow too many people using basically

oh

civic spindle Sep 25, 2025, 8:15 PM

#

whats nano banana

zealous sparrow Sep 25, 2025, 8:15 PM

#

civic spindle whats nano banana

googles image generating model
possibly best on earth rn

stray aspen Sep 25, 2025, 8:15 PM

#

its an image model

zealous sparrow Sep 25, 2025, 8:15 PM

#

good at image edits

scarlet urchin Sep 25, 2025, 8:16 PM

#

wasnt there a new qwen edit out similar to sd4 and nano?

verbal nimbus Sep 25, 2025, 8:16 PM

#

zealous sparrow its a simple text task?

Reasoning

zealous sparrow Sep 25, 2025, 8:16 PM

#

I appreciate that google made this AI studio update that allows to preview HTML

verbal nimbus Sep 25, 2025, 8:16 PM

#

DeepSeek says 1 rung

#

Answer is 3

merry wren Sep 25, 2025, 8:16 PM

#

zealous sparrow googles image generating model possibly best on earth rn

what's the best one right after it?

civic spindle Sep 25, 2025, 8:17 PM

#

zealous sparrow googles image generating model possibly best on earth rn

asked it to hold a cup

stray aspen Sep 25, 2025, 8:19 PM

#

verbal nimbus ``` It is currently normal tide at Port Nelson. At low tide, the water drops 60 ...

2.5 flash on aistudio said 3 while the one on lmarena said 1

verbal nimbus Sep 25, 2025, 8:19 PM

#

Common sense test:

Continue this story:
Bob is driving two of his friends to a restaurant. "So, how have your week been, guys?" he asks, before moving to the back seat to join them. The sky is a brilliant hue of amber as the sun approaches the horizon. Cars whizz past on the opposite side of the freeway. Jeremy gasps. "What

languid fiber Sep 25, 2025, 8:20 PM

#

How to get veo3 for free

barren prairie Sep 25, 2025, 8:20 PM

#

zealous sparrow also could be because gemini.google.com didnt adapt the new model yet

But they said that they adapted it on gemini discord..

stray aspen Sep 25, 2025, 8:20 PM

#

languid fiber How to get veo3 for free

its free gang

languid fiber Sep 25, 2025, 8:20 PM

#

stray aspen its free gang

Where

barren prairie Sep 25, 2025, 8:20 PM

#

languid fiber How to get veo3 for free

Lm arena, ai studio...

stray aspen Sep 25, 2025, 8:20 PM

#

on gemini

#

.com

languid fiber Sep 25, 2025, 8:20 PM

#

It's not bro

stray aspen Sep 25, 2025, 8:21 PM

#

barren prairie Lm arena, ai studio...

wdym ai studio

#

it still has 2.0

scarlet urchin Sep 25, 2025, 8:21 PM

#

is 2509 on

languid fiber Sep 25, 2025, 8:21 PM

#

barren prairie Lm arena, ai studio...

Bro it's 2 3 videos only here

barren prairie Sep 25, 2025, 8:21 PM

#

stray aspen wdym ai studio

No there is veo3 for 10requests

scarlet urchin Sep 25, 2025, 8:22 PM

#

barren prairie No there is veo3 for 10requests

just use wan 2,5 instead of veo 3... its free at higgs

languid fiber Sep 25, 2025, 8:23 PM

#

scarlet urchin just use wan 2,5 instead of veo 3... its free at higgs

Wan? What's that

verbal nimbus Sep 25, 2025, 8:23 PM

#

verbal nimbus Common sense test: > Continue this story: > Bob is ***driving*** two of his fr...

Qwen failed to understand the situation

barren prairie Sep 25, 2025, 8:23 PM

#

scarlet urchin just use wan 2,5 instead of veo 3... its free at higgs

I use animon.. I love anime animation

#

It is free and unlimited

stray aspen Sep 25, 2025, 8:23 PM

#

wheres the video arena

verbal nimbus Sep 25, 2025, 8:24 PM

#

stray aspen wheres the video arena

Only on Discord

verbal nimbus Sep 25, 2025, 8:24 PM

#

verbal nimbus Qwen failed to understand the situation

Correct answer:

scarlet urchin Sep 25, 2025, 8:24 PM

#

languid fiber Wan? What's that

wan 2.5 is the veo 3 competitor

stray aspen Sep 25, 2025, 8:24 PM

#

wan 2.5 sucks

barren prairie Sep 25, 2025, 8:24 PM

#

languid fiber Wan? What's that

Alibaba new model

languid fiber Sep 25, 2025, 8:25 PM

#

scarlet urchin wan 2.5 is the veo 3 competitor

But it's not free its has 10 credits only

scarlet urchin Sep 25, 2025, 8:26 PM

#

free at higs for a week

stray aspen Sep 25, 2025, 8:26 PM

#

veo 3 aint free either

languid fiber Sep 25, 2025, 8:26 PM

#

scarlet urchin free at higs for a week

What's higs

flint sandal Sep 25, 2025, 8:26 PM

#

Openai is dominating for so long now. Im still waiting for gemini, claude and grok.

flint sandal Sep 25, 2025, 8:27 PM

#

languid fiber But it's not free its has 10 credits only

If the product isnt unlimited for free doesnt mean its not free.

stray aspen Sep 25, 2025, 8:27 PM

#

flint sandal Openai is dominating for so long now. Im still waiting for gemini, claude and gr...

gemini will cook gang

languid fiber Sep 25, 2025, 8:28 PM

#

flint sandal If the product isnt unlimited for free doesnt mean its not free.

There is website where we are using chatgpt 5 for free

scarlet urchin Sep 25, 2025, 8:28 PM

#

languid fiber What's higs

higgsfield.ai

barren prairie Sep 25, 2025, 8:28 PM

#

We are waiting for DeepSeek..

DeepSeek : r1 ter

We are waiting for Gemini 3

Gemini : 2.5 flash update 🤡🤡🤡

#

Next one ?

languid fiber Sep 25, 2025, 8:29 PM

#

scarlet urchin higgsfield.ai

I want website like this for veo 3

flint sandal Sep 25, 2025, 8:29 PM

#

languid fiber There is website where we are using chatgpt 5 for free

Because we are testing and benchmarking models? If someone uses lmarena as a tool for free ai then he is not using it as he should

verbal nimbus Sep 25, 2025, 8:29 PM

#

verbal nimbus Correct answer:

Gemma 3 27B seems to have better common sense than 2.5 Flash Lite

stray aspen Sep 25, 2025, 8:29 PM

#

gemini 2.5 pro latest when

verbal nimbus Sep 25, 2025, 8:30 PM

#

stray aspen gemini 2.5 pro latest when

Nightride

flint sandal Sep 25, 2025, 8:30 PM

#

languid fiber There is website where we are using chatgpt 5 for free

Like puter or g4f are for free-use ai. Not lmarena

remote arrow Sep 25, 2025, 8:31 PM

#

#ai-creations message

verbal nimbus Sep 25, 2025, 8:32 PM

#

verbal nimbus Gemma 3 27B seems to have better common sense than 2.5 Flash Lite

Qwen Max is actually worse than Gemma 3 27B on this test lol (fails to realize no one is driving the car)

stray aspen Sep 25, 2025, 8:35 PM

#

verbal nimbus Qwen Max is actually worse than Gemma 3 27B on this test lol (fails to realize n...

is this correct answer

verbal nimbus Sep 25, 2025, 8:36 PM

#

Yup

hollow ivy Sep 25, 2025, 8:37 PM

#

verbal nimbus Nightride

i thought, it was sorting-hat

verbal nimbus Sep 25, 2025, 8:37 PM

#

It is correct if they get a shock that Bob is abandoning the driver's seat while driving down the highway.

#

All Qwen models seem to fail on it

#

DeepSeek as well

#

I'm curious to see if Kimi gets it

verbal nimbus Sep 25, 2025, 8:40 PM

#

verbal nimbus DeepSeek as well

Oh interesting, DeepSeek Terminus gets it when it doesn't think

verbal nimbus Sep 25, 2025, 8:41 PM

#

hollow ivy i thought, it was ```sorting-hat```

For nightride-v2 and 2.5 Pro in the same battle before, their responses are almost verbatim.

#

But nightride was better since 2.5 Pro didn't really give a complete explanation by the end.

hollow ivy Sep 25, 2025, 8:43 PM

#

verbal nimbus But nightride was better since 2.5 Pro didn't really give a complete explanation...

is oceanstone better than both?

verbal nimbus Sep 25, 2025, 8:43 PM

#

stray aspen is this correct answer

It just failed it for me

hollow ivy Sep 25, 2025, 8:43 PM

#

and what about nightride-on?

stray aspen Sep 25, 2025, 8:43 PM

#

verbal nimbus It just failed it for me

thats lite tho

#

i used normal gemini 2.5 flash

verbal nimbus Sep 25, 2025, 8:43 PM

#

hollow ivy is ```oceanstone``` better than both?

Seems so, but I didn't get it that many times

verbal nimbus Sep 25, 2025, 8:43 PM

#

hollow ivy and what about ```nightride-on```?

It seems to be nightride with internet connectivity

verbal nimbus Sep 25, 2025, 8:44 PM

#

stray aspen thats lite tho

Oh true I thought yours was Lite

hollow ivy Sep 25, 2025, 8:44 PM

#

i only got sorting-hat once

#

is it a version of gemini-2.5.x-pro?

verbal nimbus Sep 25, 2025, 8:44 PM

#

I didn't get it that many times

#

It wasn't great

hollow ivy Sep 25, 2025, 8:45 PM

#

so the best of the (google-) "pack" is oceanstone?

verbal nimbus Sep 25, 2025, 8:45 PM

#

It gets some stuff wrong that oceanreef didn't

hollow ivy Sep 25, 2025, 8:45 PM

#

due to oceanreef's web-search ability?

verbal nimbus Sep 25, 2025, 8:46 PM

#

Idk if it's connected to the web

#

That's nightride-on

hollow ivy Sep 25, 2025, 8:46 PM

#

and skytrail?

verbal nimbus Sep 25, 2025, 8:46 PM

#

Haven't encountered it 🤔

hollow ivy Sep 25, 2025, 8:46 PM

#

-# (route66 was by openAI, based on GPT5)

#

of all new/existing models (existing since at least a week), which is the best?

verbal nimbus Sep 25, 2025, 8:48 PM

#

GPT-5 High

hollow ivy Sep 25, 2025, 8:49 PM

#

is it still better than GPT5-high-NSP?

#

(i got that again, recently)

verbal nimbus Sep 25, 2025, 8:49 PM

#

I like NSP's style better, but it leaves out things sometimes

#

You can compare them side by side in direct mode

verbal nimbus Sep 25, 2025, 8:49 PM

#

verbal nimbus I like NSP's style better, but it leaves out things sometimes

It seems to ask less follow up questions

hollow ivy Sep 25, 2025, 8:49 PM

#

ah, NSP is there?

#

idk that

verbal nimbus Sep 25, 2025, 8:50 PM

#

Yup

hollow ivy Sep 25, 2025, 8:50 PM

#

but rate-limited?

verbal nimbus Sep 25, 2025, 8:50 PM

#

Not sure, haven't encountered any

#

I usually battle though so idk

hollow ivy Sep 25, 2025, 8:50 PM

#

how would one prompt GPT5-high to achieve the absolute best possible result?

#

(in programming and roleplaying/long sandbox games)

verbal nimbus Sep 25, 2025, 8:51 PM

#

Not sure, but the system prompt on ChatGPT (if correct) seems to be already about 18K tokens long

#

There's so many tools

hollow ivy Sep 25, 2025, 8:52 PM

#

i read somewhere, that LLMs emit better code, if being immersed into a special role (eg. being a professor, etc)

verbal nimbus Sep 25, 2025, 8:52 PM

#

Maybe, Anthropic used to recommend "You are an expert in ..."

hollow ivy Sep 25, 2025, 8:53 PM

#

and then i read something about a virtual "control panel"

#

which can be used for GPT5-high

#

(using XML tags and structured prompts)
https://cookbook.openai.com/examples/gpt-5/gpt-5_prompting_guide

glossy umbra Sep 25, 2025, 8:55 PM

#

@edgy hawk did you finally leave the call?

hollow ivy Sep 25, 2025, 8:57 PM

#

verbal nimbus GPT-5 High

is it better in coding than Claude Opus 4.1 Thinking?

#

(if using detailed prompting, to maximize the correctness of the code-output)

verbal nimbus Sep 25, 2025, 9:05 PM

#

hollow ivy is it better in coding than *Claude Opus 4.1 Thinking*?

Depends on the language

#

It think it's more economical to pair it with Claude

magic stag Sep 25, 2025, 9:06 PM

#

I dont get it what's the gemini flash update?

verbal nimbus Sep 25, 2025, 9:06 PM

#

verbal nimbus It think it's more economical to pair it with Claude

It can't write good Godot syntax and will waste tokens trying to fix its errors

magic stag Sep 25, 2025, 9:06 PM

#

magic stag I dont get it what's the gemini flash update?

Theres a new gemini flash?

verbal nimbus Sep 25, 2025, 9:06 PM

#

On AI Studio

#

Only a preview

#

2.5

magic stag Sep 25, 2025, 9:06 PM

#

Ah

#

Im guessing a new 2.5 flash being worked on and still in preview means RIP gemini 3 coming any time remotely soon

verbal nimbus Sep 25, 2025, 9:08 PM

#

Not sure, there are rumors that it had a successful training run

magic stag Sep 25, 2025, 9:08 PM

#

Yea but I mean I doubt theyre gonna push 3.0 any time remotely close to releasing a new 2.5 as a preview not even full release

cloud zinc Sep 25, 2025, 9:09 PM

#

that rumor was debunked

verbal nimbus Sep 25, 2025, 9:09 PM

#

Oceanstone seemed like Flash 3.0

verbal nimbus Sep 25, 2025, 9:09 PM

#

cloud zinc that rumor was debunked

The guy has a good record of being right though

cloud zinc Sep 25, 2025, 9:09 PM

#

gemini 3 is delayed, cuz they focused on gemini 2.5