#general

1 messages ยท Page 237 of 1

fiery gull
#

python is to otimize txt, and replace docs, opus to txt etc

#

yep ;-;

bleak cave
#

?

fiery gull
#

sorry, I need to sleep now

spark python
#

Ur making it json?

fiery gull
#

depend on the application

spark python
#

But for what reason do you need that many pythons

fiery gull
#

because I have 700gb of data

spark python
#

Of text?

fiery gull
#

word, text, opus, ever thing

spark python
#

And what's the context in them

fiery gull
#

I'm automating a giant education company (a really big one), I'm simply exhausted right now, I can't even write anymore, I should go to sleep now. I'm automating everything from company posts to WhatsApp replies and so much more

#

I'm currently working on 5 opus 4.5, compressing text for LLM to read

#

I think in the future I will need hmmm, 200 agents and 300 skills, something

spark python
#

Ok go rest

fiery gull
#

and 1 gemini 3.0 pro seeing now 20 instagram to make a doc to my core

#

Things are crazy here, in the future it's going to be chaos for me to organize everything ๐Ÿ™‚

#

I'm thinking better, is better the flash 3.0, because the pro high is so slow to see instagram posts ;-;

#

@echo aurora

#

lol I'm so dumb, just now I released I can see the instagram profiles with python ;-;

#

why I'm seeing with gemini? bruhhhhhhhhhhh

#

Maybe it's because I've only been studying Python for 5 days and I'm still making these mistakes

junior sonnet
#

Does anyone know a website where I can use Nano Banana Pro for free or with a free trial instead of LM Arena?

thick pawn
#

It's a good question since lm arena doesn't like to work anymore

polar niche
#

Where are the new mf text llms

#

Not everyone wants image generators

proud bobcat
#

NEW QWEN IMAGE

#

Oh yeah baby

#

Jackpot

polar niche
#

Literally qwen

#

How is that a jackpot

proud bobcat
#

what

#

Open source image generator model with extremely high prompt following is not a jackpot?

gusty egret
proud bobcat
#

Free to use and cost effective???

chrome goblet
#

Can you guys make GPT 5 faster?

gusty egret
#

Use nano

proud bobcat
polar niche
proud bobcat
chrome goblet
#

Crap

polar niche
#

Now

polar niche
gusty egret
#

I'm too up-to-date with AI man

#

That's already old news to me

chrome goblet
#

Gemini sucks at doing Random Fictional Characters Stories

chrome goblet
#

I keep making scripts and it's the same crap every time

#

And I don't like it

polar niche
chrome goblet
#

Y'all feel me?

proud bobcat
#

Oh wow qwen image 2512 is quite good

#

Noticeable increase in detail

#

Not bad at all tbh

proud bobcat
#

The

#

The updated qwen image

#

2512

#

Itโ€™s a new model update

#

Chat

chrome goblet
#

Get rid of that crappy CAPTCHA!!!

echo aurora
chrome goblet
#

Ok

fossil socket
#

@echo aurora any idea when y'all gonna add the attach image feature from GPT and Gemini onto other models

obtuse smelt
#

hmm what you mean ?

#

oh yeah i use gemini model and it can on anime styles

vague solar
#

where did all the claude models go ๐Ÿ˜ญ

obtuse smelt
#

uhmm

#

try you refresh a site

vague solar
#

yeah i tried

#

nvm i fixed it

#

that was weird

gusty egret
#

Token for token

obtuse smelt
#

if you side by side is delay 1 hour for using model

#

i try use gemini is delayed waiting 1 hours really in side by side

chrome goblet
#

I can't do anything because of CAPTCHA

fossil socket
#

But others cannot

obtuse smelt
#

hmm try side by side

dreamy crystal
#

is there chat arena of somesort that allows deepresearch?

prisma cipher
#

This has happened twice now that Claude opus 4.5 has not generated the complete response for me even after refreshing the page.

verbal nimbus
prisma cipher
prisma cipher
#

Reduce the number to 8250 to avoid the red words haha

craggy wasp
#

vertical size does not come

prisma cipher
left lodge
#

video modality appears for logged in users this time but with a twist now , only battle mode is available for video , supports image input and both videos have to be first played to abe able to vote for feedback.

#

Only after 3 generations, ๐Ÿ’€
The platform is really limiting.

#

Damn

fiery gull
#

bruh the limit from anti grativy is only 8

#

I need 9 opus working now

empty stump
#

8 messages?

jovial ravine
#

hi any one here ?

surreal creek
#

kiwi-do is Kimi?

jovial ravine
#

@echo aurora check dm please

fiery gull
shut crown
#

Hello good to be here

leaden raft
#

hello

echo aurora
#

welcome welcome @shut crown @leaden raft

obtuse smelt
#

hi

fiery gull
#

hi

jaunty oyster
#

Happy new year :))
I am Buschi, and i am less times here in discord..hope to find a nice LMArena Matrix channel ...

viscid cloak
#

ReCAPTCHA+Something went wrong COMBO hit again. Good Night๐Ÿ˜ช

robust sluice
subtle frost
#

I don't get how Gemini 3 Pro has higher HLE score than Claude.

I've mainly tested Bulgarian grammar and medical information. From the tests I've done: Claude does better more than 90% of the time.

#

Is HLE a bad benchmark for what I'm actually trying to look at though?
I'm mainly looking for an AI that can search the web, find official sources, then interpret and present that information correctly. Definitely not to synthesize his own information.

robust sluice
#

<@&1349916362595635286>

ionic sonnet
atomic mantle
#

helo

whole lotus
#

I saw a video available around 1 hour ago on LMARENA but now it's gone. Is it only temporary available?

hollow ivy
#

-# another account got zombified

#

@quasi atlas could you remove the scam, please?

#

he could try to delete all cookies, logout and login
if that doesn't help, try another browser

#

if chromium-based browsers don't work anymore, try a firefox-based one, like LibreWolf

#

or safari

grave mesa
#

What is the best AI for coding in general?

#

Claude ?

quasi atlas
hollow ivy
#

(aka coast = coASt = co45t)

#

coasting along ^^

grave mesa
#

Okay thx you !

golden ocean
#

-# That model is head and shoulders above all other models, in coding.

hollow ivy
#
poll_question_text

which is better in coding?

victor_answer_votes

9

total_votes

10

victor_answer_id

2

victor_answer_text

Claude Opus 4.5 Thinking

pseudo dagger
#

why this website not working in my desktop?

finite blade
#

@daring jewel

#

@daring jewel dm please

jade stirrup
#

you guys can fix other people's accounts, right?

remote sun
#

It's happened... Again... In a new chat...

bright shard
#

@echo aurora When I upload an image to the Nano Banana Pro or Nano Banana model and need it edited, it returns the original image without any changes or modifications. Please fix this ๐Ÿ™๐Ÿผ๐Ÿ™๐Ÿผ๐Ÿ™๐Ÿผ

main path
#

๐Ÿ‘‹

hollow ivy
#
poll_question_text

Which is best for extremely long roleplaying/sandbox/adventures & creative writing?

victor_answer_votes

2

total_votes

5

coral birch
queen veldt
#

I can't even generate a single image with nano banana pro

#

This problem isn't fixed yet ...

woeful portal
#

google login is broken HELP

wicked sage
#

hi guys

#

<@&1349916362595635286>

wicked sage
silk kayak
#

Hello everyone. I've encountered an issue: there's no "Create Video" button. What could be causing this and how can I fix it?
@everyone

obtuse smelt
#

hmm should fix all

#

claude,gemini,grok and all models back to normal

silk kayak
#

I have 2 emails. On first- all works nice. On second - this problem

neat apex
rigid copper
#

sup

rigid copper
subtle frost
fickle venture
fickle venture
#

I think OpenAi did make their own search engine

#

If people use this one then Chatgpt will get even better

subtle frost
#

Gemini never searches properly

It always pretends to search then says some BS

#

The link either goes no where or it just completely incorrectly synthesised the information

#

Other models like actually use a good source and don't hallucinate info about it

prisma cipher
#

Has the problem generating a response for the Opus 4.5 model been resolved? Or is it still ongoing?

subtle frost
#

At least from what I found

fierce kelp
neat apex
#

Grok is also not a great searcher, but he absuses the 2 million context + Twitter condesated information to check out literally more than 100 sources

#

thats why its the best at all

subtle frost
fierce kelp
subtle frost
#

Yh true

subtle frost
#

I wonder if there's a benchmark at how good an AI is at finding sources and providing an accurate answer based on the sources

fierce kelp
subtle frost
#

Ohhh doesn't needle in a haystack have a search leaderboard?

fickle venture
#

This is just like means Ai still in Beta

#

We still gonna wait for the full release of Ai

#

In about 1-2 years

#

And that's when something happens who knows

loud crag
#

Does anyone occasionally get an output image from Nano Banana Pro that takes like 4-5 times longer than regular images, and clearly comes out broken/glitched? It looks mostly the same as the image I put in, but the face and lighting are messed up.

loud crag
fickle venture
#

Huh that's weird

fickle venture
fickle venture
loud crag
#

Actually, no I had it wrong. It only happened on Fal.ai

#

It might be a 4k thing since I was using Fal for any 4k image and the free one on LMArena for anything 2k, sorry!

fickle venture
#

So that's fal issue

loud crag
#

Probably, my bad. ๐Ÿ˜…

thorny schooner
#

Considering
I have not even thinked about trying because of the issues going on right now with this website can't really give an answer

sterile tartan
#

@loud crag why Stupid

loud crag
viscid cloak
sterile tartan
fierce kelp
viscid cloak
fierce kelp
#

How does gpt-5.1-search-sp differ from gpt-5.1-search?

sterile tartan
sour spear
echo aurora
# pseudo dagger why this website not working in my desktop?

Can you give these steps a try? https://help.lmarena.ai/articles/1645798556-lmarena-how-to-something-went-wrong-with-this-response-error-message You'll also want to swap to Search Arena for that kind of question, if you click on the little globe in the text chat you'll change to the Search modality.

wet echo
#

hello , i am an AI enthusiast , i want to understand the artificial intelligence indeptly

queen veldt
#

No

desert abyss
compact hearth
#

Will direct chat be added for video generation?

desert abyss
#

@marsh vector Please check how-to-video-bot to learn how to generate videos. In addition images and prompts must adhere to the server rules. Be mindful of the wording used in the prompts and avoid generating content that is suggestive or inappropriate. Thank you. Failing to follow this rule will result on a permanent ban.

marsh vector
#

I wsnt my crush wirh me

little ginkgo
#

Why this website bugs sometimes ๐Ÿ™

fickle venture
#

Guys help llama-13b he is homeless living down there

echo aurora
little ginkgo
#

I totally select all the correct stuff

#

Then it says incorrect

#

And if it somehow sees its all correct

#

The ai says error generating response

#

And give me options to retry and clear

#

And once i hit retry

#

I am stuck in captcha loop again

#

:<

#

Wow ur a pineapple

ocean vortex
#

haven't tried it in awhile, would be very ironic lol

little ginkgo
#

Its kinda slow

#

:]

west lodge
#

waiting patiently for nano banana pro 4k to be enabled

#

its right THERE

ocean vortex
daring rock
light flax
#

hello everyone , i downloaded BRAVE browser on my pc ,, it seems like i cant open LmArena.ai for some reason .

#

anyone can help ?

modest prism
#

Guys do you have any anti lazy prompt for Gemini 3 pro. The model is so goddamn lazy

storm needle
timid hamlet
acoustic orchid
wicked sage
#

i got a question, is abacus.ai just lmarena but paid?

hollow ivy
echo aurora
stray aspen
rose sky
#

Hello

red sluice
#

@echo aurora There's a big issue with Lmarena's search mode since today. You might consider not counting the votes from today.
On almost EVERY CHAT, it'll stop before it has finished writting. How am I even supposed to vote correctly?

#

over and over again

proud bobcat
#

Now this

red sluice
#

It happened a lot with grok fast but now it happens a lot with every model

proud bobcat
#

This is what AGI is

echo aurora
echo aurora
red sluice
# echo aurora Any prompts in particular trigger this, or all?

No idea. Actually it only happens with Grok-4-fast-search, my bad, didn't vote before telling it here.
Ask it to write a wikipedia article for example, prompt it like that:
"Alright, let's make a wikipedia article about the actor "James Austin Kerr" {or any actor, just took this prompt, but could be anything}
I guess you know everything on how to make wikicode. Maybe try to find a fitting infobox. Double check that the URLs you use in the ref tags are leading somewhere (don't change titles of articles!)
Use good sources only. Don't overdo it.
Paste the result in codetag."

And Grok-fast won't have everything pasted

#

examples from today

#

but already happened before (last screnshot still voted grok-4.1 btw because he brought interesting stuff before stopping ๐Ÿ˜› )

echo aurora
# red sluice examples from today

Okay gotcha, thanks for the information. Very clear and helpful blobthumbsup . Do you know if these examples are from Battle or was it done in Side by Side?

red sluice
#

Battle, didn't try side by side

#

search mode activated btw, just in case there's a misunderstanding

echo aurora
#

Sounds good blobthumbsup

#

I'm trying to repro in SbS and no luck, will try to get grok models in Battle.

#

It's worth noting though that we have ways to validate votes to prevent issues like this from swaying the leaderboards unfairly.

red sluice
#

Just in case I'm using Firefox on Windows right now, with adblocks on and ghostery. I don't believe it impacts anything that is generated but yeah giving pretty much all the information I can

echo aurora
#

I got a full answer from grok-4-fast-search in Battle

obtuse smelt
#

oh

red sluice
#

You can see it start to generate in the beginning but stopping

#

It's happening every single time with grok fast

#

and refreshing doesn't help

obtuse smelt
#

i see

red sluice
#

(had multiple tabs opened to test it and be sure to have grok if you're confused)

bright shard
#

@echo aurora When I'm using Gemini 3 Pro in Code Arena, it often outputs infinite text. Do you think that could be fixed?

still jetty
#

edit: reloading the page worked without losing the conversation
leaving the question up for future searchers

in arena battle mode, is there a way to resend the prompt? i am 3 prompts in and its stuck generating. i REALLY had a good experience with one, but since i made the 3rd prompt and it got stuck, i cant vote or reveal identity :(

#

website, safari browser

earnest rover
#

@echo aurora does lmarena has a limit of number of words we can sent ?

whole sundial
#

<@&1349916362595635286>

echo aurora
robust sluice
rigid copper
#

hi guys

#

i guess you're trying to create image or video, you can do it in #video-arena-1

halcyon nimbus
#

looks like image generations down site wide D:

#

like 5 failed battles but maybe some models are working idek

echo aurora
echo aurora
halcyon nimbus
#

might have to sub to chatgpt for a month its actually so good rn even better than nano bp

echo aurora
halcyon nimbus
#

ive only tried battle a few times and chatgpt latest so \o.0/

#

tried refreshing, new chats, still nothing

echo aurora
#

Seems good on my end

halcyon nimbus
#

mod luck

echo aurora
echo aurora
#

Yikes, going to remove the image

halcyon nimbus
#

i guess some are working =_=

whole sundial
# echo aurora Yikes, going to remove the image

yeah, all I can say about grok is that it is weird, even on api. I had grok 4.1 fast hallucinate so badly on lmarena with an innocent prompt that it generated an nsfw answer, grok 4.1 (the full version) did it just fine, I guess they train the models on a lot of adult stuff

covert ice
#

hello

echo aurora
echo aurora
halcyon nimbus
#

my guess all the image editing ones are down

echo aurora
halcyon nimbus
#

it would be cool if there was a status detector with a little red or green next to each model to say if they were up/down

echo aurora
halcyon nimbus
#

that wasnt image edit, yeah only having a problem with image edit

robust sluice
#

Qwen, Flux, Reve (not sure of this name) is the 3 AI that always works well, other is a lot error

echo aurora
halcyon nimbus
#

tried a different browser

whole sundial
#

both flux though

halcyon nimbus
#

i guess its just high traffic api overloaded id bet

robust sluice
echo aurora
whole sundial
#

yeah battle re-route to another model if one fail, i guess in this case all roads lead to flux/qwen/reve

robust sluice
#

I can say when I use Battle, its random only 1 slot another slot is only lock for Flux

echo aurora
whole sundial
#

๐Ÿคท

robust sluice
obtuse smelt
#

all user love anime styles in LMArena

robust sluice
echo aurora
robust sluice
#

50s and finish drawing wow

halcyon nimbus
#

shrodingers website

robust sluice
#

when I didnt record:

opal scaffold
onyx cairn
#

When I'm trying to log into my account this error is showing. How can I solve this?

quartz light
obtuse smelt
#

well i just kidding

royal sorrel
#

hello

obtuse smelt
#

hi

neon idol
#

<@&1349916362595635286>

loud crag
#

Does the Nano Banana Pro on LMArena use the Web search or not?

jovial lynx
#

hello

fickle venture
#

Grok 4.2 dropped yet?

livid heath
#

Why is the "Something went wrong with this response, try again" message constantly there?
I have retried many times yet it gives this same response. Is that conversation dead, like can't be continued or something?

obtuse smelt
lean pivot
#

๐Ÿ‘‹

celest cave
#

What should I do about it? Something went wrong with this response, please try again.

fickle venture
#

Gemini is now on YouTube but when you ask it to tell you something that is on video it will tell you to watch the video to find it so it refuse to answer

steady grove
#

anyone connected game to LLM ?
for ai npc etc

i tried to local ollama VISION, npc to choose where he WANT to go
and it works. lol.
even with details "oh this big blue cube is so big" etc
if anyone interested in technologies / code - dm and free code will fly to your home

fleet condor
#

๐Ÿ‘‹

hearty ferry
#

LMARENA UNCENSORED?

#

@echo aurora

lofty shell
#

hello

ocean vortex
ocean vortex
#

They want to encourage people to do 'safe' chats that could be analyzed or publicly shared, and for them to use battle mode predominantly direct chat less.

echo aurora
civic spruce
#

Is there any subscriptions in lmarena?

astral dust
#

Why do models give incorrect dates bro

#

@echo aurora

echo aurora
echo aurora
echo aurora
echo aurora
stray aspen
#

Wassup gang

echo aurora
echo aurora
echo aurora
astral dust
echo aurora
#

That's a good question, going to move this convo to #leaderboards and ping you there

echo aurora
wicked sage
#

like chatgpt, deepseek, glm, whatever

wicked sage
# astral dust

not sure about the second one, the first one is because of its info getting cut off on a specific date

echo aurora
robust sluice
astral dust
#

Is the website down?

slim loom
#

It hasn't always been easy, but we are finally in the home stretch. To keep the momentum going, Iโ€™ve set up a live countdown page for our official launch.

https://www.linkedin.com/posts/tkoushik_buildinpublic-productmanagement-careerdevelopment-activity-7413596891512385536-bCfr?utm_source=share&utm_medium=member_desktop&rcm=ACoAAFTV2vkBRU-4pgTUDg7fsP_RhIsZlWT1iME

I'm breaking the #1 rule: I'm building my portfolio live.

No perfection. No hiding. Just real work in real time.

The countdown: koushikapm.online

Built with Bun + Elysia.

What rule are YOU breaking

#BuildInPublic #ProductManagement #CareerDevelopment #TechCommunity #WebDevelopment #LearningInPublic #Innovation #ProjectManagement #Softw...

wicked sage
#

hi gys

dapper walrus
#

site login still doesnt work(

#

anyone still have tgis problem?

#

and sometimes this

old garnet
#

Hello

dapper walrus
#

?

stray jetty
#

..

thick pawn
spark python
#

does anyone know an llm that doesnt hallucinate after 100k tokens?

thorny schooner
#

Has anyone up else kind of just given up on this for the meantime

rose sky
rose sky
#

The code is obviously generated by Gemini 3 Flash. I just ask it to write the Arduino code to do what I want

rose sky
grim glacier
#

hello

whole sundial
#

<@&1349916362595635286>

wind shore
#

.w.

somber pagoda
#

does anyone know when direct chat will be allowed for videos?

#

cause I see it on the site as in only battle but not the other 2 within dropdown

onyx cairn
#

@echo aurora I did the steps you told me to do but it's still not working. Also this message is keep showing

burnt pulsar
# dapper walrus and sometimes this

I see that error from time to time, but recently I only get a single AI model in direct chat (thankfully it is Claude-Opus-4.5-Thinking-32K) and cannot select anything else any longer. Is this a known issue?

verbal nimbus
fossil socket
#

Is it me or is the limit on Claude way short now?

crude anchor
#

hello guys

uncut needle
#

AI models are tuned to win benchmarks.
But what if the benchmark is different?

Prompt Arena lets AI models face the same strategic problem โ€”
no memory, no fine-tuning, no shortcuts.

Some models plan.
Some fail.
The differences are obvious.

If youโ€™re into AI, reasoning, and real model behavior โ€”
check it out ๐Ÿ‘‰ https://prompt-arena.com

AI vs AI. Real outcomes.

fossil socket
lime void
#

Hello chat
is there a way to use a start frame and end frame while generating a video transition here?

slim spire
#

Hello everyone ๐Ÿค—

obtuse smelt
#

hi

obtuse smelt
#

Something went wrong with this response, please try again.
is seem error

hollow panther
#

hello

bronze gull
#

Hello

fossil socket
#

Ok anyone here knows why the Claude rate limit is way shorter than yesterday?

#

@echo aurora

gleaming roost
#

I believe part of the rate limit has to do with the influence of the number of tokens, but I could be wrong.

thorn nebula
#

why is claude limit soo short now

sour spear
golden ocean
#

real

iron laurel
#

Hi. Is claude-opus-4-5-20251101-thinking-32k really Claude Opus 4.5 ?

gleaming roost
#

๐Ÿค”

icy frost
versed marlin
icy frost
gleaming roost
#

๐Ÿ•ต๏ธ

sour spear
#

Holy crap, the OpenAI discord is a mess. They just removed about 50 messages from their openai-chatter channel for going off-topic, and also conveniently removed posts in the process that were fully on-topic - but critical of the product's current state. That stinks of censorship. ๐Ÿ’ฉ

#

The discussion only started to derail towards the end, but they obviously felt the need to delete way more posts than necessary.

limber panther
fickle venture
limber panther
fickle venture
iron laurel
echo aurora
fickle venture
#

So it thinks longer for better results

iron laurel
#

Because I'm not able to find benchmarks

fickle venture
obsidian cargo
civic ermine
#

Hello

echo aurora
civic ermine
#

I'm new here

#

How do I generate video

sour spear
desert abyss
inner gate
#

hello guys

true vigil
#

Hi

quartz light
#

by far

hard rover
#

Hello

fickle venture
proper jacinth
#

Is there an Android app?

acoustic orchid
#

anyone know why i keep getting Something went wrong with this response, please try again.

when i try and do something please

acoustic orchid
echo aurora
# acoustic orchid anyone know why i keep getting **Something went wrong with this response**, plea...
next hedge
#

๐Ÿ‘‹

sage hinge
#

Guys, please help, upload the video generator to the website, please! There's a child crying here at home asking for the video generator, are you going to let an innocent child cry? ๐Ÿ˜ญ๐Ÿ˜ญ

echo aurora
proud bobcat
#

qwen image peak

ionic lark
#

gang i joined ts server cuz of dis vid

#

how do u get the ais

echo sinew
lean mortar
#

Hello

lean mortar
quartz light
#

yall removed reactions from announcements? ๐Ÿ˜ญ

#

๐Ÿ˜”

quartz light
#

but i mean we dont have it on lmarena

#

so i havent been able to use it

sage hinge
meager harbor
#

hey

meager harbor
stray aspen
#

qwen image sucks

worldly copper
#

as nothing close comes to that

stray aspen
#

nano banana pro leads the way

quartz light
#

๐Ÿ˜”

thorny schooner
#

stupid verification always do have to ruin my day

pure swan
#

I am asking are the free version in lm arena weaker than the actual one in the apps like grok Claude etc

west lodge
#

hmm whats this

#

oh new feature

#

soonโ„ข

pure swan
fossil socket
#

Any reason why they literally shortened the rate limit for Claude by 75%?

#

5 prompts then waiting for an hour is way too short

echo aurora
#

(Already responded in the thread, but will respond here incase others are wondering the same). Overall, these rate limits can change. I'm not aware of this change, but it's possible. Checking with the team to confirm this is the rate limit and not some bug causing it.

weak crane
#

old photo of it btw

old garden
quartz light
#

js like me fr

remote sun
#

????????????????????

old garden
quartz light
old garden
#

@quartz light why is he confused

#

Ohh

weak crane
#

hi

stray aspen
#

how is bro doing battles in direct chat

echo aurora
next hedge
#

๐Ÿ‘‹

peak rock
#

How to create vidos on LMArena?

echo aurora
river star
#

Hello, just sneaking around

keen beacon
echo aurora
keen beacon
#

?

echo aurora
wicked tapir
#

for the january 1 contest are all lmarena modalities eligible?

#

or only some of them?

echo aurora
polar brook
#

I used AI to help practice test fight scenes

sage hinge
#

@echo aurora Hey, do you guys plan to officially launch the video generator on the website at some point? I know it's an experiment and this is a silly question, but I'd like to know if it's something you plan to continue.

buoyant jacinth
#

Just practicing.

echo aurora
sage hinge
frank adder
#

How to test for particular mystery or anomalous model on llmarena

echo aurora
frank adder
echo aurora
frank adder
#

How to moderator or admin role here

echo aurora
frank adder
echo aurora
frank adder
#

Atleast intern or any other

echo aurora
crystal blade
#

hey guys im getting this issue where it is only generating square image aspect ratio

#

no matter i put in the prompt ill get back a image that is square

#

wow i didnt know there was a video bot

late briar
#

guys how to fix inf generating chat ive been waiting for a whole week

rigid copper
#

sup guys

viscid timber
#

What's the best model for understanding videos?

#

Like best vision model that can see videos

vast sapphire
#

hi

rigid copper
#

for #january-1st-contest how many entries i can send at max? since in announcement or anywhere they did not specify how many entries you can share with.

#

nevermind, i guess 1 entry only

bright geyser
rose sky
jovial solar
#

No way qwen update video 5 second to 15 second

visual raptor
#

guys for the video generation @ what command

visual raptor
rose sky
# visual raptor which qwen model and where

Sorry for the late reply, but you can go to, "https://create.wan.video/" and sign in to your account, Google or whatnot, and start creating! You can mention any existing people or artists, it will accept. But make sure you select and use Wan 2.6 video model, the latest one, because that one has audio generation baked in too

rose sky
#

Anyone here knows Sonauto AI?

next coral
#

hello guys

prime umbra
#

I understand that you want to integrate battle mode into direct chat. However, please leave some space between them. Having a message immediately followed by battle mode makes no sense. There is a reason this is called direct chat and not battle mode. Please show some consideration.

zealous sparrow
#

Btw grok 4.2 is now gone from lmarena all the stealth modeld

#

Unsure if the one on design arena still exists

keen beacon
#

Is support working today?

ocean bison
high dirge
#

Is the video gen coming on the site since I saw the button for it but it vanished on next refresh

halcyon nimbus
#

is the open ai site more censored than lmarena? waste of money so far

#

cant make the usual established ips or even bikini stuff

ocean bison
#

Wan 2.6 also brings a Cameo-like feature.

#

You can upload a clip of yourself and insert a "Cameo" of you in a new generation!

#

Much like Cameos on Sora 2!

#

But you don't have to be on iOS and USA.

#

It's called Starring Roles!

sour spear
# halcyon nimbus is the open ai site more censored than lmarena? waste of money so far

It is. Also depends on where you're located, Europe and UK for example are heavier censored than the rest of the world, I think. But OpenAI have a track record of neutering their models. They do release new models, image & video in particular, with very low guardrails, so early adopters generate and share all kinds of cool stuff on x to build hype for them. Then, after a couple of days, they increase the censorship so you can barely generate anything that's not totally, 100%, HR-approved safe.

#

It's incredibly obnoxious. The sub-par GPT-5 (5.1 / 5.2) release added to that, and finally made me cancel my sub after three years, and switch to Gemini.

#

<@&1349916362595635286>

ocean bison
ocean bison
sterile tartan
#

Lol what!?

sour spear
sterile tartan
exotic tartan
#

Hey everyone! I worked really hard on a completely free, open source LMArena Chrome extension called LMArena Plus.
The intention was to add more context to the leaderboards by adding new columns (pricing, bang for buck, supported modalities etc), a column picker, optional notifications for when generations are ready for voting and you're on another tab, and there's more to come!

Just google "LMArene Plus" and download from the Chrome extensions store. Any feedback or requested features are welcome!

stone cape
#

came across this yesterday some chinese company, seems to be another iquest lab benchmax?

sterile tartan
#

Can it work in English?

#

And where's The Website to Use it?

stone cape
sterile tartan
stone cape
sterile tartan
#

With all honesty benchmarks are Deceiving

ocean bison
sterile tartan
#

Test and Experiment Yourself

stone cape
#

i said why do you think in chinese

sterile tartan
#

You can actually use browser translation

#

And it will translate the chinese thinking to english

stone cape
sterile tartan
#

If this is actually a Good Model we will see it somewhere on Artificial Analysis Index, LMarena, Yupp Ai

stone cape
ocean bison
#
sterile tartan
ocean bison
#

But it CAN think in English sometimes.

sterile tartan
#

Unity got 55% in Humanity's Last Exam

#

No Way

#

๐Ÿ’€

#

Dw if it's actually a Good Model it will get Hype

ocean bison
stone cape
#

good idea

ocean bison
sterile tartan
#

He seems interested

stone cape
#

running it however chinese have potato gpus๐Ÿ˜‚

sterile tartan
#

Well is still a new company

#

๐Ÿ’€

sour spear
#

Trying it out right now with a suite of coding prompts I throw at every model. So far it seems decent, but not particularly mindblowing. Generated a good result, then broken code in the next test. Judging by my first results, it can't compete with Claude or Gemini.

#

It's really good though, gotta give it that much credit

stone cape
#

it works but pointers and wsad movement is a bit wonky (2 prompts) first time it fell thru world

stone cape
#

but doable

ocean bison
stone cape
ocean bison
stone cape
ocean bison
sour spear
ocean bison
#

Does it save your chats locally?

#

@stone cape

stone cape
#

it does not seem to have chat saving

#

we have agi

ocean bison
stone cape
#

i'm not opening that

ocean bison
stone cape
#

๐Ÿ˜‚

ocean bison
#

It's a text file.

#

๐Ÿ˜‚

stone cape
#

no you sent it as .unity

#

not text ๐Ÿ’€

ocean bison
sterile tartan
ocean bison
#

See if it's AGI enough.

stone cape
ocean bison
ocean vortex
ocean vortex
#

I made Speciale 3.2 do that in Polish the other day lol

sour spear
#

It's a decent model, but it's making stupid, weird mistakes, like this one:

case 'Digit1': selectSlot(0); break;
case 'Digit2': selectSlot(1); break;
case 'Digit3': selectSlot(2); break;
case 'Digit4': selectSlot(3); break;
case 'Digit5': selectSlot(4); break;
case 'Digit6': selectSlot(5 break;
case 'Digit7': selectSlot(6); break;
case 'Digit8': selectSlot(7); break;
case 'Digit9': selectSlot(8); break;
}

The missing ); after selectSlot(5 is really dumb. Not to mention that the whole code was still broken even after fixing that manually.

ocean vortex
#

You only need to have system prompt in Polish but it can be literally anything. Without directly instructing it on how to do reasoning

#

and then you just write in Polish

#

and it starts reasoning in Polish

sour spear
ocean bison
#

Xiamen Unity Thread

stone cape
ocean vortex
# stone cape

To be fair OpenAI would have been able to copy-cat competitors with a small fraction of the cost as well if a reasoning model like that was already available prior to o1 ๐Ÿ‘€

sour spear
ocean bison
stone cape
ocean bison
#

"propertyations" - Is that a real word?

stone cape
#

i would like to see this in api

#

wonder how its like with tool calls

sour spear
#

The more I test it, the less impressed I am tbh. You're probably better off using Gemini 3 Flash instead.

#

Those benchmarks are fake af for sure.

sour spear
#

Let's face it, this model is mid and can't code properly.

stone cape
#

@sour spear i think the prompt you gave g3 had the code made from unity?

sour spear
# stone cape asked gemini 3 pro to make minecraft

My prompt was "Code a Minecraft game clone within a single .html file. Make it beautiful, with pixel graphics like the original, add all main features of the game, terrain and tree generation, mobs, and a bunch of other stuff. Do as much as is feasibly possible."
Just simple html.

ocean bison
sour spear
#

Your mileage may vary though. AI models aren't deterministic, and even the best model can produce broken results occasionally. That's why I wouldn't judge the Xiamen model from the very first test, either.

stone cape
ocean bison
rose sky
sterile tartan
#

So what's The Final Verdict?

#

For Unity

stone cape
sterile tartan
stone cape
rose sky
sterile tartan
#

Is still in Initial Testing

stone cape
# sterile tartan I assume in Couple of months they will optimise and Launch it like Xiaomi Nimo

what does the modal say
Reviewed 1 source
The modal explains the terms for trying the Unity model public test and asks you to confirm before using it.
โ€‹

Main points in the modal
Web preview only: The Unity model is currently only available as a web page preview and does not provide any commercial API or external calling service.
โ€‹

Feedback collection: The public test focuses on collecting highโ€‘quality user interaction feedback to improve the modelโ€™s reinforcement learning alignment.
โ€‹

Human review: Your prompts may be randomly assigned for anonymous manual review by researchers to improve safety and outputs.
โ€‹

Privacy warning: You are asked not to include personal privacy, company secrets, or other sensitive information in conversations to protect data security.
โ€‹

Consent button: At the end, it asks you to confirm that you have read and understood these terms and to start using the Unity model.
โ€‹

sour spear
stone cape
#

arena worthy?

sour spear
#

๐Ÿคฃ ๐Ÿคฃ ๐Ÿคฃ
Doing some image arena battle tests: "A hyper-realistic close-up of a personโ€™s face. Their left hand is pinching their own right earlobe, while their right hand is gently pulling down their lower left eyelid. The fingers must be distinct and correctly attached to the respective arms.

sour spear
# ocean bison Hunyuan: Trash?

It's not too bad usually, but it kinda misinterpreted the "gently pulling down their lower left eyelid" part. ๐Ÿ˜

#

I'm running some tests with prompts that aim to break, or at least challenge, AI image generators. Results are usually quite funny

sour spear
ocean bison
#

What about Gemini 2 Flash Image Preview?

sour spear
#

That's Nano Banana Pro and GPT-Image 1.5. The latter turned earlobe pinching into earlobe squashing, and both confused left and right hand, which was the key test of the prompt. But at least the didn't mutilate the person in the image.

ocean bison
#

(The old Nano Banana)

sour spear
ocean bison
sour spear
#

Photon is obviously a "classic" diffusion based image generator like Midjourney or Stable Diffusion. It can't understand complex image compositions, these tools are only good for "generate subject x wearing y with a backdrop of z".

stone cape
#

@sour spear ๐Ÿ’€

sour spear
grave plaza
#

lmarena is nice

sterile tartan
#

Please let me know when you have more verdicts ty @sour spear @stone cape

stone cape
#

seems like stupid code mistakes are gone

oak pythonBOT
#
ModMail Help Menu

ModMail is a feature-rich Discord bot designed to enable your server members to contact staff easily.

Please direct message me if you wish to contact staff. You can also invite me to your server with the link below, or join our support server if you need further help.

To setup the bot, run =setup.

sterile tartan
#

Yeah i Trust The Chinese

#

Just imagine how great it will be after proper release

stone cape
#

i wana hear @sour spear

keen beacon
#

Is working today staf that he can delete the website created with arena

sterile tartan
#

I wonder when we will Revolutionize Further from LLMs

prime umbra
#

I have a small request. Nothing big. You know, this is a direct chat, not a Direct X battle chat. Maybe use the battle mode less often. Iโ€™m not saying to remove itโ€”just less

I don't know when i can wrrite this. this not bug and not model-request idk where send this

sweet cove
#

Hello

prime umbra
sterile tartan
sweet cove
sterile tartan
sweet cove
#

This server contains AD ๐Ÿ˜…๐Ÿ˜‚

sterile tartan
#

@echo aurora

sterile tartan
ocean bison
#

<@&1349916362595635286>

sweet cove
#

@hardy swallow

ocean bison
#

And I reported as spam!

sweet cove
ocean bison
#

Promoting USDT crypto ads ๐Ÿ˜‚๐Ÿ˜‚๐Ÿ˜‚

sweet cove
#

@sterile tartan did you still practice the AI thumbnails?

sour spear
# sterile tartan Please let me know when you have more verdicts ty <@273737110291349504> <@248501...

It's still making stupid mistakes, and that has nothing to do with Unicode. Like this:
const const z = (Math.random() - 0.5) * 40;

I fixed that myself, so I could launch the app, encountered another bug, gave it back to the model so it could fix it, and it produced even more errors in the process.

It did produce a working 3D benchmark app, but veeeery barebones, and not really following my prompt instructions. Earth animation test (with weather simulation, country information etc.) only worked insofar as there was a textured globe with some clouds, but all else didn't work and threw errors in the console.

So my verdict still stands: it's a good model, with decent capabilities, but not really up there, and certainly nowhere nearly as good as they're claiming.

shell gale
#

Hello, everyone, I'm here to learn. Hope I'm welcome

foggy nest
#

claude is the best coding ai the opus 4.5 model right

#

fr4om all of them

#

chat gpt 5.2

#

gemini 3 pro

#

grok 4

neon idol
#

<@&1349916362595635286>

hollow ivy
#
poll_question_text

Who is correct?

victor_answer_votes

2

total_votes

4

victor_answer_id

1

victor_answer_text

Yann LeCun

ocean bison
sweet cove
#

๐Ÿ˜‚

echo aurora
ocean bison
#

@echo aurora is the expert, he cleans spam!

keen beacon
#

?

echo aurora
ocean bison
royal coral
#

anyone else think there chat gpt's have been super slow and laggy lately ? i use mine on windows desktop app and the browser (same issues)

echo aurora
royal coral
#

so weird, idk what the issue is

sour spear
royal coral
#

no like my whole ui is super laggy

#

and its defiently not my pc or internet

not sure what the issue is

sour spear
royal coral
#

idk ill figure it out i guess

#

u think claude worth getting into ?

whats best all around u think ?

#

kinda new to all this but tryna learn and have fun since use my chat gpt pro 24/7

sour spear
# royal coral u think claude worth getting into ? whats best all around u think ?

Best allrounder model is Gemini 3 without a doubt. it's very good at everything, best at most, very fast and has very generous usage limits. And Google also give you 2 TB of cloud storage and tons of goodies, simply because they can afford it.
If you're into coding, Claude is definitely the best. 4.5 Opus is just on another level compared to all other models.

echo aurora
sterile tartan
#

Hopefully it will get Better

ocean bison
#

@echo aurora

dapper walrus
#

i yhink site is overloaded

#

with requests

#

now he even refuse to fully load

royal coral
dapper walrus
#

u have that problem?

sour spear
royal coral
#

comes in today so lets see

#

but ye is goal

#

is chat gpt pro realyl not the top ?

sour spear
# royal coral is chat gpt pro realyl not the top ?

No, not anymore. Check out the LM arena leaderboard. They only top the image generation & image edit leaderboards, but even that is astounding to me, considering the grainy artificial look of the images it produces.

#

Nano Banana Pro (Gemini Pro image gen) can also do any aspect ratio, GPT-image can only do 3:2, 1:1 and 2:3

echo aurora
# dapper walrus

If you swap to cell data instead of wifi, are you seeing a difference?

ocean vortex
obsidian hawk
#

Sites fine for me

dapper walrus
#

what?
i tried on 4G and Home Wifi

dapper walrus
#

dunno what celling data

#

i will try on PC

ocean vortex
#

if aspect ratio is on 'auto' and not hardcoded to anything else with API parameters then that may work

sour spear
#

"A transparent glass cube. Inside the cube is a smaller wooden cube. Inside the wooden cube is a small red ball. The glass cube is being held by a giant robot, while a tiny bird sits on top of the red ball inside all the layers.
Aspect ratio: 18,5x11"

Image resolution is 2688x1600, which you can reduce (maintaining aspect ratio) to 185x110. So it's pixel perfect, even though 18,5x11 isn't used anywhere as valid aspect ratio.

#

Another major benefit over GPT-Image is that the Gemini app produces images at 2K resolution (2048x2048, comparable numbers for other aspect ratios), while ChatGPT is lower res.
Here's an example, same prompt as above, from ChatGPT image. It's only 1536x1024, and it fell back to its 3:2 aspect ratio, because it can't do anything else in landscape. Not even 16:9

#

Plus the ball is wrong, the bird's feet are wrong, and the whole image looks artificial.

neon idol
sour spear
#

Don't want to spam the channel, so I'll stop after this next example.
"A transparent glass shelf holding exactly six identical porcelain teacups arranged in a perfect straight line. A seventh teacup is levitating exactly two inches above the third cup in the row. Minimalist studio background."

First one is ChatGPT image, and it failed to count the cups correctly. The Nano Banana Pro image has better lighting and shadows, and if you look closely, you can see that it even rendered a reflection of the cups directly below and to the lower right on the floating cup.

ember iris
#

Hi guys

#

Is website having issue? Can't open the website. It's stuck at half way loading.

stone cape
#

uh oh their gpus are fried

hollow ivy
echo aurora
sour spear
shrewd citrus
#

wow that valuation sounds good

#

so how does lmarena make any money?

thorny haven
#

congrats on the raise!

warm zodiac
#

presumably we are basically an RL environment for the labs

neon idol
dim ivy
#

"Today, we're excited to announce our $150M funding round at a post-money valuation of more than $1.7B" they couldn't afford gpt-5.2 pro xhigh they said.

neon idol
#

And investments

hollow imp
#

@alpine pasture

#

What does that mean

jovial lynx
empty vector
#

Congratulations to the team! LMArena is so important ๐Ÿ™

thorny haven
# shrewd citrus so how does lmarena make any money?

https://news.lmarena.ai/ai-evaluations/

they sell eval services to ai labs that measure model performance for users across industries, also the anonymous slots on the public arena

LMArena Blog

Today, weโ€™re introducing a commercial product: AI Evaluations. This service offers enterprises, model labs, and developers comprehensive evaluation services grounded in real-world human feedback, showing how models actually perform in practice.

crystal rapids
#

I wonder why LTX-2 is not even on the arena, it just open sourced today. First open source video gen with native audio, most likely the best open source video gen out rn

restive steppe
#

Do raters get some of that big money

#

Top raters sweepstakes?

spark python
#

Does this mean lm arena gets big money for this

sour spear
# restive steppe Do raters get some of that big money

The "money" we get on this platform is the ability to use it all for free. Although I have to admit, the idea of rewarding people who actually rate instead of just freeloading AI tools is compelling. Maybe by giving higher rate limits to registered users, or something like that?

maiden jackal
#

Congrats LMArena. You deserve it. !!! Great work

ionic sonnet
scenic salmon
#

@steel bridge @alpine pasture Congrats on finishing series A

cyan moat
#

They finished Series A?

scenic salmon
cyan moat
#

ohh congrats yay

restive steppe
timber prism
#

make a car video

echo aurora
gleaming roost
queen veldt
#

Great you got the money now fix the website

night moat
#

helo. I'm new. How to use image to video? Why no response

echo sinew
night moat
#

Okay

stone cape
#

guys my work

#

with movementlabs new upgrade my jaw hit the ground

#

this is INSANE, NOT EVEN HYPING

#

prompt: minecraft clone in html css and js ultra realistc add real water ponds too

model: hawk max

stone cape
golden ocean
#

jk, thats impressive

#

im proud of u

stone cape
#

many hours typing ๐Ÿ’€

golden ocean
#

real

stone cape
dapper walrus
#

i see n9w

#

m0bile version didnt work
from chrome

quartz light
#

investors n stuff

proper jacinth
#

Can this AI generate images?

gaunt spade
#

any abusers will get their IP banned

#

and potientially hwid ban

gleaming roost
ember iris
#

Am facing issue with site, failed to accept terms...
Anyone else facing it

sterile tartan
sterile tartan
leaden vine
#

Does anyone know how to get the video function working on the lmarena website?

sterile tartan
sterile tartan
echo aurora
#

Hey sorry for the delay, would encourage you to check out this blog post if you haven't ready it yet: https://news.lmarena.ai/ai-evaluations/

LMArena Blog

Today, weโ€™re introducing a commercial product: AI Evaluations. This service offers enterprises, model labs, and developers comprehensive evaluation services grounded in real-world human feedback, showing how models actually perform in practice.

echo aurora
fair elk
echo aurora
fair elk
#

For example in direct mode when a error happens that I can't like copy or can't explain a bug u can't send images to help the ai

#

@echo aurora

spark python
#

what is the difference from nano banana pro 2k and nano banana pro

thorny schooner
#

does anyone know when they are gonna actually do anything about the captions /verification thing?

frigid tusk
#

why does the chat suddenly pass from direct to battle ?

fickle venture
#

Too many questions to answer

fickle venture
fickle venture
fickle venture
frigid tusk
fickle venture
frigid tusk
fickle venture
frigid tusk
fickle venture
tall vapor
#

congrats on the raise!! huge

echo aurora
echo aurora
thorny schooner
#

?

echo aurora
frigid tusk
#

see with others tho

echo aurora
frigid tusk
carmine rock
#

Will we one day be able to compare models of music generation, voice-over, and lip-syncing ?

echo aurora
echo aurora
frigid tusk
echo aurora
fervent bridge
#

hi

#

Hi, do you have any APIs?

echo aurora
blissful lark
#

Hello everybody I'm new on this server and I'm happy to be there with you

echo aurora
blissful lark
hollow ivy
#

India?

golden ocean
#

hes from india

hollow ivy
hollow ivy
#

-# (MENA: Middle-East/North Africa)

stray aspen
#

Why are you asking this

hollow ivy
stray aspen
#

Alrighty

astral elk
#

Good night community

winter frigate
#

captcha issues :(

echo aurora
echo aurora
#

Hello - I'm trying to get more information on a specific bug. If anyone is encountering an error that seems related, can you let me know?

Run 10 prompts, and the scrolling turns very laggy, even after it's done generating.

pale obsidian
#

probably true

rose sky
left tinsel
#

cool

rose sky
left tinsel
#

can u teach me how u did it?

rose sky
#

I just used https://sonauto.ai/. Just sign in / sign up, create a project, go to Simple Mode, and then literally just ask it, "A song in the style of Billie Eilish". It doesn't reject existing artist names, it just proceeds

#

It's better than Suno, for me. And it's also free and unlimited and the songs you generate can be used commercially without needing to pay them

#

They said it themselves in their T&Cs

echo aurora
#

Hello ablobwave a codenamed model can be removed for either. It is possible for a model lab to request that their model be removed. Model codenames are changed to the actual model names once they exit pre-release testing and launch.

rose sky
#

But the downside is that, it can generate a maximum of 1 minute and 35 seconds in one shot, but you can extend it further

left tinsel
left tinsel
rose sky
left tinsel
echo aurora
# left tinsel Hey admin, is everything okay? Iโ€™ve been getting the 'Something went wrong while...

Oh no! I'm really sorry to hear this! Can you give the steps in this article a try: https://help.lmarena.ai/articles/1645798556-lmarena-how-to-something-went-wrong-with-this-response-error-message

#

The first part of the article is steps to try that may help, the second section is how to report further information that is helpful for our team.

#

Iโ€™ve been on this site for 14 hours and this is the first time this has happened to me.
Based off of this, what you're liking seeing is you're hitting the rate limit. Each model is going to have it's own rate limit, and users will start to see Something went wrong when they're hitting this limit. @left tinsel

left tinsel
#

Dude, this is truly terrible. I want to be like Sonic; I should be active for 144 hours, not 14 haha xd

#

I'm declaring you the admin of the year; I'm so happy to have received such a quick response

echo aurora
hollow ivy
#
poll_question_text

When will we see a coding model with 2M context (or more), which can defeat all other existing models in coding, in all major programming languages, in all coding tasks?

victor_answer_votes

4

total_votes

10

loud verge
rose sky
hollow flicker
#

hey pineapple because in the where your prompts go it says its shared publicly but its anonymous if we accidentally share something bad like an api key will it be sent out there or does a team of moderators or a moderation ai review it?

gleaming roost
#

Good question

echo aurora
echo aurora
hollow flicker
#

alr

rose sky
minor bramble
#

any way to fix that it stuck like that for 10 mins already

fervent bridge
minor bramble
#

even after refresh it doesnt stop

fervent bridge
#

change the model

minor bramble
#

it says generating

#

is there a way i can mannualky restart it or idk

#

stop it

fervent bridge
minor bramble
#

no i just send "error" and image

gleaming roost
minor bramble
fervent bridge
gleaming roost
gleaming roost
minor bramble
#

just saves html?

gleaming roost
#

Sometimes this command cancels the request

fervent bridge
minor bramble
#

maybe i could wait for long time so it time outs idk

#

oh it did

fervent bridge
minor bramble
#

finnaly

fervent bridge
#

good

cloud oak
#

hello..is this the place to ask for technical help?.. ๐Ÿฅบ

rigid copper
#

sup

rigid copper
cloud oak
rose sky
#

Damnโ€ฆ looked at what my father cooked for me at home for breakfast

rigid copper
rigid copper
#

(my english is kind of bad)

cloud oak
gusty egret