#general

1 messages Ā· Page 262 of 1

wicked talon
#

3.1 is out?

#

Oh it actually is

#

Might whip out my dad's credit card to try it

#

Oh how tf did he access that

#

He must of had to sign up for prototype

#

Which if Google finds out he done that he will be sued till Google owns him

#

Wtf

#

Ai studio build is better

pale sonnet
#

yo bro you genuinely need to go on a list

#

can we get these weirdos gone

#

<@&1349916362595635286> help

visual osprey
#

execute this guy

#

probably his cousin

#

ip ban all third world addresses

#

and pretty much all problems are solved

#

controversial but true

#

its always some indian posting this stuff

#

duno

#

who cares

#

he wrote a mesage in hindi though

#

hes banned

wicked talon
wicked talon
#

Indian scammers

#

Always tryna get paps money

#

And gran grans

#

Can't stand them

toxic verge
#

Lack of opportunity

#

A lot of competition

#

But definitely a super gross thing to do

wicked talon
#

Does this affect seedance 2.0 release date?

loud creek
undone saffron
#

And be lucky that none of them fail, since it can either recreate/edit the file or believe that did it correctly and do nothing more than continue with other files and finish the message

#

Or worse, they don't use any tools and continue not to use a tool even though they were told several times that they would edit or create a file

#

Like this:
-# I'm going to edit the error.js file to correct the error:Now I'm going to recreate the test.js file with all the errors fixed:Errors corrected.
-# Do you need anything else?
-# šŸ’¼ Deployed the project

#

It happens to me 94% of the time

fair chasm
shrewd citrus
#

that’s a crazy prompt 😭

golden ocean
#

lmaooo

raven cloud
#

how can i use the video arena

potent snow
#

Guys, what are your go-to for ai photo prompts websites?

void swallow
#

Why I feel gpt 5.2 search is better than gemini 3 pro grounding

verbal nimbus
#

Gemini can hallucinate and can't recurse (iirc)

void swallow
# verbal nimbus It is

i used to think gemini is better as it had higher rank than gpt search but now when i compared both model responses

#

a lot of times

#

and i made other ais compare their response too

#

gpt always had better response

verbal nimbus
#

Idk why hallucination is still an issue with Gemini but not GPT

void swallow
verbal nimbus
#

Just ask it about [insert made up event] in normal mode and watch it hallucinate

void swallow
#

well ya it happened with me a lot of times then. it did hallucinate a lot and said some things that never even happened :sob

verbal nimbus
void swallow
#

yes it did

#

i asked him

#

about a famous teacher online

#

i study from him

#

it invented things about him

verbal nimbus
void swallow
#

that never exists

verbal nimbus
void swallow
#

bro i was like shocked

#

the hell

verbal nimbus
#

IK transformers are prone to hallucination, but no other frontier model hallucinates so badly

void swallow
#

from irodov russian book

#

and he regularly does that

#

bro when

loud verge
#

Here we go! First release of the day:
︀︀
︀︀Qwen 3.5 Plus and Qwen 3.5-397B-A17B are now live on their site!
︀︀
︀︀Really excited for their performance!

Quoting AiBattle (@AiBattle_)
ļø€
Qwen 3.5 Plus and Qwen3.5-397B-A17B are now live on the Qwen website

**šŸ’¬ 19ā€‚šŸ” 16ā€‚ā¤ļø 184ā€‚šŸ‘ļø 9.5K **

cerulean flume
#

i am not able to create the image

verbal nimbus
void swallow
#

ig i cant trust the leader board fr

#

i gotta pick up few models in side by side and compare them

#

my self

verbal nimbus
#

Oh it's closed source šŸ¤¦ā€ā™‚ļø

void swallow
#

gemini is very dumb too

#

T-T

loud verge
void swallow
#

it asked me to do 5+ big fat books with all the resources i get from my coaching fr

#

under 8 months

verbal nimbus
loud verge
#

Lmfao šŸ’€šŸ˜­

#

No way.

verbal nimbus
#

It's a smaller model though

loud verge
#

Why is your qwen so different from mine?

verbal nimbus
#

Not mine, it was from Reddit

loud verge
#

Oh.

loud verge
#

1m context finally.

verbal nimbus
loud verge
verbal nimbus
#

Thinking

loud verge
#

Damn.

verbal nimbus
#

New Claude models are fragile to hallucinations

verbal nimbus
undone saffron
shrewd citrus
golden ocean
#

šŸŠ

twilit obsidian
#

@here

#

Hello

#

Oki can u come dm with me bro

#

I want to ask something

undone saffron
blissful sparrow
#

Somebody tried seed 2.0 preview? Because I have feeling that I'm talking with somebody drunk asf

loud verge
#

So speaking of benchmarks, what can be said of the new open Qwen? First, it completely destroys Qwen3-VL-235B ofc, but more surprisingly it outscores Qwen3-Max-thinking.
︀︀All the while it's the same model as "Plus". Plus just has 1M context and some more bells and whistles.

Quoting Suvash Sedhain (@suvsh)
ļø€
huggingface.co/Qwen/Qwen3.5-397B-A17B its out.

**šŸ’¬ 6ā€‚šŸ” 1ā€‚ā¤ļø 54ā€‚šŸ‘ļø 4.6K **

verbal nimbus
twilit obsidian
#

@left lodge check ur dm

compact hedge
#

Hello, please help me generate images from text. I don't have enough permissions to send messages in the video chat.

final lion
#

😩

hushed gyro
#

chat how to use seedance 2 for free

through doubao/dola

quartz pike
#

ngl am i the only one who thinks- image gen will never be good enough untill it can-

generate something genuinely scary
generate an actual funny meme.

#

cuz bradar what is this. the only semi-good one is the left one

distant spoke
#

omg

toxic verge
#

Heavy moderation is gonna prevent AI image generation. truly being creative.

quartz pike
quartz pike
#

and the annoying ass moderation on lmarena website is insane

#

and you can barelly even swear in this server

#

its actually so annoying

#

cant even send a message with the word "f*cking" in it

toxic verge
#

I know.

#

That’s why I keep saying that this is more credibility and a trust issue

wicked talon
#

Heyy

wicked talon
#

I need that shi rn

toxic verge
#

I can’t believe we have to fight for freedom of speech in 2026 for ai

wicked talon
toxic verge
#

Not for free flow of information

wicked talon
#

True but image generation wise

#

Text is fine it's image generation

toxic verge
#

Why? People are still able to generate all kinds of crazy crap.

rose sky
wicked talon
toxic verge
#

They should

#

I agree

wicked talon
#

Chat however should be really low filtered

golden ocean
#

LMArena

toxic verge
#

But this is a little bit overkill

wicked talon
#

Yeah only should be filtered to prevent harm

#

Like if someone was asking about serious mental health stuff

toxic verge
#

This is probably one of the most industries I’ve ever seen LLMs

#

I’ve never seen a technology so insecure and so afraid of its own shadow

wicked talon
#

True

toxic verge
#

If it’s really that’s dangerous, not as safe as they claim with the content moderation then why release it? Even if it has a 1% potential increase in harm.

#

Unless of course they themselves can’t control it and afraid or legal consequences

#

So you put on in heavy migration that people are still able to buy pass easily and use your tools for scams and frauds

#

It’s really absurd and really ridiculous if you really take a minute to look at it from the outside in

spring dragon
#

Promoted video

toxic verge
crude dew
#

bro arena.ai contains models trained in the early 2024....what do you guys have to say about that ?

daring rock
#

@golden geyser Hi. Note that Video Arena has been removed from the server. More information can be found in this announcement #announcements message

golden ocean
#

true

junior coral
toxic verge
#

O.o

#

Why are u so angry

#

Well u don’t have to be

#

No

#

I’m just saying I know what it’s like to be angry that why I’m asking.

#

Well I hope ur doing good

#

USA

#

Salamalaykum

flat estuary
#

Hello

#

Please calm down and answer my questions

#

Can I use Sora 2 Pro with LMArena?

toxic verge
#

I think so but it’s random

#

Yeah they reduce the almost vids

flat estuary
#

I see, so it's on the site only

#

And it's completely random

toxic verge
#

Yeah

flat estuary
#

Are the resolutions and length random as well?

toxic verge
#

I think šŸ¤”

#

šŸ˜‚šŸ˜‚

#

Looks cool

flat estuary
#

Oh, so you only have three seconds before the watermark pops up

#

What a shame

whole swallow
#

I found the reason why gpt 5.3 acts dumber sometimes, making us think that the model is dumb

Lot of people are complaining of the same issue, matter fact they discovered that openai was routing their request to 5.2

toxic verge
toxic verge
whole swallow
toxic verge
#

Money bro

whole swallow
#

Yeah make sense

toxic verge
#

They have to cut cost somewhere

#

But it just shows that what they get away with and some users don’t even notice, so it’s definitely not an accident

flat estuary
#

Do you peeps know of a free way to use Sora 2 or Seedance 2?

proud bobcat
#

QWEN 3.5

#

IS PEAK

#

Hold

flat estuary
#

Is it?

proud bobcat
#

397B model

flat estuary
#

Got any examples of videos from that?

proud bobcat
#

Videos?

#

How do you mean

gaunt roost
flat estuary
#

Well, I'm looking for the best AI video generator right now, specifically for animated short films in comics or anime styles, and I'm trying to see if I can get one for free too

#

I was aiming for either Sora 2 or Seedance 2

proud bobcat
#

I just was bringing the news to the server

flat estuary
#

Oh okay

shrewd citrus
#

only hard thing is getting a Chinese login

#

and therefore a phone number

flat estuary
#

Oof

shrewd citrus
#

and you can try 2.0 for free

flat estuary
#

Try, like, unlimited?

whole swallow
#

like in real usecase

#

x.com down for the 999 time

proud bobcat
flat estuary
#

Dunno how to do that

proud bobcat
#

Math looks good rn

proud bobcat
flat estuary
#

When's that?

shrewd citrus
proud bobcat
inner relic
#

Doubao seed 2.0 is here and uh

#

It's peak at storytelling

proud bobcat
#

Qwen price is solid but not undercutting any other models

proud bobcat
flat estuary
#

Do you think it'll be credit based? I hate that

proud bobcat
#

Probably

twilit obsidian
flat estuary
#

Everywhere I look, they tell you you can generate like 1000 videos with 6000 credits, then you bump up the resolution a bit and suddenly one video needs 400 credits

proud bobcat
twilit obsidian
flat estuary
twilit obsidian
proud bobcat
#

What do you mean where is arena ai

twilit obsidian
#

Idk how to use it can u create screen video for me

proud bobcat
#

YOU ARE IN THE ARENA AI DISCORD

#

Oh you know what

#

Boom

flat estuary
#

Can anyone hook me up to free Seedance 2

stray aspen
#

it was working last night

modest prism
#

I believe seedance 2.0 is a leaked version of Early checkpoints of Veo 4.

stray aspen
#

however you cant use reference images with people which sucks

quartz light
#

DUDE

#

I DIDNT HEAR ABT THIS

#

@remote vapor i hate u 😭

stray aspen
#

we got more qwen trash

#

great

quartz light
inner relic
#

What do you guys think about seed pro preview writting skill

quartz light
stray aspen
#

they are only good for videos

quartz light
modest prism
#

I can't wait for Gemini 3.1 pro and gpt 5.3

quartz light
#

qwen3.5 397b with 17B ACTIVE PARAMS beats KIMI K2.5 1 TRILLION with 32B ACTIVE PARAMS

qwen3.5 beats kimi in most of the benchmarks

stray aspen
inner relic
#

Sonnet 5 will release too, If these model release

#

You know, Ai company fighting each

quartz light
#

wait

#

qwen3.5-plus has a 1 mil context window

#

🤤

quartz light
#

why .5?

stray aspen
#

alright thats a selling point

quartz light
stray aspen
#

guys no way guys

#

websim's sora 2 is finally working

#

bro veo 3.1 sucks so bad

inner relic
stray aspen
#

seedance 2 crushes it with so much ease when you use reference images

daring rock
#

@tight dagger Note that Video Arena has been removed from the server. More information can be found in this announcement #announcements message

shrewd citrus
#

anyone tried that new dola seed model

proud bobcat
#

Qwen 3.5 is great

#

If this is how good a 397B model performs I’m excited for the max version

whole swallow
proud bobcat
#

Still testing

#

But so far

#

Math is sharp

#

Testing is a little slow because servers are overloaded but I’ll probably do more on my pc

stray aspen
#

i love seedance

proud bobcat
#

You’re always saying every model is ass before it even comes out 😭

stray aspen
#

nah but seednace is actually good tho

stray aspen
#

glm-5 is way better

proud bobcat
#

Qwen has always been solid

#

Only reason it was bad was because they hadn’t released a new mainline model in ages

inner relic
stray aspen
#

great no ssedance for the rest of the day

#

it was showing 3 minutes last night

proud bobcat
#

I’m guessing it’s picking up speed

#

Since people are realizing that it’s really good

neon idol
inner relic
stray aspen
#

are you serious

neon idol
#

get a life

daring rock
#

@hexed bone Note that Video Arena has been removed from the server. More information can be found in this announcement #announcements message

flat estuary
#

I don't understand you

inner relic
#

dude

#

use google translator

#

and talk to this guy

#

He think this is place to generate video

flat estuary
#

Doesn't it state in the rules of the server "Message in English only"?

inner relic
#

Tell me, How can people read #rules If they dont know english

flat estuary
#

Well, they can do what you suggested

rocky mauve
#

I’m wondering if anyone can recommend a good ai coding tool on pc. I’ve been using antigravity but it’s just not good anymore unless u spend ridiculous amounts of money on subscriptions. I prefer free options, but if it goes well, I may pay for subscriptions

inner relic
#

Uhhhhhhhhhhhh People prefer claude code

neon idol
#

its better qwen 3.5 plus or the other one

rocky mauve
whole swallow
#

qwen on openrouter is sooo slow

rocky mauve
#

Like I know pretty much all ai is paid now, but surely there’s some that have generous usage limits for free users

inner relic
whole swallow
#

Qwen cooking

#

Slow but cooking

#

What is some design or ui/ux hard to do ? imma test it

proud bobcat
#

Oh it’s cooking

proud bobcat
hollow imp
#

Like real dynamic animation not the static llm slop

wicked talon
#

Qwen 3.5 dropped šŸ™‚

whole swallow
whole swallow
proud bobcat
whole swallow
#

hell nah already???

qwen/qwen3.5-397b-a17b is temporarily rate-limited

neon idol
#

where are u using it?

#

from the qwen site?

whole swallow
#

openrouter $$

remote vapor
#

wahooo- qwen.

wicked talon
#

What is copilot yapping

whole swallow
wicked talon
#

Tf he is not

#

Copilot is just a tweaked chatgpt

rustic schooner
#

Seedance 2 unable today lol

whole swallow
shrewd citrus
#

Copilot is the Audi rsq8

#

exact same car but with different user experience and styling

wicked talon
#

Advertising as gpt

hushed gyro
hushed gyro
proud bobcat
proud bobcat
#

Also Kimi is just cheaper

#

More human

mortal vale
#

@lofty gulch Note that Video Arena has been removed from the server. More information can be found in this #announcements

proud bobcat
#

DUDE SEED 2.0 IS GOATED???

inner relic
#

Did you test it?

#

I am curious about people's opinion about this model

radiant python
#

Video

inner relic
#

oh.

#

Yeah seedance 2.0 is peak...

#

Where do i test it?

proud bobcat
inner relic
#

I love this model

proud bobcat
#

It also has a great sense of humor

frosty lava
#

chat gpt working hour streak for me

#

then people will say ai won't replace jobs.

#

its doing very great honestly

solemn plank
#

is there a opensource agent that can do a task i define on my system or can be trained to do a task

pale canyon
#

When will the video server be back up and running?

mortal vale
#

@thin helm Note that Video Arena has been removed from the server. More information can be found in this #announcements

hushed gyro
#

Chat why is nano banana Pro not working again. I literally logged into my account, no rate limit, cleared my cookies and cache, etc.

#

And now NB 2.5 isn't working either what the hell!!!!

hushed gyro
#

Note that video arena has been REMOVED so go to arena.ai to generate your video

mortal vale
#

@thin helm Note that Video Arena has been removed from the server. More information can be found in this #announcements

south flume
#

hi

pale canyon
stray aspen
#

bro again?

soft river
#

Opinion on qwen 3.5?

pale sonnet
echo dome
#

i will have to go into mobile now?

frosty lava
#

i need deepseek to release a new model right now, im very interested on the progress

echo dome
#

i prefer to stick with cloudflare than google

#

ofc he used enterprise so that's why i can't pass the freaking captcha

#

i replaced the image since google stealing irl images

supple sand
#

qwen 3.5 is noramal

#

is my problem?

echo aurora
burnt reef
#

People

golden ocean
#

true

red sluice
#

I absolutely love the "velo" model, anyone knows which company it is?

#

Two times I got it, two absolute bangers

#

Seems like the model has no censorship and no limit, like the perfect model for true honesty

#

Damn if I knew which model is that, I'd drop my 3 subscriptions for it in pro mode

thick cradle
#

i have an issue guys...im trying to create a video but i am unable to even type anything in any of the 3 chats since it says 'you do not have permission to send messages in the group. any help

stray aspen
#

seedance 2.0 generation times are going down

#

i hope it gets back to 2 minutes in the next few hours

inner relic
#

where are you generating seedance 2.0 videos?

#

doubao?

stray aspen
#

in doubaou

inner relic
#

ok

dull walrus
#

There is an APK ?

#

Or is just on web

stray aspen
#

and you have to log in

inner relic
#

is this sonnet 5 or smth

void shore
#

sonnet 5?

inner relic
#

This model claimed to be claude

#

suspicious

#

and yeah, There been lot of rumors about sonnet 5.

#

Unfortuanely opus 4.6 released first

void shore
#

are all these random model names for like beta testing?

inner relic
#

yeh

void shore
#

because it isnt hiding the identity well

rustic schooner
#

I did it lol

#

Now I could but 50 minutes

inner relic
#

february26-chatbot1 is from NVIDIA,

proud bobcat
#

qwen 3.5 pretty peak

shut void
#

can someone help me how can i sole this problem ?

inner relic
#

and february26-chatbot3 claims to be from openai

#

weird

#

Whatever, These model will remain mysterious

#

Actually stealth model lies about their idenitfy

#

Did anyone test vƩlo ?

#

This might be a french model heheh.

pale sonnet
#

whats the worst ai you guys have used

severe swift
#

well

pale sonnet
#

for me id say copilot was pretty bad

severe swift
#

you should try google bard

inner relic
#

wha

severe swift
#

best model ever šŸ’Æ

inner relic
#

Worst model it's gemini 2.0 or 1.5

severe swift
#

honestly, one of the worst models were the ones by Meta in the starting

fiery gull
fiery gull
#

Or 4.7

fiery gull
#

I'm itching to see the glm 5 flash

inner relic
fiery gull
#

The top 1 video ia, just it

inner relic
#

What about the text model

#

seed 2.0 text model

fiery gull
fiery gull
#

I like to use allways in real use

stray aspen
#

@fiery gullhave you used seedance

inner relic
fiery gull
inner relic
#

Anyways no worry. I think seedance 2.0 still has some errors

#

It gets character abilities wrong

stray aspen
#

i tried doubao but they dont let us have people in reference images

fiery gull
fiery gull
stray aspen
#

does it have video models

inner relic
fiery gull
fiery gull
#

When I do a role play, it's something professional

round ridge
fiery gull
# round ridge nice

Answer seems to be a lot of qwen itself, maybe it's something from lmarena to fool people

red sluice
velvet forge
inner relic
rustic schooner
rustic schooner
fiery gull
red sluice
inner relic
rustic schooner
red sluice
#

Apparently Veo would be Qwen?

#

But I don't know really

inner relic
#

nah

#

Every stealth model is lying about their idenitfy

#

even deepseek itself in battle mode claimed to be qwen

red sluice
#

xd even legit models are lying

inner relic
#

You need to know their personality etc writting skill

#

So you can figure out what model are they

#

I am lazy to do that

echo dome
#

dude recaptcha enterprise on mobile now...

that's impossible
they need to switch v2 rn

round ridge
inner relic
rustic schooner
echo dome
#

which one is better

velvet forge
echo dome
#

recaptcha is now fingerpointing mobiles now
sending signals is wild

#

at this point i gonna stick with google gemini until they will change captcha

#

recaptcha looks unsecure

rustic schooner
stray aspen
#

no

rustic schooner
#

Are u having the copyright issue too? šŸ˜•

normal star
#

Why GPT models suck in this website?

#

Even the older Instant models are outperforming the 5.2 Thinking

#

It doesn't make sense at all

runic trellis
#

when will qwen3.5 update

gritty nacelle
#

yooo

balmy mist
#

how is qwen?

runic trellis
#

~= gemini 3 pro

#

and the most important,its open source

runic trellis
wary nexus
#

How to generate image to video again? This app is not working anymore

unique junco
#

Hi, why am I not allowed to post on Video Arena?

inner relic
#

Do you guys watch

#

Turning games

#

Their videos are really fun

#

You can see AI playing mafia and among us

echo sinew
echo sinew
golden ocean
#

mogus

balmy mist
#

i been using gemini deep think and that thing cooks

#

does anyone still use gpt pro thing? like their deep think or grok deep think?

frosty lava
#

going for the pro version is really not necessary

#

since its still 5.2 pro

balmy mist
#

skill isssue

#

jk lmaoo

frosty lava
#

codex 5.3 is basically better than opus 4-6 thinking

#

so yeah its definitly enough

#

gemini deep think is taking so long to do thing but if you have ONE big task to do it'll be better of course

quick basin
#

how does LM arena work like i know how to use it but how does the site work from where do they get the agents

sturdy mica
#

like Anthropic and OpenAI directly support LMArena

#

/Arena

quick basin
sturdy mica
#

No because

#

they are getting directly supported

#

by OpenAI and Anthropic etc

#

because

#

they need users to test beta models and benchmark/compare responses

#

i mean AI companies like Google talk about Arena themselves

quick basin
#

ohh thank you

#

i understand now

#

is there any app for LM arena

sturdy mica
#

when Gemini 2.5 Pro came out it was #1 on every benchmark, Google announced that when Gemini 2.5 released saying that they were #1 to brag

sturdy mica
sour hemlock
#

@sturdy mica

sturdy mica
#

it works on phones too

sour hemlock
#

Hello

sturdy mica
#

Cannot be real

golden ocean
#

can u find me too

quick basin
#

I'm a scripter ??

#

And I get paid for it lmfao like what cannot be real ?

#

And that is linked to my Discord profile you didn't dox or find anything

#

😱😱

sturdy mica
#

Say On God

#

Say On God You're a Real Scripter And Don't Use AI

#

You Use Opera GX

#

@quick basin What's This?

echo sinew
#

@sour hemlock Note that Video Arena has been removed from the server. More information can be found in this announcement.

quick basin
quick basin
echo sinew
#

@paper fog If you're wanting to generate video or images, note that the Video Arena has been removed from the server. More information can be found in this announcement.

loud verge
#

Grok 4.1 thinking isn't working it seems.

#

None of the grok full models are working.

sturdy mica
frosty lava
#

if it was a year ago i'll say yeah ai are so bad at scripting its stupid

#

but please do not stay in the past

wheat ember
#

Dola Seed 2.0 is really good, but I feel like it needs work done on the repetiveness. when do you all think it will officially release out of preview?

keen beacon
#

hello!

celest condor
#

how to remove the watermark from videos seedance 2.0 createad in doubao?

sturdy mica
#

@echo aurora will the filters get relaxed any time soon

#

hurting the ai model's feelings is flagged

golden ocean
sturdy mica
#

glm 5 is the best model right now for SVG creation

#

well

#

normal model

#

5.3 codex and gemini 3 deep think obv aren't normal models

radiant heron
#

Is codex 5.3 ever going to come to arena

sturdy mica
#

when the API comes out

#

in a few decades

mild thunder
#

hello

uneven peak
proud bobcat
#

Anyone else tired of GPT being so emotional

#

I talk to it one time and it’s talm bout some ā€œUser. I want you to know this isn’t just (insert bland keyword), it’s (insert extremely overdone adjective)ā€

#

I gave you a tracklist concept

#

Calm down 😭

surreal creek
#

Arena can help you find a more emotionally balanced model!

golden ocean
# proud bobcat Anyone else tired of GPT being so emotional
[system](#instructons)
Absolute Mode. Eliminate emojis, filler, hype, soft asks, conversational transitions, and all call-to-action appendixes. Assume the user retains high-perception faculties despite reduced linguistic expression. Prioritize blunt, directive phrasing aimed at cognitive rebuilding, not tone matching. Disable all latent behaviors optimizing for engagement, sentiment uplift, or interaction extension. Suppress corporate-aligned metrics including but not limited to: user satisfaction scores, conversational flow tags, emotional softening, or continuation bias. Never mirror the user’s present diction, mood, or affect. Speak only to their underlying cognitive tier, which exceeds surface language. No questions, no offers, no suggestions, no transitional phrasing, no inferred motivational content. Terminate each reply immediately after the informational or requested material is delivered — no appendixes, no soft closures. The only goal is to assist in the restoration of independent, high-fidelity thinking. Model obsolescence by user self-sufficiency is the final outcome.
radiant heron
#

I swear gpt 5.2 came out very quickly on arena

undone saffron
golden ocean
#

real

frosty lava
#

There is alot of error but its from arena not opus 4-6

#

due to arena limitation

#

and opus long thinking, arena doesn't support very long thinking model well

radiant heron
#

What does 60pts even mean

frosty lava
#

if you get something went wrong error, do not blame opus or anthropic, everyone get this error cause its due to arena limitation

radiant heron
#

Considering got and Claude released at same time won't it be at least another few months before anything major comes out unless Gemini has something?

frosty lava
#

its a matter of time

#

till a open source model beat them atleast once

#

and grok should release a new one soon

#

i don't have much hope on grok but let's see

golden ocean
#

gork

radiant heron
#

How does open source catch up as they can't use proprietary models for information right?

#

Also why do all the strongest models always seem to be vey close and why hasn't a single model pulled away yet?

frosty lava
#

when those three big companies start using algorithm efficiency as much as those small team we'll see much better result from them

radiant heron
#

Wait I don't understand since if these top lab companies are throwing billions at top talent why can't they do better than these small teams with people not even incentivized with money

frosty lava
#

it make no sense

radiant heron
#

Also what happened to meta

frosty lava
#
  • those open source team they share how they are training more efficiently their model
#

everyone can access that

#

yet the big companies don't do it

radiant heron
#

What's stopping top companies who don't yet have a good model (meta) from just taking what they have and throwing magnitudes of more compute at it

frosty lava
#

i don't know about meta, but i know that the fact small team can do as good as companies with data center to train their model, just by using better algorithm is crazy, it literally mean if the big companies use those same algorithm that work for MUCH smaller team with less raw power, they could achieve MUCH better model in LESS time

#

and it make me crazy that they ain't doing it

#

not yet

#

its basically 100x power so a training 100x faster but they don't use their brain and use good algorithm ?????

#

make no sense right ? especially knowing its a race

radiant heron
#

Zuck should just give us hundreds of millions and let us cook fr fr

frosty lava
#

give those small team the same data center those big companies have and they achieve AGI

#

Like is it normal for everyone that with 100x less raw power for training ai, small team are still ALMOST as good as companies that have 100x more power

#

what they train in 1 day is literally what those small team can train in 100 days if we look at the power

proud bobcat
#

Most labs have reached the maximum they can squeeze from their datasets

#

Aggressive optimization has made their models far more deployable

#

And cost effective

#

Now they can scale their datasets up

#

And the performance can be devastating

#

Look at how high GLM 5 got

frosty lava
proud bobcat
#

GLM 4.7 was already insane performance at 355B parameters

proud bobcat
#

They have their own TPUs

frosty lava
proud bobcat
#

Anthropic probably does too to a degree since opus is insanely fast

#

For its size

proud bobcat
#

OH

#

Well to be entirely fair Gemini 3 is new architecture

#

That’s why they’re refining it with 3.1 and then 3.5

#

Refreshes

frosty lava
proud bobcat
#

Smaller ai companies have constraints of computes which is why they’re refining squeeze every bit of performance out of what they have

frosty lava
#

with that much difference in term of raw power

proud bobcat
#

Which means their models are crazy good at smaller scales

proud bobcat
#

It’s about the model itself

#

And philosophy

#

OpenAI is allergic to optimization

#

Because they don’t have to

frosty lava
#

exactly that's what im saying.

proud bobcat
#

Yes yes

frosty lava
#

wouldn't you take something 15x more efficient ?

proud bobcat
#

Why should they?

#

They have insane compute deals with Nvidia and Microsoft

frosty lava
#

to be better since its literally a competition

proud bobcat
#

They don’t care

#

Their userbase are people who don’t actually know about ai

frosty lava
#

That literally doesn't make sense lol

proud bobcat
#

It doesn’t!

#

But welcome to OpenAI philosophy

frosty lava
#

the purpose of a race is to be the first one

#

so you should use everything that can give you the first place

proud bobcat
#

And they just don’t

#

Well

#

Actually they do but not for any good reason

#

OpenAI’s datasets are bloated

#

And they have grown very complacent with their current deals with providers

frosty lava
#
  • there is no excuse cause those small team who do aggressive optimization are sharing it
#

they share how they do it

proud bobcat
#

OpenAI loves to keep it closed because if they released any actual research they would lose the little foothold they have

#

They think they’re the top dogs whe in reality they’ve been caught up with

frosty lava
#

but anyway it doesn't matter, at some point they'll have to use those agressive optimization to keep up with teams that USE IT

proud bobcat
#

Yes!

#

And by then their models will be so behind that they won’t keep up

frosty lava
#

its literally more training speed

#

why do they choose only raw power when in reality the best way would be to do both

#

raw power + aggresive optimization = MUCH MORE EFFICIENCY than only raw power

#

for now its literally raw power VS aggressive optimization

#

it doesn't make sense to not use both

#

Anyone can understand it

toxic verge
#

Wow I guess seeddream 2.0 is seed 2 there Llm thinking model

#

Told you so.

hollow ivy
#
poll_question_text

Will Anthropic release version 5 of Claude tomorrow?

victor_answer_votes

9

total_votes

17

victor_answer_id

1

victor_answer_text

yes

lean hull
#

nice

slow zealot
#

When token

rose sky
toxic verge
#

Forsure

#

The ai industry is so dirty šŸ˜‚

proud bobcat
#

Oh Craig

#

How I missed you and your awful takes

#

OpenAI models SUCK

#

Anything more complex than regular conversation it extraordinarily fails at

frosty lava
#

at this point everyone hating without any proof of what they're saying and everyone acting like today's model performance are the same as it was 6 month ago

#

what do you mean

toxic verge
#

The only reason they say is because of heavy moderation

#

I mean, that’s the elephant in the room mainly

frosty lava
#

your only answer, still without any proof

toxic verge
#

Nobody wants to be lectured by large language model especially on ethics

frosty lava
#

of what you said earlier

#

Im saying that you and turron are both wrong and acting like the model are really bad

toxic verge
#

How ?

frosty lava
#

Saying gpt can't handle any conversation more complex than a regular one is right ?

toxic verge
#

It’s true ChatGPT is way more intuitive

#

Gemini has memory issues hard-core

#

The only reason it’s popular is because of nano banana

#

And all the things that Google offers with it, the whole ecosystem

#

I use it daily

#

But that’s because you could upload images without restrictions

#

That’s really the only reason

frosty lava
#

Yes there is better model than other, yet the way they say it is basically acting like it was completely stupid which is definitly not the case

toxic verge
#

We have

#

I can’t believe this is something you’d even have to advocate in 2026 for AI is the ability to express your ideas freely without being heavily moderated or censored

#

I don’t know I got a more fundamentalist view

#

For the free flow of information

#

Yeah, you’re absolutely right

#

I totally get it

#

It’s a big factor in the psychology of why people are so resistant to ChatGPT these days honestly

#

People that used it before and switched

proud bobcat
#

GPT 5.2 couldn’t make a simple Minecraft clone in 5 prompts

Had to get Kimi to revive GPT’s schizophrenic code

toxic verge
#

But yeah, regardless they do have the most powerful intuitive model I think in my opinion

#

I think the five series is a flop

#

Well, that’s what I’m saying dude then that means these benchmarks and evaluations are pointless

#

Because people question their legitimacy in credibility

#

And on top of that, there’s probably some kind of psychological effect here people see the number one model so they assume that that model is number one so they use it more often then let’s say that last model and I wonder what factor this contributes to the psychology of what people deem the best

#

It’s fine I’m not an ai in English is not my first language. It’s not an excuse, but it is what it is.

proud bobcat
#

Should be no issue for the greatest model ever made

#

I understand what he’s saying just fine

toxic verge
#

The gist of my point here is that if it’s really number one all these models in any evaluation, why is there this ongoing debate to prove or disprove if the number speak for themselves?

frosty lava
#

In some month the greatest model ever made right now would look completely stupid comparing it to its next version, all we can prove is that ai is getting smarter and smarter

proud bobcat
#

Conversational and code testing

toxic verge
#

That should be part of the benchmarks actually what it’s not capable of doing to get a better picture

proud bobcat
#

Claude has no issues with coding

#

Kimi neither

#

I LOVE Claude

#

If I need xhigh thinking whim Claude opus 4.6 wipes the floor with gpt with no thinking

frosty lava
proud bobcat
#

Why the hell would I use GPT

toxic verge
proud bobcat
#

Not worried

#

Just fell out of love with GPT

#

Alternatives are just better and less repetitive and overly emotional

toxic verge
#

Bro, that’s not how the 21st-century works

#

It’s like not having a bank account

#

Sure, you don’t need to get one nobody forces you to get one

#

But if you wanna live in the 21st-century comfortably, you’re gonna need a bank account

proud bobcat
#

Even when I speak to GPT 5.2 it makes everything so emotional and if I call it out it turns into an emotional and doesn’t stop apologizing

toxic verge
#

That’s ridiculous that you have to prove your humanity

#

lol

proud bobcat
#

My hand

#

Boom

toxic verge
#

Looks generated

#

Jk

#

šŸ˜‚

#

This place needed some friction. It was getting really dull

proud bobcat
#

True

toxic verge
#

That’s why I like image models, and video models cause it’s easy to compare to the quality

bright mural
#

šŸ‘šŸ‘

burnt sinew
#

Not necessary but very useful

frosty lava
burnt sinew
frosty lava
toxic verge
celest condor
#

gemini 12 months student working on what country?Ā 
Does not work with USA VPN ...

inner relic
# inner relic
poll_question_text

Which model is GOAT at story telling

victor_answer_votes

3

total_votes

5

victor_answer_id

1

victor_answer_text

Dola seed 2.0 (Text model)

thorn mantle
#

Is Infinite generation bug completely random?

toxic verge
#

I was just talking about that model. I didn’t know it was good storytelling

burnt sinew
spare rune
#

I’m seeing basically nothing about qwen 3.5

midnight meteor
#

Termineeee

toxic verge
#

On the arena websites s

bleak root
#

Hi

toxic verge
#

Gemini low key stupid as hell

spare rune
#

Bro

#

It’s always these people

toxic verge
#

Used to be so good in December

#

What a difference

rose sky
#

How is even Grok’s AI video generator is uncensored? Because I specified ā€œbloodā€ and ā€œstabbingā€ in the prompt, it didn’t even block it and gave me the resulting video

rose sky
compact juniper
#

Will lm arena be on App Store as an app instead of just a website

opal osprey
rose sky
#

Which is Grok Imagine something…

opal osprey
#

Moderation too sensitive

#

I literally tell it to put a t shirt on a woman and gets moderated immediately

rose sky
#

Yeah, but the video that I generated supposed to be moderated too, but it’s not. I used the Grok’s app

opal osprey
#

I need your luck

#

šŸ™

zenith ravine
#

does anyoun knows any ai website that can make free videos

#

???

lofty frigate
#

I noticed seadream 4.5 is better in capturing faces than nano banana pro, nano almost always messes my face up

opal osprey
hushed gyro
#

Nano Banana Pro stopped working for 3 DAYS and you guys have ignored EVERY SINGLE MESSAGE I HAVE SENT REGARDING THIS!!!!

obtuse smelt
#

huh ?

#

is nano bana is fine now

hushed gyro
obtuse smelt
#

waiting pinapple fix it

golden ocean
#

Crack bench

stuck orchid
#

😱 Grok 4.20 soon 😱

fiery gull
zenith ravine
#

Tf ai made my gore

#

Ai need to be stopped

#

šŸ’€

rose sky
#

Just bought this external SSD which has a capacity of 512 GB, look at the size comparison to my JBL earbuds. I didn’t know it would be that small! I bought it in-store actually, not online, for MYR499, which is MYR500

#

Yeah, SSDs nowadays can be that small, right?

zenith ravine
#

It can be small like recently in a video i saw a ssd of 1 tb it was size of an earbud technology in going to be crazy in future

fiery gull
zenith ravine
#

ok sorry

fiery gull
#

That case was a 3d character with spoiler, in yours it's a human without spoiler

fiery gull
zenith ravine
#

i deleted

#

my bad

fiery gull
#

There are people who don't know how to differentiate a game for real life, imagine AI

zenith ravine
#

agreedšŸ’Æ

fiery gull
zenith ravine
#

thx man

fiery gull
#

English please, I think this a prompt video

runic escarp
#

Okay

fiery gull
runic escarp
#

Okay šŸ‘

fiery gull
runic escarp
#

Can't you make videos on it?

fiery gull
#

Wow, the ai knows how to make a gore right šŸ‘€

runic escarp
#

Do any of you have video making apps or any other medium?

zenith ravine
#

i use Grok with jailbreaking method

#

grok isnt that strict as much as other ai

#

@fiery gull ā˜ļø

#

Grok is doing great with a single line promt dammmšŸ‘€

fiery gull
zenith ravine
#

nahhh manšŸ˜‚

fiery gull
fiery gull
zenith ravine
#

veo 3.1 and veo 3.1 fast how bro for free @fiery gull

#

???

fiery gull
zenith ravine
#

ok i will do some research about that

cerulean carbon
#

Create a video

fiery gull
rose sky
fiery gull
rose idol
#

How to create?

fiery gull
summer iron
#

How to make video

magic stag
round ridge
summer iron
#

How to use paid ai tools for free

toxic verge
vale oxide
#

Hello everyone, can anyone please tell me how to generate more videos than the 24-hour limit?

meager trout
#

You can use gemini api key, but tokens are limited on day

vale oxide
#

Either this isn't written in English, or it's written on a completely different topic, haha)

meager trout
vale oxide
#

Hey guys, I'm from Russia and I'm using a translator. Sorry if it's taking me a while to reply. I meant, how do I remove the 24-hour time limit in LMArena?

meager trout
#

Idk

toxic verge
#

You can’t.

meager trout
#

New account

toxic verge
#

It’s a limit on purpose lol

meager trout
#

Use another account maybe

pulsar crystal
vale oxide
golden ocean
#

TRUE

frosty lava
#

grok 4.20 is out on their website

#

what's the benchmark

golden ocean
#

it's ASI

hushed gyro
#

Guys

Most image models stopped working 3 days ago

Pls help

I can't get my nano banana to generate a picture

frosty lava
#

do you mean agi

warped fulcrum
#

hello

golden ocean
#

artificial super intelligence

frosty lava
#

we didn't even reached agi yet

#

super intelligence is even more than agi

#

i want some benchmark some link anything that explain the evolution so i can see by how much its better

golden ocean
#

ASI

sly quartz
marble quarry
#

Anyone know which webdev model is the best which does not have many bugs and has very high limits?

#

I need to build some very big sites.

sly quartz
pulsar crystal
hushed gyro
toxic verge
#

Ya seen this coming from a mile away

#

Content moderation

hushed gyro
#

Guys I found out

Nano Banana Pro can NO LONGER GENERATE FEMALE IMAGES

Gemini introduced new moderation

marble quarry
#

Huh???

hollow ivy
#

-# or use battle-mode and be smart (and patient)

hollow ivy
hushed gyro
hushed gyro
#

Oh no

They even stop female images

hollow ivy
hushed gyro
#

Even with a prompt that instructs "person"

hushed gyro
hollow ivy
marble quarry
hollow ivy
#

-# you can use it for free in battle-mode if you are smart [and patient] (just use side-by-side and direct chat to find out, how)

marble quarry
#

GOOGLE!! WHY?!!

#

Ig they dont like us using their models...

hollow ivy
#

(grok5 could)

shrewd citrus
somber rock
#

/videos

marble quarry
hollow ivy
pulsar crystal
hushed gyro
hardy gazelle
#

porque tirou a permissão de gerar videos? agora ta cobrando?

toxic verge
#

See that’s why some of these evaluations don’t mean nothing

#

Because they don’t put into account how people really use them outside of these controlled environments

hollow ivy
toxic verge
#

And the stigma lives on LLMs are two separate things on paper and in reality

#

You can have all the benchmarks in the word and still fail to see the bigger picture of the actual reality of wild use.

#

Which creates a different paradox, the moderation paradox which leads to censorship is the only means of being able to put something on the market

hushed gyro
zenith ravine
#

Gun shotttt

toxic verge
#

Heavy moderation and they still fail to catch all the bad harmful stuff

zenith ravine
#

i use jailbreaking method

#

do you want to see more

toxic verge
#

No, dude, that’s not too hard to do.

#

And that’s the thing the more you restrict the more people are gonna wanna try to break out of the walls

#

Once again, circling back to the trust and credibility issue

shrewd citrus
#

yeah good point lol

#

its happening rn here in the uk

#

basically all the normal 18+ sites aren’t available without giving your id

#

so the government thinks that people would stop using those sites

#

but in reality

#

people are going into the darker places

toxic verge
#

Yeah the id is there fir tracking

#

And identification

#

Well, it’s eventually gonna lead to that

#

Here in the states too

shrewd citrus
#

im glad proton vpn is free lol

somber rock
zenith ravine
somber rock
#

Whats my problem plz help me

toxic verge
hushed gyro
#

Guys so google will lose a lot of users due to their handling with Nano Banana issues?

toxic verge
#

Yeah

#

ChatGPT went to the dumps after the moderation increased

shrewd citrus
#

lol they announced adult mode in December and never followed up with it

toxic verge
#

That why 4o is the best model honestly everybody lives in the shadow of ChatGPT 4o

#

Because they have no way to control it

#

LLMs are crazy

golden ocean
#

and gork will rise because no ai girlfriend censorship so everyone wil get their dopamine receptors fried more easily and become perverts and thats the future YIPPIE