#general

1 messages ¡ Page 142 of 1

fiery gull
#

@gregorius229

#

The end is coming 😞

#

The one guy of the just ones 5 atives guy in lmarena banned

#

20% of ALL Lm Arena is... Dead...

misty vault
#

NOOOOO not crack

#

He was famous for his crack bench

robust yoke
#

R.I.P.
Greg
???? - 2025
He was a good man, and he always made people laugh with his funny humor. May God bless his soul.

violet orchid
#

Nah chat dont worry

robust yoke
#

All we have left of him is his floating soul to talk to usâ€Ļ

violet orchid
#

Remmber my nani banani and sodadream wifes

robust yoke
#

I'll take care of them while you're gone.

violet orchid
#

Remember

robust yoke
violet orchid
#

hope is always the last to die

robust yoke
#

The wise words of the dead.

violet orchid
#

Maybe for now I can rest

#

Idk

violet orchid
robust yoke
violet orchid
#

Mod killed me đŸĨ€

#

Vote them they are the imposter!!!

robust yoke
#

Pineapple is the imposter, and we're the crewmates.

violet orchid
#

Real

#

But he wasnt the mod that killed me

robust yoke
#

Ah.

violet orchid
#

Lemme see who was

#

was @daring rock

#

@robust yoke

robust yoke
#

Ah.

#

I can't believe the crewmate got killed by Toothless the imposterâ€Ļ

#

😔

violet orchid
#

@fiery gull hey sorry for ping

#

I have a question for u

#

I have said something of politic/NSFW... today?

sharp mirage
#

Yo

#

What happened to. We Want to Learn from YOU

violet orchid
#

What

sharp mirage
violet orchid
#

Im not understanding

echo aurora
sullen quest
robust yoke
#

He's Toothmaxxing that jawline.

#

Alsoâ€Ļ

#

No way!!!1!1!11!

#

LM Arena, and LM Arena 2!!!1!1!

hollow imp
#

@echo aurora WHATS GOING ON

fiery gull
hollow imp
#

Huh?

🍍 !!

robust yoke
#

@fiery gull

#

@fiery gull

fiery gull
#

Hi

robust yoke
#

@fiery gull Greetings.

echo aurora
fiery gull
#

Thx u

echo aurora
#

Incase one goes down

fiery gull
robust yoke
echo aurora
fiery gull
#

Just joke I know this 😭

covert pewter
#

mm

fiery gull
#

But I want greg revived đŸĢŖ

fiery gull
fiery gull
robust yoke
fiery gull
#

Idk

robust yoke
#

Anyhow, if anyone needs a reliable translator that can translate into any language with accuracy and slang usage, then I've got one here: #ai-creations message

robust yoke
#

Even DeepL sometimes makes mistakes, but that one is the most efficient.

toxic egret
#

GPT-5-High should show reasoning trace?

sullen quest
#

..

polar venture
#

/video make this city alive

sullen quest
#

ok

hollow imp
sullen quest
#

..

polar venture
#

/video prompt make this under water city alive

hollow imp
sullen quest
fiery gull
agile barn
#

any idea on gemini 3 launch date frends

fiery gull
#

I'm 48 hours waked up 😭

desert abyss
fiery gull
low lion
#

how can I access the different modes in LMArena? (battle, side by side, and direct)

agile barn
#

mostly losing the money

hollow imp
agile barn
#

😭

hollow imp
#

@agile barn

sullen quest
#

gato is a new mod and already looks tired

agile barn
hollow imp
agile barn
fiery gull
hollow imp
agile barn
fiery gull
#

Just a 70 dollars???

#

😭

hollow imp
#

Mera bhala hojaye

fiery gull
#

Is a max glm 4.6 plan

agile barn
hollow imp
#

@agile barn ?

brave cloak
#

Yooooo
Is there a prompt checker or rewriter, that makes a prompt lmarena terms of use friendly, with changing the prompt as little as possible?

cyan barn
#

hello

#

where can I view my videos created?

sullen quest
#

just wait\

brave cloak
quartz pike
#

how do i fix my gemini being depressed

brave cloak
#

turns out lmarena really hates the term 'bent over'.

quartz pike
brave cloak
sullen quest
#

maek a new chat

hollow imp
quartz pike
#

and gemini decided it wanted to be depressed today

#

idk why

hollow imp
#

Increase the temparature

quartz pike
#

i did.

#

i did before i even started it

#

increased from 1. to 1.25.

hollow imp
#

Huh

#

1 is max

brave cloak
#

What if you edit gemini's responses

#

with toxic postivity

hollow imp
#

Why even use gemini

#

Use gpt 5 high

brave cloak
#

fool it into thinking it did it itself

quartz pike
#

thats the max

hollow imp
#

Oh

#

@brave cloak 😡

brave cloak
#

huh

burnt sinew
#

with cli, at least

hollow imp
burnt sinew
sharp mirage
stray aspen
#

I only use it when gemini can get stuff right

violet orchid
#

No Gemini news?

hollow ivy
wraith grotto
#

anyone know how to make consistent characters on sora?

stray aspen
burnt sinew
stray aspen
#

gemini 4 is not testing stage

burnt sinew
violet orchid
#

Gemini 5 when?

stray aspen
#

in the case we get that far without a nuclear war

toxic egret
inner gate
#

Gemini 3 is out?

hollow ivy
#

and if a guy was suicidal and tried it, the operators would reject it

#

(like Arkhipov and Petrov etc.)

#

as long as there are humans controlling the firing, it would not happen

echo sinew
#

Hey guys! Let's please keep the chat around AI topics. Thank you.

hollow ivy
#

(sorry)

echo sinew
hollow ivy
#

(for certain people or groups)

#

i wonder, why do people think, that GPT5-high is better in coding than Claude-4.5-thinking?

sullen quest
#

its higher in web dev

#

and thats what I care about

pastel basin
#

Oi gente

stray aspen
#

It better be SotA

golden ocean
#

crack

viscid cloak
#

Honestly, what’s the point of vote craps made by Gemini 2.0 when there’s apparently an advanced Gemini 2.5 (image gen)đŸ¤Ŗ10+ consecutive round of NO Gemini 2.5 results. Battle mode should have those kind of limit.

vital lake
#

Guys is GenSpark GPT 5 Pro unlimited legit or fake?

sullen quest
#

I don't know

#

idk what the argument was about, but it sounds like to me like its talking about how reasoning models solve problems vs non reasoning models and how its inefficent?

eg. solve 1+1 the reasoning model does the thoughts and solves 1+1 =2, then has to say it again, meanwhile the non reasoning model just says 1+1 = 2?

#

which is?

vital lake
#

Lot of people want Pro

sullen quest
#

yeah looking at the paper, sounds like to me they are saying exactly what I said

sullen quest
delicate frost
vital lake
sullen quest
#

yeah the hybrid model seems to be just a better version of the autorouter, which isn't that complicated since I thought of that a while ago too.

burnt sinew
verbal nimbus
#

Wasn't there a Singaporean paper on this

#

When they increased the samples, the base model started match or exceed the RLVR model.

mild knot
#

"Create a video for me where two 6-year-old children, dressed in panjabi, pajama, and wearing caps, walk from opposite directions along a lush green rural path and meet each other. Upon meeting, they begin a conversation in Bengali: Dialogue 1: Assalamu Alaikum Dialogue 2: Wa Alaikumus Salam Dialogue 1: Do you know that the first right of a Muslim is to greet another Muslim with Salam upon meeting? Dialogue 2: Yes, you’re absolutely right. It’s also mentioned in an authentic Hadith that those who exchange Salam abundantly are destined for Paradise. Dialogue 1: Subhan-Allah"

novel crater
#

brother what

toxic egret
hard quiver
#

Do you think the Sora 2 app could fall into irrelevance now that uploading images of real people isn’t allowed and copyrighted characters can’t be used? Honestly, this is what almost everyone wants it for.

zealous ledge
#

Hello!

violet orchid
#

Greetings

robust yoke
#

Greetings, Greg.

violet orchid
robust yoke
surreal robin
#

.

violet orchid
robust yoke
#

Sapeva che ho creato un traduttore molto avanzato che utilizza un LLM per la traduzione?
Translation: Did you know that I made a very advanced translator that uses an LLM for translation?

robust yoke
violet orchid
robust yoke
violet orchid
#

I dont use the pc

#

:\

robust yoke
violet orchid
robust yoke
violet orchid
#

Damn its awesome

robust yoke
#

Nice.

violet orchid
#

Nice job

robust yoke
#

You can type in Italian, and it'll translate to native English.

violet orchid
#

Seen

robust yoke
#

You can even toggle slang usage which will incorporate common slang that language uses.

violet orchid
#

Bht qait, u used the google translate service and then you uograded hin whit Gemini?

robust yoke
violet orchid
robust yoke
violet orchid
#

Damn

vestal oyster
#

please help me how to use sora 2 on lmarena

robust yoke
vestal oyster
#

nah it did not works

violet orchid
robust yoke
vestal oyster
#

i see other videos

violet orchid
robust yoke
violet orchid
robust yoke
vestal oyster
#

yes

robust yoke
#

And what message did you get?

vestal oyster
#

i get without audio video

robust yoke
#

That's normal.

#

You'll just have to keep generating until you get Sora 2.

vestal oyster
#

thanks

violet orchid
#

Average iq

robust yoke
violet orchid
vestal oyster
#

sora 2

robust yoke
#

Yours?

violet orchid
#

I have also tried hailuo 2.5 and the other of dream machine

#

I dont remember the name

robust yoke
#

Hm.

vestal oyster
#

anyone need sora 2 invite code?

robust yoke
#

Nah.

violet orchid
polar niche
#

Anyone tried 2.5 pro deepthink?

robust yoke
violet orchid
#

Who is the dumbass that pay 200â‚Ŧ in AI

#

Absolutely no me

robust yoke
polar niche
#

I just want try the the latest AI

#

Without paying that much

violet orchid
#

Its not so new

#

It cane out 3 months ago or 2

polar niche
#

What's the best/newest

robust yoke
polar niche
#

Smartest

robust yoke
#

The smartest model right now would be Claude Sonnet 4.5.

violet orchid
polar niche
robust yoke
polar niche
#

Also how is LM Arena profiting from adding all models for free?

polar niche
robust yoke
robust yoke
polar niche
#

I am testing them to solve some hard tasks

#

And they can't

robust yoke
#

What "hard tasks", specifically?

violet orchid
#

Yes i am interested

#

What is the prompt

polar niche
full tangle
#

guys how can i access sora 2?

polar niche
#

Like encrypted messages

violet orchid
#

Oh yes yes

#

Can I have one pls

#

A prompt

robust yoke
robust yoke
full tangle
verbal nimbus
polar niche
polar niche
#

Shouldn't be so hard for a robot

verbal nimbus
robust yoke
verbal nimbus
#

Maybe try word unscrambling first

robust yoke
#

Like for instance, I'm in a Telegram server that gives out that stuff.

polar niche
verbal nimbus
vestal oyster
#

C5472W sora 2 invite code

robust yoke
#

See? Just like that.

violet orchid
polar niche
robust yoke
full tangle
verbal nimbus
robust yoke
polar niche
#

gpt-5-high is the best so far

robust yoke
#

DeepSeek is very good at thinking out complex problems.

#

It contradicts itself, which is efficient in improving its answers.

polar niche
#

I was shocked about this but actually microsoft copilot solved very well

polar niche
full tangle
polar niche
#

It just thinks for a long time then errors out

robust yoke
polar niche
#

Do they have 3.2 exp thinking?

robust yoke
#

Yes.

#

And it supports multiple file types, including HTML.

polar niche
verbal nimbus
polar niche
#

I've yet to try gpt-5-pro thinking

verbal nimbus
polar niche
#

Any good?

#

Yes

verbal nimbus
#

I feel like it's kinda cheating lol

#

since timeouts aren't counted

full tangle
verbal nimbus
#

but probably just a bug

polar niche
#

Should be a clear indicator for a timeout

#

Or reasoning count

verbal nimbus
#

because it can just timeout for stuff it doesn't know how to solve

polar niche
#

Not just an error

full tangle
verbal nimbus
#

it's not counted anyways

#

but i think it's theoretically possible to cheat that way...

#

coz it'll answer on prompts its confident in but crash on ones its not confident in (high reasoning time)

#

it's like being able to ignore questions you don't know on an exam and not losing marks for it

polar niche
#

Is the gemini 2.5 flash lite preview any good?

verbal nimbus
#

it's fast but otherwise not really

robust yoke
polar niche
#

The latest added other than deepseek

#

Text model

verbal nimbus
#

glm 4.6?

polar niche
#

Oh yeah, no way it can help me

verbal nimbus
#

it's actually quite good

lusty vortex
#

āϰāĻžāϤ āϤāĻ–āύ āĻĒā§āϰāĻžāϝāĻŧ āĻŦāĻžāϰ⧋āϟāĻžāĨ¤ āĻļāĻšāϰ⧇āϰ āĻļ⧇āώ āĻŸā§āϰ⧇āύāϟāĻž āĻ›āĻžāĻĄāĻŧāϤ⧇ āϝāĻžāĻšā§āϛ⧇āĨ¤ āϰāĻžāĻšāĻžāϤ āĻĻ⧌āĻĄāĻŧ⧇ āĻāϏ⧇ āĻĒā§āĻ˛ā§āϝāĻžāϟāĻĢāĻ°ā§āĻŽā§‡ āωāĻ āϞ āĻ āĻŋāĻ• āϏāĻŽāϝāĻŧ⧇āĨ¤ āĻŸā§āϰ⧇āύ⧇āϰ āϭ⧇āϤāϰ āĻĒā§āϰāĻžā§Ÿ āĻ–āĻžāϞāĻŋ—āĻŽā§āϞāĻžāύ āφāϞ⧋, āϜāĻžāύāĻžāϞāĻžāϰ āĻŦāĻžāχāϰ⧇ āϕ⧁āϝāĻŧāĻžāĻļāĻžāĨ¤

āϏ⧇ āĻāĻ•āϟāĻž āϕ⧋āϪ⧇āϰ āϏāĻŋāĻŸā§‡ āĻŦāϏāϞāĨ¤ āĻšāĻ āĻžā§Ž āϖ⧇āϝāĻŧāĻžāϞ āĻ•āϰāϞ, āϏāĻžāĻŽāύ⧇āϰ āϏāĻŋāĻŸā§‡ āĻāĻ• āĻŦ⧃āĻĻā§āϧ āĻŦāϏ⧇ āφāϛ⧇āύāĨ¤ āĻŽā§āϖ⧇ āĻšāĻžāϞāĻ•āĻž āĻšāĻžāϏāĻŋ, āĻ•āĻŋāĻ¨ā§āϤ⧁ āĻšā§‹āϖ⧇ āϝ⧇āύ āϕ⧋āύ⧋ āĻ—ā§‹āĻĒāύ āϰāĻšāĻ¸ā§āϝāĨ¤ āϰāĻžāĻšāĻžāϤ āĻāĻ•āϟ⧁ āĻ…āĻ¸ā§āĻŦāĻ¸ā§āϤāĻŋ āύāĻŋāϝāĻŧ⧇ āϤāĻžāĻ•āĻŋāϝāĻŧ⧇ āϰāχāϞāĨ¤

āĻŦ⧃āĻĻā§āϧ āĻŦāϞāϞ⧇āύ, “āϤ⧁āĻŽāĻŋ āϤ⧋ āĻāχ āĻŸā§āϰ⧇āύ⧇ āĻ“āĻ āĻžāϰ āĻ•āĻĨāĻž āĻ›āĻŋāϞ āύāĻž, āϤāĻžāχ āύāĻž?”
āϰāĻžāĻšāĻžāϤ āϚāĻŽāϕ⧇ āωāĻ āϞ, “āφāĻĒāύāĻŋ āϕ⧀āĻ­āĻžāĻŦ⧇ āϜāĻžāύāϞ⧇āύ?”
āĻŦ⧃āĻĻā§āϧ āĻļāĻžāĻ¨ā§āϤāĻ­āĻžāĻŦ⧇ āĻŦāϞāϞ⧇āύ, “āĻ…āύ⧇āϕ⧇āχ āĻāχ āĻŸā§āϰ⧇āύ⧇ āĻ“āϠ⧇ āύāĻžāĨ¤ āϝāĻžāϰāĻž āĻ“āϠ⧇, āϤāĻžāϰāĻž āĻ•āĻŋāϛ⧁ āϖ⧁āρāϜāϤ⧇ āφāϏ⧇â€Ļ”

āϰāĻžāĻšāĻžāϤ⧇āϰ āĻļāϰ⧀āϰ āĻ āĻžāĻ¨ā§āĻĄāĻž āĻšāϝāĻŧ⧇ āϗ⧇āϞāĨ¤ āĻŸā§āϰ⧇āύ⧇āϰ āĻ—āϤāĻŋ āĻŦāĻžāĻĄāĻŧāϛ⧇, āĻ•āĻŋāĻ¨ā§āϤ⧁ āϜāĻžāύāĻžāϞāĻžāϰ āĻŦāĻžāχāϰ⧇ āϕ⧋āύ⧋ āĻĻ⧃āĻļā§āϝ āύ⧇āĻ‡â€”āĻļ⧁āϧ⧁ āϘāύ āĻ…āĻ¨ā§āϧāĻ•āĻžāϰāĨ¤ āϏ⧇ āĻŦ⧁āĻāϤ⧇ āĻĒāĻžāϰāϛ⧇ āύāĻž, āĻŸā§āϰ⧇āύāϟāĻž āϕ⧋āĻĨāĻžāϝāĻŧ āϝāĻžāĻšā§āϛ⧇āĨ¤

āĻŦ⧃āĻĻā§āϧ āĻŦāϞāϞ⧇āύ, “āϤ⧁āĻŽāĻŋ āϝāĻž āĻšāĻžāϰāĻŋāϝāĻŧ⧇āĻ›, āϏ⧇āϟāĻž āĻĒ⧇āϤ⧇ āĻšāϞ⧇ āϝāĻžāĻ¤ā§āϰāĻž āĻļ⧇āώ āĻ•āϰāϤ⧇ āĻšāĻŦ⧇āĨ¤â€

āĻ āĻŋāĻ• āϤāĻ–āύāχ āĻŸā§āϰ⧇āύ āĻĨ⧇āĻŽā§‡ āϗ⧇āϞāĨ¤ āϰāĻžāĻšāĻžāϤ āĻšā§‹āĻ– āϖ⧁āϞ⧇ āĻĻ⧇āĻ–ā§‡â€”āϏ⧇ āύāĻŋāĻœā§‡āϰ āĻŦāĻŋāĻ›āĻžāύāĻžāϝāĻŧ, āϘāĻžāĻŽā§‡ āϭ⧇āϜāĻž āĻļāϰ⧀āϰ āύāĻŋāϝāĻŧ⧇ āĻļ⧁āϝāĻŧ⧇ āφāϛ⧇āĨ¤
āϘāĻĄāĻŧāĻŋāϤ⧇ ⧧⧍āϟāĻž āĻŦāĻžāĻœā§‡āĨ¤
āĻŦāĻžāϞāĻŋāĻļ⧇āϰ āύāĻŋāĻšā§‡ āĻĒāĻĄāĻŧ⧇ āφāϛ⧇ āϏ⧇āχ āĻĒ⧁āϰāύ⧋ āϟāĻŋāĻ•āĻŋāĻŸâ€”āϝ⧇āϟāĻž āϏ⧇ āĻŦāĻšā§ āĻŦāĻ›āϰ āφāϗ⧇ āĻ›āĻŋāρāĻĄāĻŧ⧇ āĻĢ⧇āϞ⧇āĻ›āĻŋāϞāĨ¤

āϚāĻžāĻ“ āϚāĻžāχāϞ⧇ āφāĻŽāĻŋ āĻāϰ āĻāĻ•āϟāĻž āϭ⧟āĻ‚āĻ•āϰ āĻŦāĻž āφāĻŦ⧇āĻ—āϘāύ āϏāĻ‚āĻ¸ā§āĻ•āϰāĻŖāĻ“ āϞāĻŋāϖ⧇ āĻĻāĻŋāϤ⧇ āĻĒāĻžāϰāĻŋāĨ¤ āϕ⧋āύāϟāĻž āϚāĻžāĻ“? 😄

verbal nimbus
#

Brokk? I'll google it

#

I found it, but I don't see GLM 4.6 on the leaderboard: https://brokk.ai/power-ranking

Brokk

Comprehensive AI model benchmarks and performance rankings comparing different LLMs on real-world coding tasks. See which AI coding agents perform best across cost, speed, and accuracy metrics.

#

What's the difference?

robust yoke
# lusty vortex āϰāĻžāϤ āϤāĻ–āύ āĻĒā§āϰāĻžāϝāĻŧ āĻŦāĻžāϰ⧋āϟāĻžāĨ¤ āĻļāĻšāϰ⧇āϰ āĻļ⧇āώ āĻŸā§āϰ⧇āύāϟāĻž āĻ›āĻžāĻĄāĻŧāϤ⧇ āϝāĻžāĻšā§āϛ⧇āĨ¤ āϰāĻžāĻšāĻžāϤ āĻĻ⧌āĻĄāĻŧ⧇ āĻāϏ⧇ āĻĒā§āĻ˛ā§āϝāĻžāϟāĻĢ...

ΕÎŦÎŊ ÎĩĪ€ÎšĪ‡ÎĩÎšĪÎŋĪĪƒÎąĪ„Îĩ ÎŊÎą δΡÎŧΚÎŋĪ…ĪÎŗÎŽĪƒÎĩĪ„Îĩ ÎŧΚι ΀΁Îŋ΄΁ÎŋĪ€ÎŽ, Î´Ī…ĪƒĪ„Ī…Ī‡ĪŽĪ‚, ÎąĪ…Ī„ĪŒ δÎĩÎŊ ÎĩίÎŊιΚ Ī„Îŋ ÎēÎąĪ„ÎŦÎģÎģΡÎģÎŋ ÎēÎąÎŊÎŦÎģΚ ÎŗÎšÎą Ī„ÎŋÎŊ ΃ÎēÎŋĪ€ĪŒ ÎąĪ…Ī„ĪŒ.

verbal nimbus
#

Their graph is a bit crowded, but I found it 😆

#

GPT-5-Mini seems surprisingly high

#

(even higher than GPT-5, lol)

#

The cost graph is good

polar niche
#

Where is an updated lb?

timber spire
#

GM, anyone know which app to use , to recreate existing videos with your avatar ? Like video cloning but with your reference image. thx

robust yoke
polar niche
#

Link?

verbal nimbus
#

I thought METR is for text embedding models

#

Oh that's MTEB

#

Power ranking

polar niche
verbal nimbus
#

Not sure what tasks they tested it on, but it might be a bit competition-programming (CP) heavy

#

Since o4-mini is scoring higher than Sonnet

#

Also it's scoring higher than o3 for some reason

#

That happens on LiveCodeBench, but I thought that was due to CP-style problems

polar niche
#

Glm-4.6 pretty good

verbal nimbus
#

Quite interesting, because on one Gemini models are on the Pareto cost frontier, but on the PowerBench it isn't 🤔

polar niche
#

@echo aurora

robust yoke
#

āϝāĻĻāĻŋ āφāĻĒāύāĻŋ āĻāĻ–āĻžāύ⧇ āĻāĻ•āϟāĻŋ āĻ­āĻŋāĻĄāĻŋāĻ“ āϤ⧈āϰāĻŋ āĻ•āϰāĻžāϰ āĻšā§‡āĻˇā§āϟāĻž āĻ•āϰ⧇ āĻĨāĻžāϕ⧇āύ, āϤāĻžāĻšāϞ⧇ āĻĻ⧁āĻ°ā§āĻ­āĻžāĻ—ā§āϝāĻŦāĻļāϤ, āĻāϟāĻŋ āϏ⧇āϟāĻŋāϰ āϜāĻ¨ā§āϝ āωāĻĒāϝ⧁āĻ•ā§āϤ āĻ¸ā§āĻĨāĻžāύ āύāϝāĻŧāĨ¤
Translation: If you were trying to create a video here, then unfortunately, this isn't the place for that.

polar niche
#

Robot?

verbal nimbus
echo aurora
#

@lusty vortex If you're trying to use the video bot please review the information in #1397655624103493813

verbal nimbus
#

GLM 4.6 seems to be drop 7 places after style control for some reason

robust yoke
#

āĻ­āĻŋāĻĄāĻŋāĻ“ āϤ⧈āϰāĻŋ āĻ•āϰāϤ⧇ āφāĻĒāύāĻžāϕ⧇ #video-arena-1-āĻ āϝ⧇āϤ⧇ āĻšāĻŦ⧇āĨ¤
Translation: You'll wanna go to #video-arena-1 to create videos.

verbal nimbus
polar niche
#

Prompt to get exactly version number the model is?

#

And knowledge cutoff

robust yoke
verbal nimbus
#

I wish there were more with multiple languages

robust yoke
#

Models will usually tell you that they're a previous version of themselves due to the knowledge cutoff.

verbal nimbus
#

LMArena could help with that if they could make it easier to run code on the website

verbal nimbus
#

In a a way such that even a layman can run C code, like Web Dev Arena

robust yoke
# polar niche Seems counter productive

Like, for instance, if you ask GPT-5 what version of GPT it is, then it will respond that it's using GPT-4, even though that's not the case. That is due to its knowledge cutoff being somewhere in January 2025.

verbal nimbus
polar niche
#

Why can't they have updated knowledge?

#

Why is there a cutoff?

robust yoke
polar niche
#

I feel like nobody asked that question before

robust yoke
#

Then again, I don't see how that could possibly be an issue, considering these types of companies tend to make billions upon billions of dollars every single day.

verbal nimbus
#

Well if there was a Copilot-like plugin (LMArena had one for code completion), then you wouldn't need to create the benchmark yourself. It'll be real codebases from the real world.

robust yoke
#

Well, no crap.

#

Because it's on Twitter, so of course it's going to be updated every single day.

verbal nimbus
#

I'm sure companies like Copilot, Windsurf and Cursor have a ton of internal stats

polar niche
#

The later the model the later the knowledge cutoff.

verbal nimbus
#

I think votes are trustable as long as we know it was executed

robust yoke
polar niche
#

Thought so yeah, then again why wouldn't they update it to actually be latest

verbal nimbus
#

Also there are basic telemetry that can be reported such as number of errors from the linter, etc.

robust yoke
verbal nimbus
#

Well, not really, you can assume that the voters are voting correctly if they're executing the code.

polar niche
verbal nimbus
#

You can get basic metrics like number of compile and runtime errors per language, to start with.

polar niche
#

That doesn't charge $200 a month to use AI lol.

robust yoke
#

They actually care about the customer experience, unlike other AI companies that are dishonest.

polar niche
#

I would actually support this site rather than them

robust yoke
#

True.

#

And we already are, in technical terms.

#

By voting for which model we think is best.

polar niche
#

Yeah I guess data is worth a lot

robust yoke
verbal nimbus
#

Companies spend billions on market research 😛

robust yoke
#

Rather than simulated ones.

#

Makes me wonder if I should create a real-time updating bar graph showing the preference of different AI models on different benchmarks.

#

Whenever they get a new score, they smoothly update by moving up or down.

robust yoke
#

And whenever a new model gets introduced, a new space is created for that model, and it is assigned a color.

candid bison
#

Hey guys their used to be an icon on the left panel which allowed me to go to a page which showed all the content I had created here. Its no longer there. How can i access the page now please!?

robust yoke
polar niche
#

Like a live score website

robust yoke
#

Yeah.

#

That way, you never have to refresh the page.

candid bison
#

SO all the photo to video content I had preformed. It would all come up in a window in a row.

verbal nimbus
#

It's possible ig, but they're using MLE to fit the Bradley-Terry model which is slightly different from chess ratings systems since its basically trying to solve a global optimization problem on the entire dataset rather than updating turn-by-turn, so might be a bit inefficient unless an online ELO system could approximate it

polar niche
#

That way lmarena doesn't have to update the leaderboard manually.

candid bison
#

Like one place to see all the videos one has created.

polar niche
candid bison
#

It looks like that however it is only your works and no one esles!! You can scroll back and redownload all the content you have created

candid bison
#

See on how the left panel now there is an icon that looks like a bin with the number 4 on it? It was above that same icon!

robust yoke
candid bison
#

I dont know where it went 🙁 I cant access all the previous videos i did.

candid bison
#

Is there a way to see the history of photo to videos one has created?

polar niche
candid bison
#

Here icon on left with red 4. There was one that used to be above that exactly the same and I was able to access the history of content created

candid bison
polar niche
#

@tiny palm

#

Click and send a message

candid bison
polar niche
#

No problem!

atomic stream
#

What's the best AI for coding which is not claude and free?

robust yoke
polar niche
atomic stream
#

Z ai?

polar niche
#

Yes

atomic stream
#

Okay, thank you, i will try deepseek and z again as i am using qwen now.

#

Another qus, Is lmarena working fine now?

robust yoke
#

I'm using it right now and it's working fine.

polar niche
#

Site's broken on my mobile device

#

Other than that yes

robust yoke
atomic stream
#

GLM-4.6 without search and thinking, good or bad?

polar niche
#

Always use thinking when possible.

#

But decent yes.

hollow imp
#

GUYS

#

WHAT

atomic ember
#

Hello

robust yoke
robust yoke
hollow imp
robust yoke
#

It's able to mimic different shows very realistically.

hollow imp
#

Openai cooked with sora

#

But didn't cook with gpt 5 pro 😡

robust yoke
#

True.

#

Very rarely do they cook up something good.

polar venture
#

/video prompt make the city sink under water

hollow imp
polar venture
#

for a short movie about atlantis

polar niche
#

Deepseek has a knowledge cutoff of 2024 january

#

It's unusuable for my projects

#

Why so outdated knowledge?

robust yoke
#

Well, if you wanna use the most recent knowledge, just enable its searching mode.

burnt cypress
#

anyone who wants 150 images of nano banana a month?

robust yoke
#

No need, considering Dreamina already offers a free, rate-limit-free version of nano-banana.

burnt cypress
#

what about seedream 4

robust yoke
#

They've got that too.

#

And that's free too.

polar niche
#

Darkness, which model is best for cryptography would you say?

polar niche
#

Should I run a local llm that's designed for it

#

Train my own data

robust yoke
#

If you'd like.

odd kindle
#

Seedream 4-2k, please return 😭

polar niche
#

Which model is best to use for this? Code Llama?

robust yoke
#

You can always just use Dreamina to use Seedream 4 2k.

#

It's free on there.

#

And there's more customizeability.

timber sandal
robust yoke
#

And surprise, surprise, it can't be used.

timber sandal
#

I think the link is broken

whole sundial
#

we can't hack into your computer

#

localhost isn't going to work

timber sandal
#

I'm not hacking you guys

robust yoke
#

Because a localhost network requires hosting one directly from the user's terminal.

#

In order to actually publish your game, you'd need to buy a public domain.

robust yoke
brave cloak
#

awwww

#

I need an vpn?

robust yoke
#

Depends.

#

Which country?

brave cloak
#

'Coming Up Soon'

#

Latvia

robust yoke
#

Ah.

#

Likely then.

#

Use a VPN for the U.S.

brave cloak
#

I remember this website months ago, couldn't get in

robust yoke
#

It's available here.

barren prairie
#

GLM4.6 Air and V ?? Some models with s đŸ˜ļ

prime mulch
hollow imp
prime mulch
#

Yes ik

hollow imp
#

Did you see what I sent

prime mulch
prime mulch
#

I think sora is just edit

#

Wait a minute

hollow imp
#

Bro you cannot do video to video in sora 2

prime mulch
#

Does it have editing capabilities?

prime mulch
hollow imp
#

Yes

prime mulch
hollow imp
#

đŸ”Ĩ

#

@robust yoke do you realise aswell

prime mulch
#

Editors gonna lost their job if sora release the editing capabilities

robust yoke
hollow imp
# robust yoke Realize what?

That is exactly as the real clip in the anime, sora made it exactly 1 to 1 as the anime + with that badass edit, sora doesn't have access to anime clips ofc

prime mulch
# robust yoke Realize what?

Bro sora don't have editing capabilities but It can recreate scene from anime what if it got editing capabilities

hollow imp
#

Focus on the anime clip instead of the edit for a sec

#

The recreation is soooo sooooo

prime mulch
robust yoke
hollow imp
#

It's much more complex than some game

prime mulch
#

Wait a min what if people give models to ability to view video analyst the video content we are totally done

hollow imp
prime mulch
hollow imp
#

This is from the turn back the pendulum arc but which ep

prime mulch
hollow imp
#

Bruh

#

You don't know that?

#

Fake bleach fan

#

💔

hollow imp
robust yoke
hollow imp
#

The man is urahara

prime mulch
#

This is the real clip

prime mulch
#

From bleach episode 207

polar niche
prime mulch
robust yoke
#

You can do both.

robust yoke
#

It's pretty crazy.

polar niche
robust yoke
prime mulch
polar niche
#

Oh nevermind yeah!

hollow imp
robust yoke
polar niche
#

Does deepseek have a memory option?

hollow imp
prime mulch
violet orchid
#

Greetings

robust yoke
prime mulch
robust yoke
hollow imp
#

@prime mulch wait a sec don't go offline

prime mulch
#

Tell me

#

I'm here

robust yoke
dry siren
#

Is anyone else having issues with LMarena today? Every few turns I get an error message, I refresh the page and it asks me to verify I’m human. This happens to me but usually it’s on occasion and it only asks me to verify every other week or so.

But today it’s asked me to verify about ten times now. Ping me if you’re also having this problem

robust yoke
#

Sometimes, it'll ask more frequently, and other times not.

#

Unsure as to why.

prime mulch
prime mulch
robust yoke
prime mulch
hollow imp
#

I'm downloading episode 207 dvd iso

prime mulch
#

Or use telegram to download

hollow imp
#

I always watch blu ray

#

The highest quality and the bestest audio

prime mulch
hollow imp
prime mulch
hollow imp
#

@prime mulch what is the anime you're currently thinking about watching

prime mulch
#

But i only got 1080p links

hollow imp
#

Let me give you a blu ray release of that if it's available

prime mulch
hollow imp
#

You don't have pc/tv to watch now?

prime mulch
hollow imp
#

Did you watch Naruto?

violet orchid
prime mulch
#

Then i start read stories in manga

hollow imp
prime mulch
hollow imp
#

You really watched 500 anime?

#

Or dropped 450 anime

prime mulch
hollow imp
#

I know

robust yoke
#

Dal momento che tutti i moderatori sono offline, posso esprimermi in un italiano fluente.

hollow imp
#

My Myanimelist shows that I've watched 1000+ animes

prime mulch
hollow imp
#

@prime mulch

hollow imp
prime mulch
prime mulch
hollow imp
#

Do you know hindi

prime mulch
hollow imp
#

Are you jee aspirant

robust yoke
#

What about Indonesian?

prime mulch
hollow imp
#

Be

#

What college?

prime mulch
hollow imp
#

Do you plan on giving gate

prime mulch
#

I don't fail in any subject still i don't have job

prime mulch
hollow imp
#

Why

#

@prime mulch thampi

prime mulch
#

I believe in pg not give high salary

prime mulch
hollow imp
#

@prime mulch Thampi I'm only 16 years old

hollow imp
#

Anna

#

Annan

prime mulch
hollow imp
#

Anyways, coming back to the 500 anime you've watched, is code geass, monster, banana fish, gurren lagann in there?

ruby river
#

Doorbell camera POV — ultra-wide fish-eye lens typical of a Ring or Nest smart camera. The image should have subtle barrel distortion and mild digital grain, resembling real home surveillance footage. A quiet suburban porch at night is softly lit by warm overhead lighting and the flicker of jack-o’-lanterns with carved grins. A large wooden bowl overflowing with Halloween candy sits invitingly on a small table near the doormat. The setting looks calm — but a tall, shadowy figure dressed as a ghostly scarecrow (stitched burlap mask, glowing orange eyes, long straw-like fingers) lurks just off-center, half in the shadows near a column. The camera catches faint motion blur from the scarecrow’s head turning slightly toward the lens, as if it’s aware of being watched. Optional faint timestamp overlay in the top corner: “10/31/2025 — 08:42 PM.”

Visual Tone:

Nighttime ambient lighting

Cool blue-black shadows contrast with orange pumpkin glow

Slight chromatic aberration from lens edges

Realistic doorbell-cam framing and vignette

Atmosphere: tense but still, cinematic realism

prime mulch
hollow imp
#

@prime mulch You can't be serious you missed out on all the good anime

hollow imp
prime mulch
hollow imp
#

What happens in the ending

prime mulch
hollow imp
#

Actually he doesn't

#

He has his father's code. He pretends to be dead

prime mulch
#

He have the power to manipulate

hollow imp
#

He didn't die in the end

#

And the movies and other stuff are not canon

prime mulch
hollow imp
prime mulch
sturdy mica
#

its out guys

robust yoke
#

W.

leaden sun
robust yoke
#

True.

#

It ain't even showing up for me yet.

sturdy mica
#

damn this model is good at coding

#

they were not lying

leaden sun
sturdy mica
#

def yes

#

yes 10000x

hollow imp
#

It's not showing up for me bro

#

How is it showing up for you

#

Did u use U.S vpn?

sturdy mica
#

it probably just came up

#

no im in canada without vpn

#

or........

#

i lied and made it up

#

with inspect element

#

you guys got baited

robust yoke
#

Rage bait of the century.

hollow imp
#

@robust yoke @leaden sun 😭

leaden sun
sturdy mica
#

u guys wanna see my system prompt for coding claude

#

well not the prompt

#

the results of the prompt

dry phoenix
#

āφāĻŽāĻžāϟ āĻ­āĻŋāĻĄāĻŋāĻ“ āϖ⧁āρāĻœā§‡ āĻĒāĻžāĻšā§āĻ›āĻŋ āύāĻž

sturdy mica
#

@robust yoke

#

remember this

robust yoke
#

What about it?

sturdy mica
#

nothing

robust yoke
#

Well, it must've meant something if you pinged me just to see it.

sturdy mica
#

yeah i was just curious if you remembered that

#

also dont use the comet browser

robust yoke
#

I don't see what significance that message holds to you.

sturdy mica
#

it has a security vulnerability dont use comet

#

it has a few actually

robust yoke
#

I've since gotten rid of it.

sturdy mica
#

ok

#

is it better to use claude 4.5 thinking with perplexity pro or lmarena

#

a perk with perplexity is that the model can search online

sturdy mica
#

okay

#

how is life

robust yoke
#

Good, and yours?

sturdy mica
#

ok

robust yoke
#

As in it's "okay", or is that simply a response?

sturdy mica
#

its okay

#

sorry

flint sandal
sturdy mica
robust yoke
robust yoke
sturdy mica
#

thanks

robust yoke
#

My pleasure.

glass junco
#

hello

robust yoke
#

Greetings.

violet orchid
#

In here

#

Soooo

#

No Gemini 3 yet?

robust yoke
#

Al momento non si hanno ancora notizie, purtroppo.

prime mulch
#

Guys

robust yoke
#

Hm?

brisk turret
#

I feel like there's a bunch of models missing like qwen flash and the newer gemini flash lite version

prime mulch
fast zenith
#

What does "hard prompts (english)" mean?

robust yoke
fiery gull
#

but Idk what is a HARD prompt 🙃

robust yoke
fast zenith
#

like some complex scientific problems etc.?

robust yoke
#

Sure.

flint sandal
#

Is there any other way to access gemini 3 than A/B test?

brisk turret
#

Hard, it's the opposite of easy

#

Gemini 3 isn't in the a b testing

flint sandal
#

People call it gemini 3 pro

brisk turret
#

I think grok4 is a beast for its price

#

Grok4 fast

robust yoke
#

I agree.

brisk turret
#

But gemini 3 is gonna be a nuclear bomb

#

Damn I'm dying to see it

robust yoke
#

So am I.

brisk turret
#

Gimme those gamma rays

robust yoke
#

I'm gonna test it and see if it can code up a game.

brisk turret
#

Sonnet 4.5 can do pretty well

#

There isn't gonna be an opus 4.5 probably heh

robust yoke
#

That's quite true.

flint sandal
brisk turret
#

Opus is the server melter

robust yoke
flint sandal
brisk turret
#

50% of new games will be ai. Genie

fiery gull
brisk turret
#

The way we make games now is so inefficient

#

Gta6 costing over a billion

#

Silly stuff

#

It's a bunch of pixels

#

Stuck to a physics engine

#

With some voice acting thrown in

#

Ai will humble our notions of creativity

#

Ai slop will be more creative than peak human creativity

prime mulch
fiery gull
#

I hope this turns out really well

prime mulch
languid crescent
#

Is lmarena down again?

robust yoke
fiery gull
languid crescent
#

damng

#

@robust yoke same issue again lol

brisk turret
prime mulch
robust yoke
prime mulch
languid crescent
robust yoke
brisk turret
#

But yes it was overhyped

#

Gemini 3 is the one

#

Don't doubt it

#

Don't let Sama's failure make you doubt Demis

prime mulch
barren prairie
languid crescent
#

uhhh

#

wutdahelly

robust yoke
#

Hm?

languid crescent
#

it worked when i turned on my vpn

paper spoke
#

What did

robust yoke
#

Majiq.

robust yoke
languid crescent
#

maybe a prob on my router or sum?

robust yoke
paper spoke
#

Oh true

languid crescent
#

i tried it on another device without vpn, still the same prob

#

but oh well ig this is the fix for it

robust yoke
languid crescent
#

UHM

knotty fable
#

I'm constantly on VPN, never seen any problem.

languid crescent
#

@robust yoke

#

UHHM

#

LOL

robust yoke
#

Eh?

polar niche
#

Which AI is the best for custom LLm building?

languid crescent
#

idk wtf is going on

brisk turret
#

Petition for qwen3 flash

languid crescent
#

IT NOW WORKED WHEN I DISABLED VPN

#

and refreshed it

#

now its even faster

#

????

robust yoke
#

Er...

languid crescent
#

XDXXDDXXDXD

polar niche
#

Curently trying ollama 13B

robust yoke
#

-mild confusion-

burnt cypress
#

has anyone here fine tuned a model?

robust yoke
polar niche
languid crescent
#

also, what's the best ai model out there for coding? grok? claude? gpt?

burnt cypress
fiery gull
burnt cypress
#

and what are you tuning it for

robust yoke
flint sandal
#

im now comparing the secret a/b gemini 3 checkpoint in ai studio to gpt-5 codex high, and i think gemini 3 is waaay better than gpt-5 in landing pages but im still dissapointed from gemini 3 result.

polar niche
languid crescent
#

trying ya'll suggestion for my next project

fiery gull
robust yoke
flint sandal
robust yoke
split ibex
#

hello

polar niche
languid crescent
polar niche
#

Is it just random in text?

flint sandal
flint sandal
#

lets do a blind test,i will send two sites and yall will tel which is better and then i will tell. prompt was: "Code visually appealing landing page for cpu manufacturers, no gradients. In one html document. It should be $10K design."

#

Zero-Shot

#

no iterations

#

The AION one is A, the other one is B

robust yoke
#

Index 9 immediately.

flint sandal
robust yoke
#

No, to Gemini 3, apparently.

polar niche
#

Not available to select

flint sandal
#

just tell

#

ig

polar niche
flint sandal
#

The Index 9 was 2HT which apparently is Gemini 3.0 Flash, and Index 10 was GPT-5 Codex high.

#

and the gpt-5 didnt listen to me and it did add gradients

#

i think flash won here

polar niche
#

How did you get gemini 3 in compare mode?

flint sandal
#

also in gpt-5 there is too many things that are unnecesary and colors are not great like white text on light green buttons and icons dont work in gpt-5 response.

flint sandal
flint sandal
#

this

#

2hT in console means flash and X38 or 28 means Pro

#

you will get rate limited to next day until you will get a/b test so it isnt practical

bitter plinth
#

Hello.

robust yoke
#

Greetings.

flint sandal
bitter plinth
#

I new here

polar niche
#

Hello and welcome!

bitter plinth
#

I am new here

robust yoke
bitter plinth
robust yoke
#

We hope you have fun here.

bitter plinth
polar niche
robust yoke
#

Sure.

fiery gull
polar niche
fiery gull
#

for agentics use the gpt oss 20b still the better

polar niche
#

No for my crypto project

fiery gull
polar niche
#

32 GB ram

#

Is DeepSeek down?

fiery gull
robust yoke
#

Lemme check.

polar niche
#

Messages not going trough

robust yoke
#

Seems like it's doing good.

fiery gull
fiery gull
polar niche
noble adder
#

My name is Ammar and I'm a beginner on the Discord app and I want to learn artificial intelligence with you.

spark roost
#

hi everyone

spark roost
#

Why dont we intergrate

robust yoke
#

Greetings.

spark roost
#

LLMs reasoning with video

noble adder
robust yoke
#

That's already an integration.

spark roost
#

oh it is?

robust yoke
fiery gull
noble adder
#

Can someone help me learn artificial intelligence?

spark roost
#

WHta

#

What

#

Do you need help with?

polar niche
#

Should I use Llama-4-Scout-17B-16E-Instruct

fiery gull
fiery gull
polar niche
#

No?

fiery gull
noble adder
fiery gull
#

or qwen3 30b

noble adder
spark roost
#

You should learn some basic photoshop

hollow ivy
#

Does anyone know, when Claude-4.5-Opus-Thinking will come out?
(should be the top coding model)

spark roost
#

I think

#

dont know

fiery gull
spark roost
#

GLM 4.5 Air is a gem because it fits perfectly on 128gb of system ram

fiery gull
spark roost
#

bye everyone

noble adder
fiery gull
noble adder
#

Does anyone here speak Arabic?

fiery gull
#

@sharp mirage

sharp mirage
#

.

#

?

#

Hi

robust yoke
#

Just use my translator.

sharp mirage
#

I cant speak Arabic cuz its broke the rules

noble adder
fiery gull
fiery gull
sharp mirage
#

;/

fiery gull
sharp mirage
#

Hi :)

fiery gull
#

but in this server we just can english okay?

sharp mirage
polar niche
#

Lmarena should add gpt-5-pro

#

Here

sharp mirage
#

😭

fiery gull
#

I think he is resident of Egypt

polar niche
#

Why isn't deepseek working for me

fiery gull
noble adder
#

Unfortunately, everything here is in English and I couldn't find anything to translate here.

fiery gull
polar niche
polar niche
#

Don't want that

fiery gull
#

I think it's just simple anti-bot error

violet orchid
#

Facts

prime mulch
#

What his happening here