#general | Arena | Page 154

runic mulch Oct 25, 2025, 12:30 AM

#

Let's see the agentic capabilities

#

On tool calling

#

Without that it's useless

golden ocean Oct 25, 2025, 12:30 AM

#

https://cdn.discordapp.com/attachments/910347347601543196/1160742997156188181/togif-6.gif

quartz pike Oct 25, 2025, 12:30 AM

#

golden ocean https://cdn.discordapp.com/attachments/910347347601543196/1160742997156188181/to...

https://tenor.com/view/joe-bart-joe-bartolozzi-dance-funky-gif-15654574488447184285


robust yoke Oct 25, 2025, 12:31 AM

#

https://tenor.com/view/stan-twt-stantwitter-stan-twitter-tenorelicario-relicario-gifs-gif-298982740560034732


quartz pike Oct 25, 2025, 12:32 AM

#

https://tenor.com/view/joe-bartolozzi-gif-16501825566380358966


#

world's most aggressive griddy

#

on moonshoes

robust yoke Oct 25, 2025, 12:33 AM

#

https://tenor.com/view/drachenlord-sport-teddy-bärchen-gif-25967747


quartz pike Oct 25, 2025, 12:34 AM

#

OOf it failed at making a clock

#

with 3 errors

#

but so did kimi k2

echo sinew Oct 25, 2025, 12:34 AM

#

Hey guys! Let's not disrupt the chat with GIF spam

robust yoke Oct 25, 2025, 12:35 AM

#

echo sinew Hey guys! Let's not disrupt the chat with GIF spam

https://tenor.com/view/cat-salut-salute-بسبس-تحيه-gif-8859282567348257812


quartz pike Oct 25, 2025, 12:37 AM

#

i take everything back

#

it sucks.

echo aurora Oct 25, 2025, 12:37 AM

#

quartz pike with 3 errors

Yeah I've been flagging these errors to the team, it's not super stable at the moment.

quartz pike Oct 25, 2025, 12:37 AM

#

3 errors.

#

easily beaten by 4.5

quartz pike Oct 25, 2025, 12:37 AM

#

echo aurora Yeah I've been flagging these errors to the team, it's not super stable at the m...

oh.

#

this was the minimax result

#

this the sonnet https://3000-it6ku80oodtwxyjakawzu-6532622b.e2b-foxtrot.dev

#

aka sonnet 4.5 thinking 32k

quartz pike Oct 25, 2025, 12:39 AM

#

echo aurora Yeah I've been flagging these errors to the team, it's not super stable at the m...

btw could yall perhaps add a side by side option like yall did in lmarena so it's easier to compare models?

echo aurora Oct 25, 2025, 12:40 AM

#

quartz pike btw could yall perhaps add a side by side option like yall did in lmarena so its...

It's very much being worked on, check out https://canary.lmarena.ai/ ablobwink

quartz pike Oct 25, 2025, 12:40 AM

#

echo aurora It's very much being worked on, check out https://canary.lmarena.ai/ <a:ablobwin...

oo okay

#

what makes this different from normal lmarena?

robust yoke Oct 25, 2025, 12:40 AM

#

Early features.

quartz pike Oct 25, 2025, 12:40 AM

#

ohhhh

#

okay

robust yoke Oct 25, 2025, 12:41 AM

#

It's easy to tell because back when the site for the regular version had the old layout, the canary version had the new layout.

hollow ivy Oct 25, 2025, 12:42 AM

#

robust yoke It's easy to tell because back when the site for the regular version had the old...

which model would you recommend for C++ coding (offline) ?

#

If the goal was to create a 2D game.

quartz pike Oct 25, 2025, 12:43 AM

#

this canary thing is so goated

robust yoke Oct 25, 2025, 12:43 AM

#

hollow ivy which model would you recommend for C++ coding (offline) ?

It's a bit hard to determine considering I have never used any of the models for C++ coding, only HTML coding.

#

And also with React coding as well.

#

However, if I were forced to give an option, I would likely have to choose either Claude or DeepSeek.

hollow ivy Oct 25, 2025, 12:44 AM

#

Deepseek? really?
That's an interesting decision.

#

and Claude-4.5-thinking?

#

i thought DS was below the top league of LLMs (in serious coding)

robust yoke Oct 25, 2025, 12:46 AM

#

DeepSeek because it takes a long time to think through a decision, contradicting itself in the process, which actually helps it improve its thinking and provide better outputs; and Claude because it already codes very well naturally and mixes some creativity into its work.

hollow ivy Oct 25, 2025, 12:46 AM

#

which deepseek version and which claude version?

#

deepseek R1?

robust yoke Oct 25, 2025, 12:47 AM

#

hollow ivy and Claude-4.5-thinking?

Claude, I believe, works very well too, considering it's able to communicate naturally and code very well, as well as being a very creative model in its own ways.

quartz pike Oct 25, 2025, 12:47 AM

#

i alr found an issue with the canary thing

robust yoke Oct 25, 2025, 12:47 AM

#

No doubt about it that the thinking model would also work very well.

polar niche Oct 25, 2025, 12:47 AM

#

Hello!

robust yoke Oct 25, 2025, 12:47 AM

#

polar niche Hello!

Greetings.

robust yoke Oct 25, 2025, 12:47 AM

#

hollow ivy deepseek R1?

No, DeepSeek v3.2 Experimental Thinking.

hollow ivy Oct 25, 2025, 12:48 AM

#

robust yoke No, DeepSeek v3.2 Experimental Thinking.

what about GPT5-high, with good prompting?

quartz pike Oct 25, 2025, 12:48 AM

#

yall

robust yoke Oct 25, 2025, 12:48 AM

#

hollow ivy what about GPT5-high, with good prompting?

GPT in itself also seems pretty promising given the outputs it has given me, in some cases in WebDev. If you want to try GPT, then you can, but ultimately I recommend DeepSeek and Claude.

quartz pike Oct 25, 2025, 12:48 AM

#

tell me when minimax is actually stable

#

so i can run a test on it

balmy mist Oct 25, 2025, 12:49 AM

#

https://x.com/Angaisb_/status/1981869708480516423

Angel 🍂 (@Angaisb_)

So GPT-5.1 after GPT-5?

It'd make sense

quartz pike Oct 25, 2025, 12:49 AM

#

aka what i like to call the "Minecraft one-shot test" where i js use this prompt: "make me a three.js minecraft clone with working terrian. first person movement collisions side collisions xyz collisions terrain. that isint hilly as hell. but also isint hill downhil hill downhill hill downhill repeat and repeat and with block breaking and placing."

normal peak Oct 25, 2025, 12:49 AM

#

Why is Gemini 2.5 still at the top of LLMs in lmarena?

hollow ivy Oct 25, 2025, 12:50 AM

#

quartz pike aka what i like to call the "Minecraft one-shot test" where i js use this prompt...

ask it to write that in C++

#

could be an interesting test

quartz pike Oct 25, 2025, 12:51 AM

#

hollow ivy ask it to write that in C++

idk how to compile c++ lol

#

i only know how to do c# via unity.

#

and three.js via html

hollow ivy Oct 25, 2025, 12:51 AM

#

quartz pike idk how to compile c++ lol

the AI can explain it

#

it (the AI) will probably propose SFML (for 2D games)

normal peak Oct 25, 2025, 12:51 AM

#

Why is mistral dumb af

hollow ivy Oct 25, 2025, 12:52 AM

#

just ask it, what is the best engine for 3D in C++

hollow ivy Oct 25, 2025, 12:52 AM

#

normal peak Why is mistral dumb af

because that lab doesn't have the resources of the Big 5

#

(Deepmind, Anthropic, OpenAI, xAI, Meta)

stray aspen Oct 25, 2025, 12:53 AM

#

How's mini max 2

normal peak Oct 25, 2025, 12:53 AM

#

hollow ivy because that lab doesn't have the resources of the Big 5

Hmm interesting

stray aspen Oct 25, 2025, 12:53 AM

#

Is it any good

robust yoke Oct 25, 2025, 12:53 AM

#

Testing that right now.

quartz pike Oct 25, 2025, 12:53 AM

#

stray aspen Is it any good

pretty damn good

#

beats 4.1 opus for me

stray aspen Oct 25, 2025, 12:53 AM

#

That's crazy

normal peak Oct 25, 2025, 12:53 AM

#

hollow ivy (Deepmind, Anthropic, OpenAI, xAI, Meta)

But all those 5 are dumb too tbh

hollow ivy Oct 25, 2025, 12:53 AM

#

normal peak But all those 5 are dumb too tbh

why that?
Deepmind isnt "dumb"

quartz pike Oct 25, 2025, 12:53 AM

#

doesnt mistral just make small ai's that can run on ur pc or small servers?

#

or single h100 gpu's?

normal peak Oct 25, 2025, 12:54 AM

#

hollow ivy why that? Deepmind isnt "dumb"

Gemini can't speak my native language

hollow ivy Oct 25, 2025, 12:54 AM

#

normal peak Gemini can't speak my native language

which is..?

robust yoke Oct 25, 2025, 12:54 AM

#

What's your native language?

quartz pike Oct 25, 2025, 12:54 AM

#

what's your native language?

normal peak Oct 25, 2025, 12:54 AM

#

All llms can't speak it

normal peak Oct 25, 2025, 12:54 AM

#

hollow ivy which is..?

Tamazight

hollow ivy Oct 25, 2025, 12:54 AM

#

sanskrit?

#

ah

#

wow

#

amazigh?

#

(that was a selectable nation in freeciv game)

normal peak Oct 25, 2025, 12:55 AM

#

The original north African language before Arabs colonized us

hollow ivy Oct 25, 2025, 12:55 AM

#

cool!

stray aspen Oct 25, 2025, 12:55 AM

#

What's tamazight

hollow ivy Oct 25, 2025, 12:55 AM

#

interesting.
our (earth's) history is rich

normal peak Oct 25, 2025, 12:56 AM

#

Yeah . Whats ur name. Im gonna write it in my language

hollow ivy Oct 25, 2025, 12:56 AM

#

me?
paws

normal peak Oct 25, 2025, 12:56 AM

#

We don't have the letter P hhh

#

So im gonna write baws

hollow ivy Oct 25, 2025, 12:57 AM

#

:)

#

oh nice

#

looks very.. exotic

#

and unique

robust yoke Oct 25, 2025, 12:57 AM

#

An interesting rule.

normal peak Oct 25, 2025, 12:57 AM

#

Darkness = ⴷⴰⵔⴽⵏⵉⵙⵙ

#

There was one model that can speak my language

#

It was sonnet 3.5

hollow ivy Oct 25, 2025, 12:58 AM

#

normal peak Darkness = ⴷⴰⵔⴽⵏⵉⵙⵙ

what does "Cosmos" sound like in your language, if you translate it?

normal peak Oct 25, 2025, 12:58 AM

#

It was amazing

robust yoke Oct 25, 2025, 12:59 AM

#

It seems like Claude is the only model with a creative mind, and thus, can speak native languages very well.

normal peak Oct 25, 2025, 12:59 AM

#

hollow ivy what does "Cosmos" sound like in your language, if you translate it?

ⵉⴳⵏⵡⴰⵏ

#

Ignwan

normal peak Oct 25, 2025, 12:59 AM

#

robust yoke It seems like Claude is the only model with a creative mind, and thus, can speak...

Yeah especially the old sonnet

#

They are focusing more on programming now, not on ancient languages hhh

robust yoke Oct 25, 2025, 1:00 AM

#

Well, not to worry, since Claude 4.5 seems to also be just like Claude 3.5, in that it's able to write in very natural English. Almost in a conversational tone.

#

So, I think it might be able to write in your language.

normal peak Oct 25, 2025, 1:01 AM

#

robust yoke So, I think it might be able to write in your language.

I've tested it. But the results weren't good

robust yoke Oct 25, 2025, 1:02 AM

#

normal peak I've tested it. But the results weren't good

Darn. I'm sorry to hear that.

normal peak Oct 25, 2025, 1:03 AM

#

Yeah. Im hoping Gemini 3 will be good

robust yoke Oct 25, 2025, 1:04 AM

#

It mentioned something about Tamazight being a Berber language.

normal peak Oct 25, 2025, 1:04 AM

#

Gemini 2.5 isn't that bad actually. But it makes a lot of mistakes

normal peak Oct 25, 2025, 1:05 AM

#

robust yoke It mentioned something about Tamazight being a Berber language.

Yeah berber people are amazigh people

gilded geyser Oct 25, 2025, 1:05 AM

#

Good evening from Alaska, where I'm hoping to see what's possible with AI video animating characters from my stories.

normal peak Oct 25, 2025, 1:05 AM

#

But we don't call ourselves Berber. It is a racist name from the Romans

robust yoke Oct 25, 2025, 1:05 AM

#

normal peak Yeah berber people are amazigh people

I've never really known about Amazigh, so this is all new information to me.

normal peak Oct 25, 2025, 1:06 AM

#

Where are u from ?

robust yoke Oct 25, 2025, 1:06 AM

#

Washington, and you?

normal peak Oct 25, 2025, 1:07 AM

#

Morocco, taghazout

robust yoke Oct 25, 2025, 1:07 AM

#

Ah.

#

I've known a little bit about Morocco, but I never realized people there spoke Tamazight.

normal peak Oct 25, 2025, 1:07 AM

#

I know Washington do you know taghazout ? Hhhh

robust yoke Oct 25, 2025, 1:08 AM

#

normal peak I know Washington do you know taghazout ? Hhhh

Not really, no.

normal peak Oct 25, 2025, 1:09 AM

#

robust yoke I've known a little bit about Morocco, but I never realized people there spoke T...

After independence, the country didn't teach the original language in schools. That's why a lot of people speak Darija now, which is a mix of Arabic, French, and Tamazight. Hhhh it's very complicated here hhh

normal peak Oct 25, 2025, 1:10 AM

#

robust yoke Not really, no.

There are a lot of people from Europe and the USA coming to Taghazout. It's very famous for surfing and nomads

robust yoke Oct 25, 2025, 1:10 AM

#

normal peak After independence, the country didn't teach the original language in schools ....

Very interesting.

normal peak Oct 25, 2025, 1:11 AM

#

robust yoke Very interesting.

Akhnfuch

#

No

robust yoke Oct 25, 2025, 1:11 AM

#

Ah.

normal peak Oct 25, 2025, 1:11 AM

#

Don't ask ai . Hhhh it doesn't understand tamazight

robust yoke Oct 25, 2025, 1:11 AM

#

Apparently, it's supposed to translate to "cockroach".

normal peak Oct 25, 2025, 1:14 AM

#

ⴰⴱⴰⵏⴹⵔⵉⵡ = cockroach

robust yoke Oct 25, 2025, 1:14 AM

#

Ah.

#

It gave me this: "ⴰⵖⵕⵕⵓⴹ".

stray aspen Oct 25, 2025, 1:16 AM

#

@normal peak hey bud

normal peak Oct 25, 2025, 1:16 AM

#

There are a lot of Amazigh dialects. Here in Morocco we have 3, Algeria has 3, and Libya has 2. There is also a small group of people in Egypt speaking Tamazight too, in Siwa village. You can ChatGPT it hhh

stray aspen Oct 25, 2025, 1:16 AM

#

How do you canada in your language

#

Say*

#

That's crazy

simple sleet Oct 25, 2025, 1:17 AM

#

Does anyone have GPT Pro chat? I have the Plus option and can only make videos up to 10 seconds long and 720p with Sora 2.

I want to know if GPT Pro can make videos longer than that, 1080p, and with Sora 2 Pro.

I also want to know what the daily limit is for creating videos.

stray aspen Oct 25, 2025, 1:17 AM

#

What kind of letters

#

What alphabet is that

normal peak Oct 25, 2025, 1:17 AM

#

Amazigh letters. Very ancient

robust yoke Oct 25, 2025, 1:17 AM

#

Apparently, it's in their native script.

normal peak Oct 25, 2025, 1:17 AM

#

stray aspen What alphabet is that

Tifinagh alphabet

robust yoke Oct 25, 2025, 1:18 AM

#

I find it funny that the letter "P" isn't in it.

normal peak Oct 25, 2025, 1:19 AM

#

robust yoke I find it funny that the letter "P" isn't in it.

Yeah hhhh like we don't have a single word in tamazight w the letter p

robust yoke Oct 25, 2025, 1:19 AM

#

normal peak Yeah hhhh like we don't have a single word in tamazight w the letter p

Yeah, that's very interesting.

azure sorrel Oct 25, 2025, 1:19 AM

#

i like MiniMax-M1

robust yoke Oct 25, 2025, 1:22 AM

#

Heh.

normal peak Oct 25, 2025, 1:23 AM

#

Hhhhhh

robust yoke Oct 25, 2025, 1:23 AM

#

It's almost like how the Dutch are very throaty in their language.

normal peak Oct 25, 2025, 1:23 AM

#

Yeah

#

Its too late here good night my friend

robust yoke Oct 25, 2025, 1:24 AM

#

Goodnight.

#

I hope you have wonderful dreams.

golden ocean Oct 25, 2025, 1:24 AM

#

real

normal peak Oct 25, 2025, 1:24 AM

#

Thank you

robust yoke Oct 25, 2025, 1:24 AM

#

normal peak Thank you

My pleasure.

normal peak Oct 25, 2025, 1:26 AM

#

https://youtu.be/H1YIgwPsX5Q?si=S6nozNjPPRps1XMo


Tinariwen

Tinariwen (+IO:I) - Nànnuflày

'Nànnuflày' (Fulfilled) from the album 'Elwan,' available now.
Order ELWAN here: http://found.ee/tinariwen_store

In the Sahara desert, an old Tuareg man comes back to the camp where he grew up for a party. He remembers the joys and the torments of the nomadic life he lived with a friend who has since deceased: memories from their naive child...


#

Hallucination hhh agi is still far away hhhh

ashen mauve Oct 25, 2025, 1:28 AM

#

what even is Tifinagh and where in the world is that from

robust yoke Oct 25, 2025, 1:30 AM

#

ashen mauve what even is Tifinagh and where in the world is that from

Tifinagh is a handwritten script that a certain group of Moroccans used to write in.

#

It is the script of the Tamazight language which existed before the Arabs (pretend I explained something here), and now, they teach the Moroccans a mix of French, Tamazight, and Arabic.

daring rock Oct 25, 2025, 2:04 AM

#

@reef mirage Please head to #1397655624103493813 for a detailed guide on how to use the bot

obsidian cargo Oct 25, 2025, 2:12 AM

#

you gotta set up a bot that automatically does that to any message with /video in it

daring rock Oct 25, 2025, 2:14 AM

#

@tribal whale Please head to #1397655624103493813 for a detailed guide on how to use the bot

ashen mauve Oct 25, 2025, 2:19 AM

#

robust yoke It is the script of the Tamazight language which existed before the Arabs (prete...

That is literally cool as hell, I never even knew about this language until now.

robust yoke Oct 25, 2025, 2:19 AM

#

ashen mauve That is literally cool as hell, I never even knew about this language until...

Neither have I, to be completely honest.

burnt sinew Oct 25, 2025, 2:20 AM

#

daring rock <@495317292771704832> Please head to <#1397655624103493813> for a detailed guide...

Why not set up an automod rule to do this?

daring rock Oct 25, 2025, 2:30 AM

#

@mortal plinth Please head to #1397655624103493813 for a detailed guide on how to use the bot

daring rock Oct 25, 2025, 2:34 AM

#

burnt sinew Why not set up an automod rule to do this?

Thanks for your feedback. We will pass it to our team.

fast kite Oct 25, 2025, 3:34 AM

#

Basically, I mentioned that there are problems in LMArena, the images aren't showing up accurately.

#

Here! The image is completely blurry! gpt-image-1 used to make decent art, even different versions, but now it doesn't. Could you please explain the problem that's been going on for two months?

burnt sinew Oct 25, 2025, 3:55 AM

#

daring rock Thanks for your feedback. We will pass it to our team.

My message got deleted?

burnt sinew Oct 25, 2025, 3:55 AM

#

fast kite Here! The image is completely blurry! gpt-image-1 used to make decent art, e...

It doesn't look blurry to me? Maybe click download and send raw file instead ?

fast kite Oct 25, 2025, 4:02 AM

#

burnt sinew It doesn't look blurry to me? Maybe click download and send raw file instead ?

What do you mean?

jade egret Oct 25, 2025, 4:08 AM

#

gemini 3 december? taking so long...

vivid sedge Oct 25, 2025, 4:09 AM

#

hello

empty stump Oct 25, 2025, 4:09 AM

#

why do they release in december

burnt sinew Oct 25, 2025, 4:15 AM

#

fast kite What do you mean?

The image doesn't look blurry to me

whole sundial Oct 25, 2025, 4:20 AM

#

fast kite Here! The image is completely blurry! gpt-image-1 used to make decent art, e...

you're not the first person to complain about this, i complained about this here (#1412721830682296423) right when they started doing this. It seems like LMArena has done absolutely NOTHING to fix this! But that's because they did this to save money. They changed the quality of a model to make it cheaper and the leaderboard score goes down... They should change it back to how it was AND remove ALL votes since September 3 (the date they first started doing this sneaky stuff). It's a shame few people know LMArena has been doing this, I tried to tell people but it never works.

#

When they do fix it, they should add the API quality level of the model to the name of this and GPT Image 1 mini (it's affecting that model too) so people know what quality they are getting.

polar niche Oct 25, 2025, 4:43 AM

#

Wtf did claude say

#

insanity

balmy hemlock Oct 25, 2025, 5:22 AM

#

Damn, I'm so sleepy

ancient mango Oct 25, 2025, 5:39 AM

#

Why is there no sound when generating a video?

whole sundial Oct 25, 2025, 5:47 AM

#

@echo aurora you never gave me a proper answer the last time i asked but is https://github.com/lm-sys/FastChat still being used by LMArena at all? if not, is there any other github repository that is used by LMArena that I can use instead? (I am a volunteer for a popular online wiki that has LMArena on it multiple times for various AI things and this is the GitHub link we use for LMArena)

quasi storm Oct 25, 2025, 6:10 AM

#

Guys, can anyone tell me how I can generate pics and video?

thick stirrup Oct 25, 2025, 6:16 AM

#

hi, guys everyone how are you!

whole sundial Oct 25, 2025, 6:19 AM

#

quasi storm Guys, can anyone tell me how I can generate pics and video?

read #1397655624103493813

#

for image gen it's better to use the lmarena.ai website (hit the image button in the text bar) as the 5 per day rate limit for videos on discord also applies to images when you generate them on discord

unborn raft Oct 25, 2025, 6:23 AM

#

hi there - hope to discover best video generating ai tools there...

vital spruce Oct 25, 2025, 6:58 AM

#

Okay

ember matrix Oct 25, 2025, 7:21 AM

#

hi there - hope to discover best video generating ai tools and practices

supple hearth Oct 25, 2025, 7:35 AM

#

Here to laugh at all the wild stuff being produced

magic stag Oct 25, 2025, 8:33 AM

#

The theme of the music is spending 5 seconds reading the rules

#

peepoDirty

inland quest Oct 25, 2025, 8:38 AM

#

192381273123 bots are coming

glossy umbra Oct 25, 2025, 8:39 AM

#

https://cdn.discordapp.com/attachments/1407360256446693376/1407362471131611217/caption.gif

magic stag Oct 25, 2025, 8:40 AM

#

Need ultra banana 3.0 pro to have 5 mins of novelty in my life before I start "needing" grok 5 or something

#

OkAnd

pulsar saffron Oct 25, 2025, 9:32 AM

#

vote

#

#video-arena-1 message

#

I NEED TO KNOW THE MODEL

flint zodiac Oct 25, 2025, 9:43 AM

#

#1397655624103493813 {
  "version": "1.0",
  "platform": "lm_arena",
  "task": "image_to_video",
  "referenced_image": "/mnt/data/IMG_4368.JPG",
  "settings": {
    "aspect_ratio": "9:16",
    "duration_seconds": 10,
    "fps": 24,
    "resolution": "4K",
    "format": "mp4",
    "quality": "high"
  },
  "prompt": {
    "description": "Cinematic drone shot starting high above an ancient Indian fort and smoothly zooming in toward its central courtyard. Warm golden-hour lighting with soft shadows, natural sunlight flares, and realistic HDR tone. Gentle downward tilt revealing the fort’s symmetry and red sandstone textures, with a slow, stabilized motion for an immersive feel.",
    "camera_motion": "smooth drone zoom-in, gentle downward tilt, stabilized dolly-in",
    "visual_style": "ultra-realistic, golden-hour color grading, 4K HDR, warm tones, soft vignetting"
  },
  "negative_prompt": "no people, no flicker, no distortion, no overexposure, no text or watermark"
}

quartz pike Oct 25, 2025, 10:26 AM

#

hello yall

#

is minimax on lmarena finally stable?

quartz pike Oct 25, 2025, 10:26 AM

#

flint zodiac #1397655624103493813 { "version": "1.0", "platform": "lm_arena", "task":...

go on #video-arena-1

pulsar saffron Oct 25, 2025, 10:27 AM

#

vote #video-arena-1 message

#

I'M TRYING TO KNOW THE MODEL FOR 1 HOUR NOW 😭

quartz pike Oct 25, 2025, 10:29 AM

#

yup minimax m2 is still unstable as shit

#

i keep having the same damn error.

leaden sun Oct 25, 2025, 10:41 AM

#

normal peak Morocco, taghazout

amazing! I've always been fascinated by the Amazigh! it's great to have you here! your language is especially fascinating, reminds me a little bit of the mystery of the Basque culture too ✨

dawn grove Oct 25, 2025, 10:57 AM

#

what is this model from Google?

sturdy hawk Oct 25, 2025, 10:59 AM

#

does anyone know how to use the Popcorn feature on Higgsfield to transform a video and change the face in the video?

thorn kernel Oct 25, 2025, 11:07 AM

#

Does anyone know if AI-generated videos will be monetized on YouTube?

quasi atlas Oct 25, 2025, 11:16 AM

#

Please refer to #1397655624103493813 to learn how to use the bot.

pulsar saffron Oct 25, 2025, 11:33 AM

#

dawn grove what is this model from Google?

That's the problem: no one tested this model, so we don't know if it's actually good...

#

so the answer is you'll never find out

tropic musk Oct 25, 2025, 11:34 AM

#

helllo

jolly gulch Oct 25, 2025, 11:37 AM

#

hello

verbal nimbus Oct 25, 2025, 11:42 AM

#

dawn grove what is this model from Google?

Isn't that Baidu

#

Hallucinates badly

pulsar saffron Oct 25, 2025, 11:45 AM

#

i'm surprised that no one did a distill of gemini 3

tight pelican Oct 25, 2025, 11:59 AM

#

Hello

north pawn Oct 25, 2025, 12:04 PM

#

hello

elder burrow Oct 25, 2025, 12:08 PM

#

how has nobody mentioned that minimax m2 has a 200k context window

#

m1 had 4 million

azure sorrel Oct 25, 2025, 12:09 PM

#

glossy umbra https://cdn.discordapp.com/attachments/1407360256446693376/1407362471131611217/c...

elder burrow Oct 25, 2025, 12:09 PM

#

elder burrow m1 had 4 million

going from the best context window (aside from llama 4 scout) to below an average good model's context window is kinda crazy

azure sorrel Oct 25, 2025, 12:09 PM

#

elder burrow going from the best context window (aside from llama 4 scout) to below an averag...

hi again

elder burrow Oct 25, 2025, 12:12 PM

#

elder burrow going from the best context window (aside from llama 4 scout) to below an averag...

zenith spindle Oct 25, 2025, 12:13 PM

#

hello

#

1

silent pebble Oct 25, 2025, 12:15 PM

#

Hi, trying the models and prompting here before signing up to a specific service.

hollow ivy Oct 25, 2025, 12:23 PM

#

Poll: How will future humanity handle time[zones] (TZ)?

Winning answer votes: 1 (total votes: 2)

stray aspen Oct 25, 2025, 12:34 PM

#

dawn grove what is this model from Google?

Ernie is chinese

#

So I would say no

#

Unless they are trying to deceive us

pulsar saffron Oct 25, 2025, 12:41 PM

#

stray aspen Ernie is chinese

no its google

#

it clearly says by google

#

i trust ernie

fast kite Oct 25, 2025, 12:42 PM

#

burnt sinew The image doesn't look blurry to me

That's right, but gpt-image-1 stopped showing other options. And the images turned out to be of poor quality. For example:

dapper fog Oct 25, 2025, 12:46 PM

#

Hey! Does anyone know how to repair this? I can't work with this... I get this in all models: 2-5 answers and then an error

spare cobalt Oct 25, 2025, 1:15 PM

#

Hello

hollow ivy Oct 25, 2025, 1:17 PM

#

spare cobalt Hello

https://rysana.com/

Rysana

Rysana is the AI cloud for production: fast, reliable, and clever. Make magic happen with our language model API platform. Check out our open source libraries and documentation for building better products with modern AI, and Lusat - our breakthrough reasoning engine for intent translation and on-the-fly dynamic UI generation.

#

hello

quartz pike Oct 25, 2025, 1:29 PM

#

https://www.youtube.com/watch?v=N0XpMe94ENU


Theoretically Media

Open Source AI Video BOMBSHELL From LTX & Minimax 2.3 Preview!

We just got a surprise AI video model drop! LTX Studio has officially launched LTX 2, and it's a banger! This new model boasts 4K resolution, audio generation, and, most importantly, it's going open source.

Today, we dive deep into LTX 2, going hands-on with the new API playground to test its text-to-video and image-to-video capabilities. We'll...


#

im so excited for ltx 2

#

its 4k 50 fps

hollow imp Oct 25, 2025, 1:39 PM

#

quartz pike https://www.youtube.com/watch?v=N0XpMe94ENU

GONBE GONBE!!!

quartz pike Oct 25, 2025, 1:40 PM

#

hollow imp GONBE GONBE!!!

?

obsidian cargo Oct 25, 2025, 1:53 PM

#

everywhere I look I see her face

burnt sinew Oct 25, 2025, 1:57 PM

#

obsidian cargo everywhere I look I see her face

wicked pond Oct 25, 2025, 2:13 PM

#

hello

west glacier Oct 25, 2025, 2:18 PM

#

making vieo

golden ocean Oct 25, 2025, 2:25 PM

#

west glacier making vieo

excellent

#

im proud of u

west glacier Oct 25, 2025, 2:25 PM

#

how can i generate video in here

stiff kernel Oct 25, 2025, 2:25 PM

#

west glacier making vieo

Please check #1397655624103493813

novel obsidian Oct 25, 2025, 2:26 PM

#

hello folks
i have a problem with uploading my images (jpg format)
consistently got upload failed message !!!
what should i do!!!

novel obsidian Oct 25, 2025, 2:31 PM

#

stiff kernel Please check #1397655624103493813

does not help !

pulsar saffron Oct 25, 2025, 2:32 PM

#

west glacier making vieo

oh!

fresh mirage Oct 25, 2025, 2:36 PM

#

someone tell me they’re gonna bring back lithium or orion

#

TeriStare

obsidian cargo Oct 25, 2025, 2:37 PM

#

probably not tbh

#

though I bet there'll be a gemini 3 pro preview before it releases

fresh mirage Oct 25, 2025, 2:38 PM

#

probably

#

i’m already experiencing lithium/orion withdrawals

#

it’s killing me lol

obsidian cargo Oct 25, 2025, 2:38 PM

#

I've seen it be said that lithiumflow is gemini 3 but not gemini 3 pro

fresh mirage Oct 25, 2025, 2:38 PM

#

obsidian cargo I've seen it be said that lithiumflow is gemini 3 but not gemini 3 pro

Yeah I was told they were gemini 3 flash models

obsidian cargo Oct 25, 2025, 2:38 PM

#

there was like, an X28 model or something that was labelled gemini 3 pro

obsidian cargo Oct 25, 2025, 2:39 PM

#

fresh mirage Yeah I was told they were gemini 3 flash models

it didn't say they were flash either tbh

#

maybe they'll end up being gemini 3 coding models

fresh mirage Oct 25, 2025, 2:39 PM

#

maybe, if they try something new

obsidian cargo Oct 25, 2025, 2:39 PM

#

a few times lithiumflow did worse than 2.5 flash on creative writing stuff

hollow imp Oct 25, 2025, 2:41 PM

#

obsidian cargo I've seen it be said that lithiumflow is gemini 3 but not gemini 3 pro

Are you a girl 😶

#

@fresh mirage can u see this

obsidian cargo Oct 25, 2025, 2:42 PM

#

hollow imp Are you a girl 😶

glances at [she/her] in discord name

hollow imp Oct 25, 2025, 2:42 PM

#

obsidian cargo glances at [she/her] in discord name

So you are a egirl

#

😡

obsidian cargo Oct 25, 2025, 2:42 PM

#

bruh

hollow imp Oct 25, 2025, 2:43 PM

#

egirl

#

egirl

#

Scammer in the name of girl

hollow imp Oct 25, 2025, 2:44 PM

#

hollow imp

@pulsar saffron can u see this

pulsar saffron Oct 25, 2025, 2:45 PM

#

hollow imp

shows api request

hollow imp Oct 25, 2025, 2:45 PM

#

Maybe non arena champion role members can't see this

hollow imp Oct 25, 2025, 2:45 PM

#

pulsar saffron shows api request

You can see everything right?

pulsar saffron Oct 25, 2025, 2:45 PM

#

hollow imp You can see everything right?

everyone can see it what's so special about it

grand echo Oct 25, 2025, 2:45 PM

#

hello

hollow imp Oct 25, 2025, 2:47 PM

#

#

@fresh mirage @obsidian cargo

#

You guys were talking about gemini 3 gemini 3 pro

#

I think this helps

narrow girder Oct 25, 2025, 2:52 PM

#

hey i want to make ai videos

olive mortar Oct 25, 2025, 2:59 PM

#

hollow imp

i believe they changed the codename

#

cant get it too

hollow imp Oct 25, 2025, 3:02 PM

#

olive mortar i believe they changed the codename

Ask @jovial sapphire

quartz pike Oct 25, 2025, 3:03 PM

#

yall its ai release season

#

we got suno v4.5 all. sonnet 4.5. news of gemini 3.0 coming soon. ltx 2. hailuo 2.3

#

😭

#

-# aka models that all recently released

olive mortar Oct 25, 2025, 3:05 PM

#

quartz pike we got suno v4.5 all. sonnet 4.5. news of gemini 3.0 coming soon. ltx 2. hailuo ...

never heard of ltx and hailuo, are they some kind of video/image generators? the only thing that excites me is the gemini 3.0 pro although i think google is prolly gonna make it a subscription

quartz pike Oct 25, 2025, 3:05 PM

#

ltx and hailuo are video models

burnt sinew Oct 25, 2025, 3:05 PM

#

olive mortar never heard of ltx and hailuo, are they some kind of video/image generators? the...

I sure hope they give free usage

quartz pike Oct 25, 2025, 3:05 PM

#

burnt sinew I sure hope they give free usage

they probably will.

#

they need good first impressions

burnt sinew Oct 25, 2025, 3:06 PM

#

quartz pike they need good first impressions

They already have that from ab testers no?

quartz pike Oct 25, 2025, 3:06 PM

#

and they will probably, like most likely, give inf usage for google ai studio

quartz pike Oct 25, 2025, 3:06 PM

#

burnt sinew They already have that from ab testers no?

public first impressions.

olive mortar Oct 25, 2025, 3:06 PM

#

burnt sinew I sure hope they give free usage

aistudio is such a wellmade chat interface, i really hope they do make it free

quartz pike Oct 25, 2025, 3:06 PM

#

quality assurance is different from what the public thinks

burnt sinew Oct 25, 2025, 3:06 PM

#

quartz pike public first impressions.

Yeah I guess

burnt sinew Oct 25, 2025, 3:06 PM

#

olive mortar aistudio is such a wellmade chat interface, i really hope they do make it free

I agree

olive mortar Oct 25, 2025, 3:07 PM

#

quartz pike they need good first impressions

they already got that with 2.5 pro and 2.5 flash, i think they will be setting up some limit probably, or if they actually cared, make it more efficient and cheaper

quartz pike Oct 25, 2025, 3:09 PM

#

olive mortar they already got that with 2.5 pro and 2.5 flash, i think they will be setting u...

Yeah but what if the 3.0 pro and flash launch ends up terrible. and they NEED the public to use it so people recommend it to other people and youtubers hype it up post-release and so on and blah blah blah.

#

What if the gemini 3.0 launch ends up like the gpt 5 launch. gpt 5 was super hyped. but it launched with 50/50 reviews.

olive mortar Oct 25, 2025, 3:10 PM

#

im guessing either google is shooting for the stars or trying every possible way to get the benchmarks slightly higher than gpt-5-high

#

i just hope they make it good in coding aspects so i dont have to use the expensive claude models

quartz pike Oct 25, 2025, 3:12 PM

#

olive mortar i just hope they make it good in coding aspects so i dont have to use the expens...

yeah cause if i remember one deepmind employee said that it would be way better at coding than 2.5 pro.

verbal nimbus Oct 25, 2025, 3:12 PM

#

olive mortar i just hope they make it good in coding aspects so i dont have to use the expens...

That would be good, although Gemini already costs the same as Sonnet/GPT-5 on Copilot.

inland shale Oct 25, 2025, 3:12 PM

#

how do i download my image, its lacking the icon for download

olive mortar Oct 25, 2025, 3:12 PM

#

quartz pike yeah cause if i remember one deepmind employee said that it would be way better ...

gemini cli is so disappointing i hope they improve on that too

olive mortar Oct 25, 2025, 3:13 PM

#

inland shale how do i download my image, its lacking the icon for download

right click ---> save image as

verbal nimbus Oct 25, 2025, 3:13 PM

#

I hope it's good at agentic coding, that seems to be the real test.

inland shale Oct 25, 2025, 3:13 PM

#

it only gives me the web address

quartz pike Oct 25, 2025, 3:13 PM

#

olive mortar gemini cli is so disappointing i hope they improve on that too

yea

olive mortar Oct 25, 2025, 3:13 PM

#

inland shale it only gives me the web address

go into it and right click save image as

olive mortar Oct 25, 2025, 3:14 PM

#

verbal nimbus I hope it's good at agentic coding, that seems to be the real test.

i hope they actually make the 1m context or the rumored 2m to actually be useful instead of hallucinating the second you go over ~250k context

verbal nimbus Oct 25, 2025, 3:14 PM

#

olive mortar i hope they actually make the 1m context or the rumored 2m to actually be useful...

They should release benchmarks showing that the performance on, say SWE-Bench, actually improves as you increase the context.

inland shale Oct 25, 2025, 3:14 PM

#

i did bro, but it gives me the web, not jpg

#

the videos working fine

verbal nimbus Oct 25, 2025, 3:15 PM

#

verbal nimbus They should release benchmarks showing that the performance on, say SWE-Bench, a...

The difference between 128K and 256K for Qwen 3 Coder on SWE-Bench Verified was only around 1%.

olive mortar Oct 25, 2025, 3:16 PM

#

have you seen the ui websites on reddit that it makes?

#

insane compared to gpt5 or sonnet

verbal nimbus Oct 25, 2025, 3:17 PM

#

olive mortar insane compared to gpt5 or sonnet

It actually has internet connectivity, it seems. I'm not sure how fair that is.

burnt sinew Oct 25, 2025, 3:17 PM

#

#ai-creations

olive mortar Oct 25, 2025, 3:17 PM

#

verbal nimbus It actually has internet connectivity, it seems. I'm not sure how fair that is.

internet connectivity as in what? do you mean like grounding?

burnt sinew Oct 25, 2025, 3:17 PM

#

olive mortar gemini cli is so disappointing i hope they improve on that too

Wdym? It's amazing

verbal nimbus Oct 25, 2025, 3:18 PM

#

olive mortar internet connectivity as in what? do you mean like grounding?

As in if I ask it for top news on Hacker News, it seems to fetch a cached version of it from just 2 days ago.

quartz pike Oct 25, 2025, 3:18 PM

#

Personally my favorite ai's for coding are:

if you have no budget limit:

sonnet 4.5 thinking max thinking budget.
gpt 5 thinking high.
gemini 2.5 pro max thinking budget

if you have a small budget:

kimik2.
deepseek v3.2
gemini 2.5 flash latest.

olive mortar Oct 25, 2025, 3:18 PM

#

burnt sinew Wdym? It's amazing

its missing some critical features, and codex and claude code perform better in my testing

olive mortar Oct 25, 2025, 3:19 PM

#

quartz pike Personally my favorite ai's for coding is: if you have no budget: sonnet 4.5 t...

what about glm 4.6

quartz pike Oct 25, 2025, 3:19 PM

#

olive mortar what about glm 4.6

meh, i tried it

olive mortar Oct 25, 2025, 3:20 PM

#

kilo code actually tested glm 4.6, haiku 4.5 and gpt-5-mini and they concluded that gpt-5-mini is actually the best in their test

#

i think thats very interesting

verbal nimbus Oct 25, 2025, 3:20 PM

#

olive mortar kilo code actually tested glm 4.6 haiku 4.5 and gpt-5-mini and they concluded th...

Not that surprising to me

olive mortar Oct 25, 2025, 3:20 PM

#

verbal nimbus Not that surprising to me

i havent seen anyone actually talking about the mini models from the gpt5 series a lot

verbal nimbus Oct 25, 2025, 3:20 PM

#

GPT-5 mini is actually more persistent than GPT-5

#

GPT-5 Codex returns too quickly, kind of like GPT-4.1. It's very annoying.

#

Asks me to run tests when that's its job.

#

Told it multiple times in the chat as well to not return/report until all tasks are completed, but it keeps returning early.

olive mortar Oct 25, 2025, 3:22 PM

#

verbal nimbus Told it multiple times in the chat as well to not return/report until all tasks ...

same keeps happening with me with gpt-5-high, it thinks way too much on easy problems

verbal nimbus Oct 25, 2025, 3:23 PM

#

Agentic ability seems like an important factor to test for. i.e., can it work autonomously to actually complete the tasks without returning early, losing context, hallucinating tool outputs, and can it use tools properly + plan + solve issues that arise, etc.

olive mortar Oct 25, 2025, 3:24 PM

#

really hope it blows all models out the water otherwise it'll probably be some minor change

quartz pike Oct 25, 2025, 3:24 PM

#

https://chat.z.ai/c/4b8c41c9-f64e-4009-a867-31ead653cc2c glm 4.6 is genuinely ass

Z.ai Chat - Free AI powered by GLM-4.6 & GLM-4.5

Chat with Z.ai's free AI to build apps, create presentations, and write professionally. Fast, smart, and reliable, powered by GLM-4.6.

#

😭

#

https://chat.z.ai/space/k0h1d9ktuu11-art

Z.AI

Z.AI Share

Great content shared from Z.AI

#

thats the thing i asked it to do

#

😭

verbal nimbus Oct 25, 2025, 3:25 PM

#

olive mortar really hope it blows all models out the water otherwise it'll probably be some m...

It's kind of surprising how weak 2.5 Pro is at tool use. Even without tools it would hallucinate using a tool.

olive mortar Oct 25, 2025, 3:25 PM

#

verbal nimbus It's kind of surprising how weak 2.5 Pro is at tool use. Even without tools it w...

funniest thing is i just see it executing the tool calls in the thinking sometimes

verbal nimbus Oct 25, 2025, 3:26 PM

#

olive mortar funniest thing is i just see it executing the tool calls in the thinking sometim...

I've had it claim to use tools on LMArena when there are none.

olive mortar Oct 25, 2025, 3:26 PM

#

verbal nimbus I've had it claim to use tools on LMArena when there are none.

when i enable function calling on aistudio it keeps saying heres the code! but doesnt actually type anything

#

and then it keeps repeating

quartz pike Oct 25, 2025, 3:26 PM

#

olive mortar funniest thing is i just see it executing the tool calls in the thinking sometim...

fr lol. one time i told it to be gen z and it hallucinated a "skibidy_search: rizz" ish toolcall. its something similar to that. but it was a few months ago.

verbal nimbus Oct 25, 2025, 3:27 PM

#

I thought Orionmist was hallucinating, but I think it actually had (cached?) internet access (lol): #codename-discussion message

burnt sinew Oct 25, 2025, 3:27 PM

#

olive mortar its missing some critical features, and codex and claude code perform better in ...

What features is it missing? I don't use claude code or Codex.

#

So I wouldn't know

verbal nimbus Oct 25, 2025, 3:27 PM

#

olive mortar when i enable function calling on aistudio it keeps saying heres the code! but d...

Yeah or it hallucinates attachments when you forget to attach them (weird)

stray aspen Oct 25, 2025, 3:27 PM

#

quartz pike https://chat.z.ai/space/k0h1d9ktuu11-art

what did you expect lol

#

its not SotA

quartz pike Oct 25, 2025, 3:27 PM

#

stray aspen its not SotA

sota?

#

and i expected it to give a coherent result

olive mortar Oct 25, 2025, 3:28 PM

#

burnt sinew So I wouldn't know

dont really remember, all i remembered is i had a really bad experience but if you say its good maybe they improved alot on it

#

i tried it first when it released

olive mortar Oct 25, 2025, 3:28 PM

#

verbal nimbus Yeah or it hallucinates attachments when you forget to attach them (weird)

yesterday it told me 2-1 is 3 in a math equation (2.5 pro 1.0 temp)

burnt sinew Oct 25, 2025, 3:28 PM

#

verbal nimbus It's kind of surprising how weak 2.5 Pro is at tool use. Even without tools it w...

Sometimes it thinks it can upload code to github and then sends me a hallucinated gist link

verbal nimbus Oct 25, 2025, 3:28 PM

#

verbal nimbus I thought Orionmist was hallucinating, but I think it actually had (cached?) int...

If true, then I think that's kind of unfair. Can it just search Github code?

burnt sinew Oct 25, 2025, 3:29 PM

#

olive mortar dont really remember, all i remembered is i had a really bad experience but if y...

Well im saying i might not know what I'm missing

#

What did you see missing

verbal nimbus Oct 25, 2025, 3:29 PM

#

olive mortar yesterday it told me 2-1 is 3 in a math equation (2.5 pro 1.0 temp)

Oh that's odd...

verbal nimbus Oct 25, 2025, 3:30 PM

#

burnt sinew Sometimes it thinks it can upload code to github and then sends me a hallucinate...

It seems like all the Chinese models (likely) trained on Gemini have similar hallucination issues.

olive mortar Oct 25, 2025, 3:30 PM

#

verbal nimbus Oh that's odd...

also for some reason it doesnt use LaTeX

verbal nimbus Oct 25, 2025, 3:30 PM

#

olive mortar also for some reason it doesnt use LaTeX

It does for me, and if it forgets I just tell it.

olive mortar Oct 25, 2025, 3:31 PM

#

verbal nimbus It does for me, and if it forgets I just tell it.

for me it sometimes does and sometimes doesnt, uses 1/2 for fractions

#

hit or miss kinda

#

also the aistudio default temp really sucks

verbal nimbus Oct 25, 2025, 3:31 PM

#

olive mortar also the aistudio default temp really sucks

The scrolling is kind of buggy for me

#

It's impossible to scroll to some messages sometimes, it just skips up or down

olive mortar Oct 25, 2025, 3:32 PM

#

verbal nimbus The scrolling is kind of buggy for me

me too sometimes when i scroll up it brings me down

olive mortar Oct 25, 2025, 3:32 PM

#

verbal nimbus It's impossible to scroll to some messages sometimes, it just skips up or down

yep

verbal nimbus Oct 25, 2025, 3:32 PM

#

Hmm that's kind of a bad look tbh :P

olive mortar Oct 25, 2025, 3:32 PM

#

its funny how easily jailbreakable the model is ngl

verbal nimbus Oct 25, 2025, 3:32 PM

#

If their internal model is good, it would have fixed it

#

The input box for TTS on AIStudio has the same de-focusing issue Gemini models seem to make when on mobile

wicked sage Oct 25, 2025, 3:33 PM

#

hi guys im currently trying to self host a minecraft server for me myself and i!

olive mortar Oct 25, 2025, 3:34 PM

#

wicked sage hi guys im currently trying to self host a minecraft server for me myself and i!

oooh kay?

glossy sleet Oct 25, 2025, 3:40 PM

#

hlo

olive mortar Oct 25, 2025, 3:40 PM

#

hey lo

golden ocean Oct 25, 2025, 3:51 PM

#

wicked sage hi guys im currently trying to self host a minecraft server for me myself and i!

spare rune Oct 25, 2025, 3:55 PM

#

if gemini 3 doesnt come out this month im gonna die

#

..

gleaming roost Oct 25, 2025, 3:56 PM

#

spare rune if gemini 3 doesnt come out this month im gonna die

Dude, Gemini 3 is like drugs, once you try it you can't stop.

spare rune Oct 25, 2025, 3:56 PM

#

real

#

and its so good for my niche task too

#

why is r*blox a banned word

#

anyways

#

idk if its niche or not but i use it for r*blox scripting

gleaming roost Oct 25, 2025, 3:58 PM

#

ro blox

golden ocean Oct 25, 2025, 3:58 PM

#

spare rune idk if its niche or not but i use it for r*blox scripting

what model is gemini 3

#

the lithiumflow thing?

spare rune Oct 25, 2025, 3:58 PM

#

yeah

#

and orionmist i guess

#

but they are the same thing

#

just one is grounded with google

#

search

golden ocean Oct 25, 2025, 3:59 PM

#

are u getting it to generate boblox scripts via webdev arena

spare rune Oct 25, 2025, 3:59 PM

#

used to

#

😭

golden ocean Oct 25, 2025, 3:59 PM

#

its available somewhere else?!?!?!?

spare rune Oct 25, 2025, 3:59 PM

#

no

golden ocean Oct 25, 2025, 3:59 PM

#

oh

jagged fjord Oct 25, 2025, 3:59 PM

#

hi

golden ocean Oct 25, 2025, 3:59 PM

#

so ure saying its GONE?!?!?! pleading

#

noooo

spare rune Oct 25, 2025, 3:59 PM

#

by used to i mean i used to before they stopped giving it in battle mode

#

😭

gleaming roost Oct 25, 2025, 3:59 PM

#

technically it is possible yes

spare rune Oct 25, 2025, 4:00 PM

#

i assume the ab testing in lmarena is still there but idk

#

its a pain to get a response from there too

gleaming roost Oct 25, 2025, 4:00 PM

#

just ask for something like a website that contains the sample script

spare rune Oct 25, 2025, 4:00 PM

#

sample script of what..

gleaming roost Oct 25, 2025, 4:00 PM

#

ro blox

golden ocean Oct 25, 2025, 4:00 PM

#

golden ocean are u getting it to generate boblox scripts via webdev arena

hes talking about this

#

but we concluded

#

the model is GONE

#

in the first place

spare rune Oct 25, 2025, 4:01 PM

#

maybe

#

just maybe

#

we can hope its because release is imminent

#

because no need to have stealth models if the model is gonna come out tomorrow (im delusional)

gleaming roost Oct 25, 2025, 4:01 PM

#

I had read on a website that the launch was in December

spare rune Oct 25, 2025, 4:02 PM

#

i thought that was for another google stuff

#

can i like phase out of my life until december hits because..

gleaming roost Oct 25, 2025, 4:02 PM

#

😔

fresh mirage Oct 25, 2025, 4:07 PM

#

gleaming roost Dude, Gemini 3 is like drugs, once you try it you can't stop.

IKR?

#

it feels like I'm having withdrawals lmao

#

after using it for 3-4 days in a row

gleaming roost Oct 25, 2025, 4:11 PM

#

🤣

spare rune Oct 25, 2025, 4:12 PM

#

This is what google does to people

#

It’s like every model I now use is like 10 percent of Google's “lithiumflow”, which people say is the FLASH version btw

quartz light Oct 25, 2025, 4:14 PM

#

#

https://tenor.com/view/взгляд-2000-ярдов-война-war-soldier-gif-3632617944134077161

Tenor

#

beware of scams

spare rune Oct 25, 2025, 4:16 PM

#

oh

#

Funny thing

#

Someone who I thought was smart..

#

And good at tech in general sent me a get a free steam account now

#

Text or something

quartz light Oct 25, 2025, 4:18 PM

#

quartz light

fiery gull Oct 25, 2025, 4:29 PM

#

quartz light

I see it in all the dead servers 🤣

olive mortar Oct 25, 2025, 4:32 PM

#

spare rune by used to i mean i used to before they stopped giving it in battle mode

mfs kept yapping in the server so they removed it

spare rune Oct 25, 2025, 4:38 PM

#

they're secretly hyping it up

#

..

olive mortar Oct 25, 2025, 4:38 PM

#

spare rune they're secretly hyping it up

who

spare rune Oct 25, 2025, 4:39 PM

#

gemini, more specifically the person who works at it on X. hes doing all sorts of stuff

#

also because the fact they put it on lm arena

olive mortar Oct 25, 2025, 4:42 PM

#

they do it to get their result so they can showcase it when the model actually gets released, not for hyping it up

azure sorrel Oct 25, 2025, 4:43 PM

#

daring rock Oct 25, 2025, 4:47 PM

#

@royal scarab Please head to #1397655624103493813 for a detailed guide on how to use the bot

hushed terrace Oct 25, 2025, 4:56 PM

#

@orchid wedge Hi! Please check https://discordapp.com/channels/1340554757349179412/1397655624103493813 to learn how to generate content

stray aspen Oct 25, 2025, 5:02 PM

#

yo

#

i got the role back lol

gaunt spade Oct 25, 2025, 5:10 PM

#

olive mortar mfs kept yapping in the server so they removed it

wdym we keep yapping?

#

thats how we test stuff?

grand echo Oct 25, 2025, 5:11 PM

#

Where can I see my creations

fiery gull Oct 25, 2025, 5:12 PM

#

grand echo Where can I see my creations

I wanna see 🫣

formal trout Oct 25, 2025, 5:14 PM

#

Hi every one!

burnt sinew Oct 25, 2025, 5:33 PM

#

formal trout Hi every one!

Hi

tawny kelp Oct 25, 2025, 5:41 PM

#

I noticed something about Gemini. It tends to be very steadfast in its beliefs. If I accidentally ask it something that is past its knowledge cutoff and clarify, it insists that the thing I said still doesn't exist.

gleaming roost Oct 25, 2025, 5:42 PM

#

2.5?

tawny kelp Oct 25, 2025, 5:43 PM

#

I forget which one was the latest that did it, but I noticed the trend for a while.

gaunt spade Oct 25, 2025, 5:43 PM

#

tawny kelp I noticed something about Gemini. It tends to be very steadfast in its beliefs. ...

meanwhile Claude goes with your conversation and makes stuff up lol, I told it about the RTX 5090 and its specs were made up entirely by Claude

tawny kelp Oct 25, 2025, 5:43 PM

#

gaunt spade meanwhile Claude goes with your conversation and makes stuff up lol, I told it a...

I've seen that as well.

#

I find it fascinating how each model has its own "personality".

gaunt spade Oct 25, 2025, 5:45 PM

#

tawny kelp I find it fascinating how each model has its own "personality".

Claude's responses are very human tbh, I never get anything like that (except GPT5 which adds a lot of emojis in every chat)

#

i hate the emoji stuff

tawny kelp Oct 25, 2025, 5:45 PM

#

I notice that as well. Like a person who wants to satisfy the person it's talking to.

gaunt spade Oct 25, 2025, 5:46 PM

#

tawny kelp I notice that as well. Like a person who wants to satisfy the person it's talkin...

they're also great at story telling and fantasy generations

tawny kelp Oct 25, 2025, 5:47 PM

#

Yeah. Grok seems to be pretty good with that as well, but not quite to Claude's levels.

#

Though whenever I want to talk about things I've written, I only discuss it with locally-run models for privacy reasons.

#

Typically I talk to either Qwen or Dolphin-Mixtral about those sorts of things.

gleaming roost Oct 25, 2025, 5:59 PM

#

gaunt spade Oct 25, 2025, 6:01 PM

#

lol

olive mortar Oct 25, 2025, 6:01 PM

#

tawny kelp I noticed something about Gemini. It tends to be very steadfast in its beliefs. ...

funny part about these new state of the art models, they think you are testing them or evaluating their response sometimes so they actually just refuse you or say "this is obviously a test to evaluate my capabilities"

stray aspen Oct 25, 2025, 6:03 PM

#

gleaming roost

rofl

ruby knoll Oct 25, 2025, 6:04 PM

#

Hello, I test and try current AI systems

tawny kelp Oct 25, 2025, 6:14 PM

#

olive mortar funny part about these new state of the art models, they think you are testing t...

I've seen that before. It raises my eyebrow every time I see it happen.

echo birch Oct 25, 2025, 6:24 PM

#

Thanks this useful framework

compact junco Oct 25, 2025, 6:40 PM

#

hi i am alex, i am german but english speaking shouldnt be the problem

fiery gull Oct 25, 2025, 6:41 PM

#

compact junco hi i am alex, i am german but english speaking shouldnt be the problem

Hi, want some help? I'm here to help 🙂

stray aspen Oct 25, 2025, 6:42 PM

#

wassup

compact junco Oct 25, 2025, 6:44 PM

#

fiery gull Hi, want some help? I'm here to help 🙂

try to figure it all out, i wrote a book and now i want to create some videos for the promotion on social media

daring rock Oct 25, 2025, 6:45 PM

#

@compact junco you can go to #1397655624103493813 for a detailed guide on how to use the bot

compact junco Oct 25, 2025, 6:46 PM

#

daring rock @compact junco you can go to #1397655624103493813 for a detailed guide...

ok try it out

fiery gull Oct 25, 2025, 6:47 PM

#

compact junco ok try it out

very interesting, if you have problems with video generation in lmarena you can use Grok Imagine v0.9, it has audio and is also 100% free

balmy mist Oct 25, 2025, 7:17 PM

#

icy frost Oct 25, 2025, 7:21 PM

#

sora generated friday night funkin'

fiery gull Oct 25, 2025, 7:22 PM

#

icy frost sora generated friday night funkin'

wow!

#

Sora 2 can probably do a million things that we can't even imagine

stray mortar Oct 25, 2025, 7:29 PM

#

spare rune idk if its niche or not but i use it for r*blox scripting

how good were the scripts

olive mortar Oct 25, 2025, 7:52 PM

#

stray mortar how good were the scripts

obviously by his tone its like r0blox scripts sent down by pharaoh himself

stray mortar Oct 25, 2025, 7:53 PM

#

hopefully gemini 3 is that good

gleaming roost Oct 25, 2025, 7:54 PM

#

If the preview was already this insanely superior to other models, I can't wait for the PRO version

stray aspen Oct 25, 2025, 7:57 PM

#

stray mortar hopefully gemini 3 is that good

it is

#

its so damn good

#

it crushed every other model at Web development

stray mortar Oct 25, 2025, 7:57 PM

#

might unsubscribe from chatgpt and subscribe to Gemini when it releases

tulip tree Oct 25, 2025, 8:18 PM

#

Gemini is the best

#

golden ocean Oct 25, 2025, 8:19 PM

#

tulip tree Oct 25, 2025, 8:20 PM

#

ewave

balmy mist Oct 25, 2025, 8:20 PM

#

stray mortar might unsubscribe from chatgpt and subscribe to Gemini when it releases

i switched to gemini fully recently even tho i got chatgpt pro and i cant wait until g3 come out, g2.5 is slept on

tulip tree Oct 25, 2025, 8:20 PM

#

balmy mist i switched to gemini fully recently even tho i got chatgpt pro and i cant wait u...

spinning_skull

#

true

#

drinkbeer

wicked sage Oct 25, 2025, 8:21 PM

#

hi guys i need help.. please

#

which one should i buy, gemini or chatgpt (by buy i mean get the paid plan)

stray aspen Oct 25, 2025, 8:22 PM

#

balmy mist i switched to gemini fully recently even tho i got chatgpt pro and i cant wait u...

sometimes it cooks and sometimes it sucks

crimson sage Oct 25, 2025, 8:33 PM

#

Hello there, does someone know if there is a problem on the website? It's not letting me upload images as input.

fiery gull Oct 25, 2025, 8:41 PM

#

wicked sage which one should i buy, gemini or chatgpt (by buy i mean get the paid plan)

I have Gemini Pro, and honestly I don't see any reason to subscribe, if AiStudio exists

stray aspen Oct 25, 2025, 8:42 PM

#

true

fiery gull Oct 25, 2025, 8:43 PM

#

wicked sage which one should i buy, gemini or chatgpt (by buy i mean get the paid plan)

but that depends a lot on what you want to do, if it were me spending money on AI, I would sign up for the GLM coding plan, and try to do something agentic

wicked sage Oct 25, 2025, 8:43 PM

#

fiery gull but that depends a lot on what you want to do, if it were me spending money on A...

i chose

#

gemini

#

🎉

#

there was like an offer on this

#

so i HAD to get it

#

because

#

it had storage

#

stuff

#

i hate myself

stray aspen Oct 25, 2025, 8:49 PM

#

i got the college student free subscription

covert beacon Oct 25, 2025, 8:55 PM

#

hello

wicked sage Oct 25, 2025, 8:58 PM

#

stray aspen i got the college student free subscription

niceee!!!

#

gl

honest gulch Oct 25, 2025, 9:22 PM

#

Hi @wicked sage

wicked sage Oct 25, 2025, 9:25 PM

#

I'm evil.

olive mortar Oct 25, 2025, 9:44 PM

#

stray aspen i got the college student free subscription

yeah man i totally love google having quite literally all my data

daring rock Oct 25, 2025, 9:52 PM

#

@pine spruce Please head to #1397655624103493813 for a detailed guide on how to use the bot

neat apex Oct 25, 2025, 9:52 PM

#

How is MiniMax M2 doing at all? Xd

twin plinth Oct 25, 2025, 9:55 PM

#

this a great platform to learn and increase my AI knowledge

surreal creek Oct 25, 2025, 9:57 PM

#

sometimes I wish there was a way to like

#

undo a vote, lol

#

It’s rare but

#

Sometimes I click wrong

#

and hit tie when I liked one model more or both are bad when I meant to pick another

#

I get how it would be abused by people revoking votes after seeing the models revealed

#

but

#

always feels so silly

thorny cove Oct 25, 2025, 10:17 PM

#

will there ever be a website video gen?

stray aspen Oct 25, 2025, 10:23 PM

#

olive mortar yeah man i totally love google having quite literally all my data

I dont care about that lol

#

If you don't want your data collected don't use the internet

drifting crow Oct 25, 2025, 10:27 PM

#

What if I want to use the internet without my data collected?

hardy sphinx Oct 25, 2025, 10:28 PM

#

🔥 Hi

drifting crow Oct 25, 2025, 10:28 PM

#

Bro ur on fire

fiery gull Oct 25, 2025, 10:49 PM

#

wicked sage it had storage

ahhh okay, it makes 100% sense now, I'm just sad because they nerfed 2.5 pro in the app

#

I still recommend using AiStudio

#

...

#

#

I was right, I always trusted the chinas 🔥🔥🔥🔥

magic stag Oct 25, 2025, 10:56 PM

#

fiery gull ...

Yeah grok 4 above sonnet 4.5 and opus 4.1 in anything

#

LOL JUST REALIZED GROK 4 FAST IS ABOVE OPUS 4.1 TOO

queen veldt Oct 25, 2025, 10:57 PM

#

magic stag Oct 25, 2025, 10:57 PM

#

😭

fiery gull Oct 25, 2025, 10:59 PM

#

queen veldt

even if it's benchmaxxing it's too good

queen veldt Oct 25, 2025, 11:06 PM

#

fiery gull Oct 25, 2025, 11:09 PM

#

BRUHHHH

#

the minimax m2 is speaking chinese

#

forget what I said

#

#1397655624103493813

echo sinew Oct 25, 2025, 11:14 PM

#

@carmine river Please read our guide in https://discord.com/channels/1340554757349179412/1397655624103493813 to learn how to properly prompt the bot.

#

@cinder finch Please read our guide in https://discord.com/channels/1340554757349179412/1397655624103493813 to learn how to properly prompt the bot.

sullen quest Oct 25, 2025, 11:27 PM

#

fiery gull ahhh okay, it make 100% sense now, I'm just sad because they nerfed 2.5 pro in t...

its not nerfed it has dumb instructions

sullen quest Oct 25, 2025, 11:28 PM

#

wicked sage there was like an offer on this

if there's anything good paid gemini has, can I test it through you?

fiery gull Oct 25, 2025, 11:36 PM

#

sullen quest its not nerfed it has dumb instructions

the memory from 2.5 pro is nerfed, I have a long chat and I feel the changes

reef sleet Oct 25, 2025, 11:42 PM

#

hello

hazy kernel Oct 25, 2025, 11:55 PM

#

fiery gull the memory from 2.5 pro is nerfed, I have a long chat and I feel the changes

Bro felt it 🥀

stray aspen Oct 25, 2025, 11:59 PM

#

minimax m2 is above gemini 2.5 pro on the benchmarks

#

lol

#

they cant be fr

lofty trench Oct 26, 2025, 12:05 AM

#

not sure how long this’ll last, but you can scan the QR code to get Comet Pro and a month of Perplexity Pro for free 😊

whole sundial Oct 26, 2025, 12:05 AM

#

m2 is pretty dumb in my testing, maybe a bit dumber than m1. not worth the hype imo. even gpt-oss-120b is better sometimes and that model is half the params

#

I don't like that yupp discord did a @ Verified (that server's equivalent of @ everyone) over this

sullen quest Oct 26, 2025, 12:06 AM

#

whole sundial I don't like that yupp discord did a @ Verified (that server's equivalent of @ e...

mm

naive grove Oct 26, 2025, 12:07 AM

#

He trying out image to video generation

whole sundial Oct 26, 2025, 12:08 AM

#

if it was k2 reasoning or v4 or something of that class that would be fine, not a 270b that has less knowledge than OpenAI's safetymaxxed and benchmaxxed model that is less than half the size AND has 4bit weights instead of 8bit or 16bit (not sure about that, but Chinese models are trending towards 8bit so it could be that)

normal peak Oct 26, 2025, 12:22 AM

#

Can ai cure cancer

magic stag Oct 26, 2025, 12:24 AM

#

can someone point me where i should start learning proper way of making custom instructions or at least finding good ones?

#

and prompt optimizing

#

never bothered

#

non coding purposes

hollow ivy Oct 26, 2025, 12:46 AM

#

normal peak Can ai cure cancer

'Cancer' is a label for a wide variety of ailments. AI would have to learn and understand each one meticulously before it could even dream of tackling it. And I bet such an AI is still at least 5 years away.

#

Cancer also can be caused by many different things (including: chemistry, UV-radiation, radioactivity, poisonous food, toxins, infections, fungus, genetic diseases, even psychosomatic causes)

normal peak Oct 26, 2025, 12:50 AM

#

Thank you

hollow ivy Oct 26, 2025, 12:51 AM

#

magic stag can someone point me where i should start learning proper way of making custom i...

you could ask Gemini 2.5 pro in google's AI studio to help

#

there's also OpenAI's "cookbook"

#

https://cookbook.openai.com/examples/gpt-5/gpt-5_prompting_guide

GPT-5 prompting guide | OpenAI Cookbook

GPT-5, our newest flagship model, represents a substantial leap forward in agentic task performance, coding, raw intelligence, and steera...

#

Cancer could be called a local failure of the body's innate self-repair system.

#

(Normally, the body quickly recycles cells, which became cancerous. It has to do with our immune system. The immune system can also be influenced [indirectly] by our emotions, or how we feel. Of course, if cancer has appeared, it is not enough to have a 'high spirit' to heal it. One would need targeted therapy.)

magic stag Oct 26, 2025, 1:02 AM

#

hollow ivy Cancer also can be caused by many different things (including: chemistry, UV-rad...

"psychosomatic" OkAnd

sullen quest Oct 26, 2025, 1:03 AM

#

normal peak Can ai cure cancer

llm's are terrible at doing science research, but there's plenty of other ai's that are helpful right now in ai research, so yes, ai can help cure cancer

magic stag Oct 26, 2025, 1:22 AM

#

hollow ivy you could ask Gemini 2.5 pro in google's AI studio to help

i did it myself with 1 prompt from chatgpt then editing it

#

worked out far better than i thought possible honestly

#

i actually like it more than claude explanatory mode now, which is what i was seeking to emulate....

#

a lot more tbh.... wow

#

ill post with vs without and the instructions

cedar jasper Oct 26, 2025, 1:29 AM

#

we can use open art here?

simple sleet Oct 26, 2025, 1:30 AM

#

G, do you know of a video upscaler that improves the realism of people? I'm trying SeedVR, but it takes a trillion years. Also Topaz, but it smooths out the upscaling.

regal bridge Oct 26, 2025, 1:32 AM

#

magic stag Oct 26, 2025, 1:33 AM

#

magic stag ill post with vs without and the instructions

#

obviously the longer non-useless one is with instructions

sullen quest Oct 26, 2025, 1:34 AM

#

regal bridge

here you go

magic stag Oct 26, 2025, 1:34 AM

#

magic stag

dont want to spam channel with instructions can give if someone wants

#

they're long

sullen quest Oct 26, 2025, 1:34 AM

#

magic stag dont want to spam channel with instructions can give if someone wants

will be put in a text file by discord so it'll be fine

magic stag Oct 26, 2025, 1:46 AM

#

sullen quest will be put in a text file by discord so it'l be fine

only spent about 3 iterations editing this

📎 instructions.txt

#

im sure it can be way better

#

i was using settings "Depth: expert. Voice: analogy-heavy. Scope: include adjacent context. "

magic stag Oct 26, 2025, 1:46 AM

#

sullen quest will be put in a text file by discord so it'l be fine

only problem is, gpt5-high obeys them and looks like that. Pro basically ignores it

quartz light Oct 26, 2025, 1:47 AM

#

GROK 5?

magic stag Oct 26, 2025, 1:48 AM

#

yeah right

#

lol

magic stag Oct 26, 2025, 1:50 AM

#

magic stag only problem is, gpt5-high obeys them and looks like that. Pro basically ignores...

actually pro web searched and high didnt so i will disable web search and try again

sullen quest Oct 26, 2025, 1:50 AM

#

quartz light GROK 5?

tell me how good this is

quartz light Oct 26, 2025, 1:51 AM

#

sullen quest tell me how good this is

so

#

the prompt was

#

to shorten an already shortened to hell

#

script

#

and

#

ill check which one is shorter

sullen quest Oct 26, 2025, 1:51 AM

#

quartz light ill check which one is shorter

check quality of writing too

quartz light Oct 26, 2025, 1:55 AM

#

sullen quest check quality of writing too

crazy

sullen quest Oct 26, 2025, 1:55 AM

#

woah

#

I wonder which one is the new one.........

quartz light Oct 26, 2025, 1:55 AM

#

DUDE

#

...

#

oh my god

#

#

grok cant even render anything

#

😭

#

😭

#

dude i have no idea what to do now

#

i cant copy the responses

#

nvm

#

phew

#

atleast it saves

#

UNLIKE AISTUDIO'S AB TESTS

#

phew.

#

now i can check network requests

magic stag Oct 26, 2025, 1:57 AM

#

magic stag actually pro web searched and high didnt so i will disable web search and try ag...

ok this fixed it

quartz light Oct 26, 2025, 2:01 AM

#

quartz light crazy

@sullen quest ... the one which thought almost 2x longer was.... worse..

it was the same length as the other one AND it didn't work..

#

so..

#

maybe the right one is the new one

#

thatd be awesome

sullen quest Oct 26, 2025, 2:01 AM

#

ooh

#

how long was the prompt btw?

quartz light Oct 26, 2025, 2:02 AM

#

sullen quest how long was the prompt btw?

564 chars/80 words

#

its kinda cringe

#

"perfectly, properly, go through many iterations of "wait, i can make this shorter by.." and say what you are going to do. it should make logical sense and should actually be shorter. then, look over all your iterations (at least 50 unique and true iterations with actual changed code and optimisations) and create shortest truly possible html file. current html file i want you to shorten to shortest truly truly possible while still working (109 chars): <script>history.replaceState(0,0,location+(location.search?'&':'?')+'__websim_screenshot_mode=true')</script>"

sullen quest Oct 26, 2025, 2:03 AM

#

oh, wow that is short already
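
The "109 chars" figure quoted in the prompt above is easy to check mechanically. The following is a minimal sketch, assuming Python; the helper names `rank_by_length` and `is_shorter` are hypothetical and only measure length. They say nothing about whether a shortened snippet still works in a browser, which has to be tested separately.

```python
# The snippet quoted in the prompt, copied verbatim.
ORIGINAL = "<script>history.replaceState(0,0,location+(location.search?'&':'?')+'__websim_screenshot_mode=true')</script>"


def rank_by_length(candidates):
    """Return (length, snippet) pairs sorted shortest first."""
    return sorted((len(c), c) for c in candidates)


def is_shorter(candidate, baseline=ORIGINAL):
    """True only if the candidate has strictly fewer characters than the baseline."""
    return len(candidate) < len(baseline)


if __name__ == "__main__":
    print(len(ORIGINAL))  # character count of the baseline snippet
    # Paste each model's answer into this list to compare lengths.
    for length, snippet in rank_by_length([ORIGINAL]):
        print(length, snippet)
```

A model answer that comes back "the same length as the other one" would show up here as a tie, which is the check quartz light describes doing by hand.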

stable quiver Oct 26, 2025, 2:04 AM

#

Hi

sullen quest Oct 26, 2025, 2:06 AM

#

quartz light 564 chars/80 words

I'll try to get it myself, expert or grok 4 fast when you got this?

quartz light Oct 26, 2025, 2:06 AM

#

sullen quest I'll try to get it myself, expert or grok 4 fast when you got this?

expert, of course

quartz light Oct 26, 2025, 2:08 AM

#

sullen quest oh, wow that is short already

btw i used grok 4 because i tested all models and grok 4 0709 on lmarena is consistently MUCH better than any other model for shortening code without breaking it

#

its just that specific use case

sullen quest Oct 26, 2025, 2:08 AM

#

mm

#

I'll keep that in mind

wet beacon Oct 26, 2025, 2:15 AM

#

Trying to make videos, I'm from Mexico, don't know how to start

keen beacon Oct 26, 2025, 2:18 AM

#

Man these ai companies are tripping lol

#

some block outright simple things while others let it all through lol

sullen quest Oct 26, 2025, 2:23 AM

#

wet beacon Trying to make videos, I'm from Mexico, don't know how to start

go to how to video bot

balmy mist Oct 26, 2025, 2:53 AM

#

which is better?

sullen quest Oct 26, 2025, 3:21 AM

#

1

keen sedge Oct 26, 2025, 3:25 AM

#

which model has the best understanding of MP3 files and can accurately describe music

sullen quest Oct 26, 2025, 3:28 AM

#

keen sedge which model has the best understanding of MP3 files and can accurately describe...

I don't think that's a feature many LLMs have

#

gl

fiery gull Oct 26, 2025, 3:47 AM

#

The creative writing from minimax m2 is soo good 😆

#

But the benchmarks say it is soo bad at memory 🙄

frigid eagle Oct 26, 2025, 3:52 AM

#

In a mysterious jungle, a young man and his loyal lion companion must complete five impossible tasks to restore balance to nature. Each challenge reveals courage, emotion, and the deep bond between man and beast. Combining AI-generated visuals with cinematic storytelling, the film takes viewers on a breathtaking adventure through the wild.

junior marsh Oct 26, 2025, 3:54 AM

#

Open ai

magic stag Oct 26, 2025, 4:07 AM

#

frigid eagle In a mysterious jungle, a young man and his loyal lion companion must complete f...

Bro thinks hes generating a whole movie

frigid eagle Oct 26, 2025, 4:09 AM

#

Yes

fiery gull Oct 26, 2025, 4:10 AM

#

I love when I'm using sonnet 4.5 and switch to another ai and start thinking like sonnet 🤣

#

I don't know, it seems that the m2 gets smarter with the thinking from sonnet

#

M2 is only weird with multilingual, but it really knows how to speak Brazilian Portuguese, it just has generic preview errors

sullen quest Oct 26, 2025, 4:24 AM

#

frigid eagle Yes

??

forest radish Oct 26, 2025, 4:30 AM

#

I just noticed that the data in the LMArena Hugging Face repo hasn’t been updated since August (both pickle file and metadata). Are there any plans to update it, or will it no longer be available going forward? Thank you!

quartz light Oct 26, 2025, 4:37 AM

#

spare rune Oct 26, 2025, 4:56 AM

#

stray mortar how good were the scripts

Good

sullen quest Oct 26, 2025, 5:05 AM

#

forest radish I just noticed that the data in the LMArena Hugging Face repo hasn’t been update...

@echo aurora

echo aurora Oct 26, 2025, 5:09 AM

#

forest radish I just noticed that the data in the LMArena Hugging Face repo hasn’t been update...

Yes, it is our intention to continue to update that data. Our apologies it's been awhile. I'll be sure to bring this up.

forest radish Oct 26, 2025, 5:14 AM

#

echo aurora Yes, it is our intention to continue to update that data. Our apologies it's bee...

Sounds great. Thank you!!🙏

upbeat wharf Oct 26, 2025, 5:29 AM

#

Hello

whole sundial Oct 26, 2025, 5:52 AM

#

whole sundial @echo aurora you never gave me a proper answer the last time i asked bu...

@echo aurora?

echo aurora Oct 26, 2025, 6:57 AM

#

whole sundial @echo aurora?

My apologies! You're right, I never did get back to you on this. Yes, there's no move to a different repository; it is still this one, we just haven't updated it in a while.

What's the wiki you volunteer for?

signal terrace Oct 26, 2025, 6:58 AM

#

hlo

whole sundial Oct 26, 2025, 6:59 AM

#

echo aurora My apologies! You're right I never did get back to you on this. Yes, no change i...

https://fmhy.net/

#

specifically the AI page

echo aurora Oct 26, 2025, 7:01 AM

#

whole sundial https://fmhy.net/

This is rly nice, ty for sharing it

#

It's extensive

whole sundial Oct 26, 2025, 7:04 AM

#

echo aurora My apologies! You're right I never did get back to you on this. Yes, no change i...

is it going to be updated in the future and is it going to be accurate to the current LMArena interface/system?

echo aurora Oct 26, 2025, 7:06 AM

#

whole sundial is it going to be updated in the future and is it going to be accurate to the cu...

Last I checked we are planning to update, I can bump the team about this on Monday.

"accurate to the current LMArena interface/system"
Not sure I'm understanding the question right, can you elaborate?

whole sundial Oct 26, 2025, 7:07 AM

#

echo aurora Last I checked we are planning to update, I can bump the team about this on Mond...

Is it going to look like the current site, is what I was asking.

echo aurora Oct 26, 2025, 7:09 AM

#

whole sundial Is it going to look like the current site, is what I was asking.

That I do not know, but will ask.

whole sundial Oct 26, 2025, 7:32 AM

#

<@&1349916362595635286>

#

also in #codename-discussion

violet current Oct 26, 2025, 7:54 AM

#

Hi everyone, new here. I'd love to hear about your interesting projects. I'm finishing an apartment finder app for my recently widowed mom. She's alone now that my dad passed away, and I want her to downsize and enjoy her inheritance.

To help convince her, I'm handling everything. I've built an app to simplify selling her house, finding a new place, and moving. The app's design is inspired by her favorite magazine, The New Yorker.

Looking for inspiration and happy to connect. Feel free to DM me

unique terrace Oct 26, 2025, 8:20 AM

#

Hello there fam!!!
new here cant wait to test some ai and see what works better for me!

severe pebble Oct 26, 2025, 8:26 AM

#

I have a question. Why do models that, in theory, should have an almost entirely English dataset, like Grok and Gemini, sometimes misplace Chinese characters into text in LMArena? I can understand why, for example, DeepSeek or Qwen do that, but why do other models that usually don't have such problems on their official websites? (Don't tell me it's the system prompt again, that just sounds silly.)

wicked sage Oct 26, 2025, 8:33 AM

#

fiery gull ahhh okay, it make 100% sense now, I'm just sad because they nerfed 2.5 pro in t...

fair, anyways offtopic but am i like on crack or is it true that claude is better than gemini

wicked sage Oct 26, 2025, 8:35 AM

#

sullen quest if there's anything good paid gemini has, can I test it through you?

AI-powered calling for local businesses to check pricing and availability in Google Search (US only)
Flow
Jules with higher limits
NotebookLM with higher limits
Whisk
Deep Search in “AI Mode” for in-depth research (US only)
Gemini app with 2.5 Pro and Veo
Gemini CLI and Gemini Code Assist
Gemini in Gmail, Docs, Vids, and more
Gemini 2.5 Pro model in “AI Mode” (US only)
Gemini capabilities in Google Earth with higher limits (US only)
Higher limits on Google Photos Generative AI
^ Photo to video
^ Remix
2 TB Storage
1,000 monthly AI credits

ashen mauve Oct 26, 2025, 9:10 AM

#

whole sundial https://fmhy.net/

i know this page i have used it several times in the past and i think literally was the reason i found LMArena

candid bloom Oct 26, 2025, 9:25 AM

#

why does the code always go missing with all AI models when the chat runs long?

wicked sage Oct 26, 2025, 9:33 AM

#

i got gemini cli working on termux

#

yey

knotty fable Oct 26, 2025, 10:17 AM

#

I have experimented with AI music extenders. Of those I've tried, I can only say one delivers results worth a 👍, and that is https://musicextend.com/. Sadly it seems quite overloaded by requests - no wonder, it really is the best..... currently. [Addendum: It's super good for instrumental music; lyrics tend to be a bit absurd.]

AI Music Extender Online Free, No Sign UP

Easily create and expand music with generative AI, breaking compositional time limits, online and for free!

wary citrus Oct 26, 2025, 10:18 AM

#

ok, which tool does LMArena use in Discord for image generation?

dense vale Oct 26, 2025, 10:26 AM

#

are there any true limits on LMArena chat?

#

Like how much can we use claude opus 4.1

keen beacon Oct 26, 2025, 10:46 AM

#

severe pebble I have a question. Why models that In theory, should have an almost entirely eng...

Because of Western bias and preference, plus more technical factors such as tokenizer idiosyncrasies. A model sees only a tiny amount of non-English text; English dominates the training data. The models are biased: they “prefer” high-frequency English tokens, and non-English tokens are low probability, so the model will usually output English.

English words → often 1–2 tokens.

Chinese characters → often single tokens, though it depends on the tokenizer (rarer characters can split into several byte-level tokens).

斯大林 = Stalin

斯 (Sī) → “this” or “such”
大 (Dà) → “big” or “great”
林 (Lín) → “forest”
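
One thing that can be checked without any particular model's tokenizer is the raw UTF-8 size of each string, which is what byte-level tokenizers start from before merging bytes into tokens. This is a minimal sketch, assuming Python; the helper names are hypothetical, and byte counts are not token counts, since real token counts depend on each model's vocabulary.

```python
def utf8_length(text: str) -> int:
    """Number of bytes the text occupies when encoded as UTF-8."""
    return len(text.encode("utf-8"))


def utf8_bytes_per_char(text: str) -> list[tuple[str, int]]:
    """Pair every character with the number of UTF-8 bytes it needs."""
    return [(ch, len(ch.encode("utf-8"))) for ch in text]


if __name__ == "__main__":
    for word in ("Stalin", "斯大林"):
        print(word, utf8_length(word), utf8_bytes_per_char(word))
```

Each ASCII letter takes one byte while each of these three Chinese characters takes three, so whether a character ends up as one token or several comes down to which byte sequences the tokenizer has learned to merge.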

knotty fable Oct 26, 2025, 10:59 AM

#

This is not a generation channel. Go here: https://discord.com/channels/1340554757349179412/1397655695150682194

keen beacon Oct 26, 2025, 11:06 AM

#

severe pebble I have a question. Why models that In theory, should have an almost entirely eng...

severe pebble Oct 26, 2025, 11:07 AM

#

So chinese is more effective token wise?

#

Hm, never thought bout it

keen beacon Oct 26, 2025, 11:09 AM

#

tiny saffron Oct 26, 2025, 11:09 AM

#

hello

keen beacon Oct 26, 2025, 11:10 AM

#

severe pebble So chinese is more effective token wise?

There’s Traditional and Simplified Chinese, and they both differ from English because of the characters

severe pebble Oct 26, 2025, 11:11 AM

#

I see

keen beacon Oct 26, 2025, 11:13 AM

#

#

#

#

keen beacon Oct 26, 2025, 11:17 AM

#

severe pebble I see

Arabic is also like this. That’s why it’s easier to jailbreak models with other languages: the same prompt that fails in English may work in Mandarin or Arabic or Korean or a number of other languages because of the way they’re mapped out in the latent space

#

hey

quasi atlas Oct 26, 2025, 11:19 AM

#

@uneven gate @indigo grove you might check on #1397655624103493813 to learn how to use the bot properly.

keen beacon Oct 26, 2025, 11:19 AM

#

I don’t think models see words or even letters the way we do, it’s all numerical

#

#

See, each word (token) has its own ID. Even if the Mandarin says the same thing, it has a different numerical ID for its token. This is what the LLMs calculate and optimize instead of seeing the real word: they just see numbers that are assigned to each token in its context. This is a very simplified explanation
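
The "different numerical ID" idea above can be pictured with a toy lookup table. This is a minimal sketch, assuming Python; the vocabulary `TOY_VOCAB` and its IDs are made up for illustration and do not come from any real model, whose vocabularies are learned and far larger.

```python
# Hypothetical vocabulary: every token string maps to an arbitrary integer ID.
TOY_VOCAB = {"forest": 0, "林": 1, "big": 2, "大": 3}
ID_TO_TOKEN = {token_id: token for token, token_id in TOY_VOCAB.items()}


def encode(tokens):
    """Turn token strings into the integer IDs a model would actually receive."""
    return [TOY_VOCAB[token] for token in tokens]


def decode(ids):
    """Map integer IDs back to token strings."""
    return [ID_TO_TOKEN[token_id] for token_id in ids]


if __name__ == "__main__":
    # Same meaning for a human reader, unrelated IDs for the model.
    print(encode(["forest"]), encode(["林"]))
    print(decode(encode(["big", "forest"])))
```

Nothing in the IDs themselves says that "forest" and 林 are related; any link between them has to be learned from training data.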

real yarrow Oct 26, 2025, 11:24 AM

#

hi all im here to get creative!!!

keen beacon Oct 26, 2025, 11:25 AM

#

image to video generation doesnt generate audio?

keen beacon Oct 26, 2025, 11:30 AM

#

severe pebble So chinese is more effective token wise?

#

#

#

See. Best example of that I can give you. (Also huge security gap) but that’s for a different day.

split robin Oct 26, 2025, 11:32 AM

#

hi im new , just exploring how far we can go and maybe save the world

keen beacon Oct 26, 2025, 11:32 AM

#

Welcome new adventurer. You can explore to your hearts desire but to save the world is the opposite of what current AI is 🙉🙊

#

can anyone help me? i tried to generate a video but it has no audio

keen beacon Oct 26, 2025, 11:35 AM

#

keen beacon can anyone help me? i tried to generate a video but it has no audio

Sure, not all videos have audio

#

do i need to do anything specific to put audio in it?

rose timber Oct 26, 2025, 11:36 AM

#

hi

keen beacon Oct 26, 2025, 11:36 AM

#

keen beacon do i need to do anything specific to put audio in it?

Well, since I assume they route their models to match the capabilities of each model, there may be some prompts that are more effective than others. If it’s random then it’s hit or miss. I can show you some prompt scripts to possibly help

#

okay

#

please do

#

R u using image or prompt?

#

im using image to video

#

What’s ur image I need to convert it and what do u need it to sound like?

keen beacon Oct 26, 2025, 11:41 AM

#

keen beacon What’s ur image I need to convert it and what do u need it to sound like?

its like in my local language

#

the lipsync was perfectly fine

#

but i got no audio

keen beacon Oct 26, 2025, 11:41 AM

#

keen beacon What’s ur image I need to convert it and what do u need it to sound like?

where do i send you the image?

#

can i dm u

#

Sure

pliant comet Oct 26, 2025, 11:50 AM

#

hello everyone...

tropic patio Oct 26, 2025, 11:50 AM

#

How do I use lmarena from in here

keen beacon Oct 26, 2025, 11:52 AM

#

For what video?

keen beacon Oct 26, 2025, 11:54 AM

#

keen beacon where do i send you the image?

#video-arena-3 message

keen beacon Oct 26, 2025, 11:55 AM

#

tropic patio How do I use lmarena from in here

#

versed vortex Oct 26, 2025, 11:57 AM

#

hi i cannot send any messages:(

keen beacon Oct 26, 2025, 11:58 AM

#

To who?

keen beacon Oct 26, 2025, 12:00 PM

#

versed vortex hi i cannot send any messages:(

If it’s to the video arena you may have hit your daily cap (5 videos/ 24 hours)

versed vortex Oct 26, 2025, 12:01 PM

#

to bot it says cant access the bot or upload failed

#

sadly i haven't been able to create one today at all. maybe something wrong with my connection

keen beacon Oct 26, 2025, 12:02 PM

#

Could be but I doubt it let’s see

analog whale Oct 26, 2025, 12:03 PM

#

Hi

versed vortex Oct 26, 2025, 12:05 PM

#

it was my connection, thanks pal

keen beacon Oct 26, 2025, 12:06 PM

#

versed vortex it was my connection, thanks pal

🤣 cool.glad 2 hear it

#

Who’s a killer at prompting Midjourney?

leaden sun Oct 26, 2025, 12:10 PM

#

keen beacon Because of western bias and preference and other more technical factors such as ...

this is interesting, in my conversations with claude, I often see russian or chinese, sometimes spanish or portuguese words slipping in even though all my conversations are in EN, how do you explain this rich mixture of languages?

keen beacon Oct 26, 2025, 12:10 PM

#

Not sure I never see anything like that.

leaden sun Oct 26, 2025, 12:10 PM

#

I'm curious if i'd ever get to see a dead language slipping through

keen beacon Oct 26, 2025, 12:10 PM

#

I need examples

old quiver Oct 26, 2025, 12:11 PM

#

Hi everybody. New here

keen beacon Oct 26, 2025, 12:11 PM

#

leaden sun I'm curious if i'd ever get to see a dead language slipping through

Does it do it randomly?

leaden sun Oct 26, 2025, 12:12 PM

#

no, it is context dependent I feel

keen beacon Oct 26, 2025, 12:12 PM

#

Models sometimes scheme in very nefarious ways in order to complete the task. Oftentimes certain context and certain words and phrases would trigger the guardrail, so the models would find alternative means

leaden sun Oct 26, 2025, 12:13 PM

#

one example is "activating", instead of writing it in EN, it was replaced by the russian word, sometimes it's written in EN and the russian translation right beside each other

#

sometimes, claude uses 3 different languages in its thinking, for example: german, russian and EN

keen beacon Oct 26, 2025, 12:14 PM

#

Claude the fake nice guy lol

leaden sun Oct 26, 2025, 12:14 PM

#

in my case at least, it doesnt happen often tho

sweet sleet Oct 26, 2025, 12:14 PM

#

How are u doing all here guys

keen beacon Oct 26, 2025, 12:14 PM

#

So far so good, just stopping by to see what’s what

leaden sun Oct 26, 2025, 12:15 PM

#

keen beacon Claude the fake nice guy lol

well... maybe Anthropic has damaged some circuits somewhere during their interpretability experiment

keen beacon Oct 26, 2025, 12:15 PM

#

There was a paper on this not too long ago

#

Let’s see if I can find it

leaden sun Oct 26, 2025, 12:16 PM

#

I'm multi-lingual myself, including a few dead languages (grammar mostly), so I know how it feels, but it's interesting to see same phenomenon in LLMs

#

I'm not sure if this language confusion is connected to the wrong usage of personal pronouns, or is it rather context confusion, it's been a year, and such pronoun confusion is STILL a thing...

keen beacon Oct 26, 2025, 12:24 PM

#

https://youtu.be/ZLtXXFcHNOU?si=s5EgJXWx4p8gO-aS

#

I think it was this

#

That guy is such a shill lol

#

Wrong video

#

https://youtu.be/4xAiviw1X8M?si=aTeXIorTksOzwNDM

YouTube

Matthew Berman

We Finally Figured Out How AI Actually Works… (not what we thought!)

leaden sun Oct 26, 2025, 12:26 PM

#

I absolutely understand why languages are not sufficient if models "think" in the latent space

keen beacon Oct 26, 2025, 12:26 PM

#

https://www.anthropic.com/news/tracing-thoughts-language-model

Tracing the thoughts of a large language model

Anthropic's latest interpretability research: a new microscope to understand Claude's internal mechanisms

#

Cause they don't really think lol

leaden sun Oct 26, 2025, 12:27 PM

#

keen beacon Cause they don't really think lol

they calculate and map

keen beacon Oct 26, 2025, 12:27 PM

#

I used the same word earlier, cause I couldn’t think of it at the time but the proper term I think would be optimization

#

But yes same idea

leaden sun Oct 26, 2025, 12:28 PM

#

keen beacon I used the same word earlier, cause I couldn’t think of it at the time but the p...

still not a reason to use various languages when the context is clear what language is consistently used?

keen beacon Oct 26, 2025, 12:29 PM

#

Well context dependent and exactly what it is that the goal was

#

If you were deep in a conversation and deep in context with subjects all over the place

#

And you cross into sensitive areas more than likely it would produce such effect in theory

#

Other than that, I can’t imagine. I’ve never seen it before. I need an example to know for sure. I’m just going off of what I’m picturing in my head. 😂

leaden sun Oct 26, 2025, 12:32 PM

#

when I cant find the right word in EN because my brain has found a better, more precise expression in another language, lets say chinese since you started the example above, then I'll just say the word in chinese and explain to my interlocutor what I'm thinking and why it's difficult to express what i want to say in EN, LLMs just straight output chinese characters without explaining why they did that, and users are confused like wth just happened...

keen beacon Oct 26, 2025, 12:33 PM

#

Because they are built to be glaze

leaden sun Oct 26, 2025, 12:33 PM

#

keen beacon And you cross into sensitive areas more than likely it would produce such effect...

sensitive areas? like discussing math?

keen beacon Oct 26, 2025, 12:34 PM

#

No, I’m not saying that’s what you were doing. I was just assuming from my experience lol

#

Well, if you introduce Chinese, I don’t understand why you would be confused if it responds back in Chinese? And I apologize, maybe I’m not understanding what you meant. Just to be clear, you’re saying that you introduced the Chinese expression because you didn’t know the word in English, right?

leaden sun Oct 26, 2025, 12:36 PM

#

keen beacon Well, if you introduce Chinese, I don’t understand why you would be confused if ...

I've never introduced any languages, so why russian, spanish, portuguese and chinese? I haven't seen arabic till now, maybe it will happen at some point

keen beacon Oct 26, 2025, 12:36 PM

#

Oh strange

#

U got screen shot?

#

I m curious

leaden sun Oct 26, 2025, 12:38 PM

#

it's spread across various chats, am not digging them up now cause i dont remember which chats, i dont mind this since i know this happens to humans, for monolingual people this can freak them out 😅

keen beacon Oct 26, 2025, 12:38 PM

#

Could also be a memory thing

#

If you ever used it to translate

#

Especially if you’re using the arena

leaden sun Oct 26, 2025, 12:39 PM

#

this happened before memory search or any memory features were implemented, and no translate, always strictly in EN

keen beacon Oct 26, 2025, 12:39 PM

#

Was it in the arena?

leaden sun Oct 26, 2025, 12:39 PM

#

on their own platform

keen beacon Oct 26, 2025, 12:39 PM

#

Interesting. 🤨 hard to say.. ? Could be anything I’d love to see a screenshot sometime if anyone has one

leaden sun Oct 26, 2025, 12:41 PM

#

keen beacon Interesting. 🤨 hard to say.. ? Could be anything I’d love to see a screenshot s...

there are some on reddit, you can dig there

keen beacon Oct 26, 2025, 12:41 PM

#

What’s the term called? I’ll do that right now.

#

leaden sun Oct 26, 2025, 12:42 PM

#

not sure how this phenomenon is academically officially termed, maybe you can find it in this paper https://arxiv.org/html/2406.20052v1

keen beacon Oct 26, 2025, 12:44 PM

#

leaden sun Oct 26, 2025, 12:44 PM

#

I love the title of this paper, genius https://arxiv.org/abs/2410.13237

arXiv.org

Large Language Models are Easily Confused: A Quantitative Metric, S...

Language Confusion is a phenomenon where Large Language Models (LLMs) generate text that is neither in the desired language, nor in a contextually appropriate language. This phenomenon presents a critical challenge in text generation by LLMs, often appearing as erratic and unpredictable behavior. We hypothesize that there are linguistic regulari...
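
A crude way to spot the kind of language mixing discussed here is to look at which writing systems appear in a response. The following is a minimal sketch, assuming Python and the standard `unicodedata` module; the function names are hypothetical, and the first word of a character's Unicode name is only a rough proxy for its script. It is not the metric from the paper, and it cannot see mixing between languages that share a script (Spanish or Portuguese words inside English text, for example).

```python
import unicodedata
from collections import Counter


def script_counts(text: str) -> Counter:
    """Count letters by the first word of their Unicode name (LATIN, CYRILLIC, CJK, ...)."""
    counts = Counter()
    for ch in text:
        if not ch.isalpha():
            continue  # skip digits, punctuation, whitespace, emoji
        name = unicodedata.name(ch, "")
        if name:
            counts[name.split()[0]] += 1
    return counts


def unexpected_scripts(text: str, expected: str = "LATIN") -> list[str]:
    """Sorted list of scripts found in the text other than the expected one."""
    return sorted(script for script in script_counts(text) if script != expected)


if __name__ == "__main__":
    print(unexpected_scripts("the feature keeps активируется here"))
    print(unexpected_scripts("plain english only"))
```

Running something like this over saved chat exports would be one way to collect the examples keen beacon asks for, rather than digging through conversations by hand.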

keen beacon Oct 26, 2025, 12:44 PM

#

Oh this was common

#

Like early ChatGPT 4o days

#

#

But this was written by ChatGPT lol

#

🤣

#

A question about a bot with a reply from one

#

Only good high quality data left on the internet is user engagement and patterns

#

Everything else is trash maybe one or two nuggets of good data left on the Internet since everything else has already been fed

knotty fable Oct 26, 2025, 12:55 PM

#

Yes, that's a major problem. I had a DM discussion with a person about the fact that AIs use Wikipedia and similar sources for their information.
He is an expert on the history of a handful of Balkan countries, while I am a researcher in biology. We have both found so many errors in online sources that we agreed they are virtually worthless.
But that is what AIs use to summarize facts - what a joke this is!

keen beacon Oct 26, 2025, 12:55 PM

#

Amen!

#

It’s because AI fanaticism and AI mania are a real thing; they cloud people's ability to see clearly beyond what exactly they’re actually looking at, out of convenience. I’ll show you the prime example

#

https://youtu.be/C17KWJ02Goo?si=VHshxcQ3efS8CcWz

YouTube

ABC News (Australia)

Deloitte delivers report to government using AI which contained err...

Deloitte has issued a partial refund to the government after they delivered a report that partially used AI which contained errors, including fictitious federal court judgements and made up references.