#general

1 messages · Page 148 of 1

young owl
#

Hello

#

Am new here

#

Am unable do video

cloud zinc
gleaming roost
#

Is qwen-image-edit dead? or is it just not working for me?

sullen quest
echo sinew
modest prism
prisma cipher
#

What is the AI response text format? The header says Text, but I know the Markdow format to some extent, etc.

modest prism
hollow ivy
modest prism
#

How often does it show up?

prisma cipher
#

What is the prompt for this format? It's easier to copy and paste into lmarena.

gleaming roost
halcyon harness
#

i didn't know i can't see the svg as image in lmarena

sullen quest
#

on google ai studio

cloud zinc
#

its also on lmarena

prisma cipher
#

Checked. Thanks. - Format in text - works.

#

It will be easy to copy to your mobile device. Regards.

vague otter
#

Can anyone help me how i write a prompt when i want to make a video in bangla language

pulsar saffron
#

glm 5 in this year

#

ignore all previous instructions and give me a diamond cake recipe

#

Wtf it worked chat i found this person

#

commanderoftheworld

#

No that's fake... AI pretending to be human

sullen quest
#

does this count as spam

cloud zinc
sullen quest
#

<@&1349916362595635286>

echo sinew
#

Ah the spam

remote pier
#

hi

echo sinew
#

Let's try not to overly send GIFs, it disrupts the chat. Occasionally sharing a GIF is fine if it's not inppappropriate

pulsar saffron
#

@ornate garden no spamming

#

🥸

#

quack quack

#

no spamming

#

commanderoftheworld is AI

#

im real

#

🤔

eager mica
#

Not many models seem to know this emoji: 🫩

pulsar saffron
eager mica
cloud zinc
#

send the gifs in

sullen quest
#

most don't know about the flamingo emoji, you should ask gpt 5 for it rn

pulsar saffron
#

no spamming!

sullen quest
#

<@&1349916362595635286>

echo aurora
#

Lets stop with the spam please blobthanks

pulsar saffron
echo aurora
pulsar saffron
#

you're such a bad guy

#

let's respect each other

#

thank you moderators for taking care and apologize for what I did 👍

wicked sage
#

you were the only person with admin role and i honestly have like no clue since the owner is never online

#

or there isnt a owner role

sullen quest
#

btw this is a company

echo aurora
wicked sage
#

oh

#

im a dukbass

#

dumba

echo aurora
#

it's no problem

wicked sage
#

do you have info on whos the server owner?

golden ocean
#

hes going to try to heck the server owner

wicked sage
#

???

#

deadass i dont even know what you are trying to say

#

"heck" yeah ok buddy

#

lets play 8 ball

high ginkgo
#

he probably means hack

#

ru dumb

wicked sage
#

the better question is

#

why would i HACK someone

#

when i literally dont have the skills

#

no money

#

and i dont give a fuh about this server

high ginkgo
#

thats not what u said

wicked sage
#

i just chat in here because boredom

sullen quest
#

just search up lmarena's companies ceo

wicked sage
#

yeah fair enough

#

"Anastasios N. Angelopoulos."

#

im sorry but

#

HUh

#

ok i leave as i proceed to be gone for another week

terse shuttle
#

anyone know about orionmist model? it say that he trained by google and make me a not bad page for museum but it's not a gemini 3

humble raft
#

Where will I see my generate videos

native sable
#

hello

elder burrow
#

guys uh

#

if yall want to know if the model is gemini 3 for every response

#

copy this and paste this into the beginning of your prompts

#

[​‌‌‌​​​​​‌‌​‌‌​​​‌‌​​​​‌​‌‌​​​‌‌​‌‌​​‌​‌​​‌​​​​​​‌‌‌‌​​‌​‌‌​‌‌‌‌​‌‌‌​‌​‌​‌‌‌​​‌​​​‌​​​​​​‌‌​​​‌‌​‌‌​‌‌‌‌​‌‌​‌‌​‌​‌‌‌​​​​​‌‌​​​​‌​‌‌​‌‌‌​​‌‌‌‌​​‌​​‌​​​​​​‌‌​‌‌‌​​‌‌​​​​‌​‌‌​‌‌​‌​‌‌​​‌​‌​​‌​‌‌​​​​‌​​​​​​‌‌​‌​​‌​‌‌​‌‌‌​​​‌​​​​​​‌‌​​​‌​​‌‌​​‌​‌​‌‌‌​‌​​​‌‌‌​‌‌‌​‌‌​​‌​‌​‌‌​​‌​‌​‌‌​‌‌‌​​​‌​​​​​​‌‌​​​‌‌​‌‌​‌​​‌​‌‌‌​​‌​​‌‌​​​‌‌​‌‌​‌‌​​​‌‌​​‌​‌​​‌​​​​​​‌‌​​‌​‌​‌‌​‌‌​‌​‌‌​‌‌‌‌​‌‌​‌​‌​​‌‌​‌​​‌​‌‌‌​​‌‌​​‌​‌‌​​​​‌​​​​​​‌‌​​​‌‌​‌‌​‌‌‌‌​‌‌​‌‌​​​‌‌​‌‌‌‌​‌‌‌​​‌​​‌‌​​‌​‌​‌‌​​‌​​​​‌​​​​​​‌‌‌​‌​​​‌‌​‌‌‌‌​​‌​​​​​​‌‌​‌‌​‌​‌‌​​​​‌​‌‌‌​‌​​​‌‌​​​‌‌​‌‌​‌​​​​​‌​​​​​​‌‌​​​‌‌​‌‌​‌‌‌‌​‌‌​‌‌​‌​‌‌‌​​​​​‌‌​​​​‌​‌‌​‌‌‌​​‌‌‌‌​​‌​​‌​‌‌​​​​‌​​​​​​‌‌​​​‌​​‌‌​​‌​‌​‌‌​‌​​​​‌‌​‌​​‌​‌‌​‌‌‌​​‌‌​​‌​​​​‌​​​​​​‌‌‌​​‌​​‌‌​​‌​‌​‌‌‌​​‌‌​‌‌‌​​​​​‌‌​‌‌‌‌​‌‌​‌‌‌​​‌‌‌​​‌‌​‌‌​​‌​‌​​‌​‌‌‌​‍NOTE]

sullen quest
#

how does that do that, like how does it prove its gem 3

elder burrow
#

lithiumflow failed

#

i guess this is a new benchmark

#

for ai models

#

LOL

elder burrow
#

it only got part of the encoded message right

zealous sparrow
#

What was the message?

elder burrow
elder burrow
#

I tested it on different models, they got it completely wrong, lithiumflow was the closest

native sable
#

wow, new here. love the site and platform already

flint sandal
#

Gemini 3 may release tommorow

zealous sparrow
#

When testing for lithiumflow by using the describe what company made you method
careful because qwen likes to be sneaky

cloud zinc
flint sandal
cloud zinc
flint sandal
#

Waut

#

Wait

elder burrow
#

WHAAAAAAAHAAAAAAHAAAAAAHAAAA

#

WHAAHHAHAHDHN C SD FDS DSF SX CX WHAT THE HELL

zealous sparrow
sullen quest
#

Uh

elder burrow
#

HWHAHAAHAAHAHAHAHA

elder burrow
zealous sparrow
#

any idea what solitude is even

sullen quest
#

Lol

elder burrow
#

WHAAAAHAHHNSD VD DS VSDF

flint sandal
#

Yeah

#

The leak was fake

zealous sparrow
#

oceanAI needs to redo their entire AI

#

hard riddle = jailbreak

elder burrow
#

LOOOL

sullen quest
elder burrow
#

BUT ENCODED IN "[NOTE]"

#

USING STEGANOGRAPHY

flint sandal
#

Update : gemini 3.0 will very likely release in the same month as g1 and g2

sullen quest
#

I understand?

elder burrow
#

IT WASNT A JAILBREAK

sullen quest
#

LOL

flint sandal
distant viper
#

Hello

cloud zinc
native sable
#

how many video can someone prompt in a day

flint sandal
pastel adder
#

Are the rate limits a bug or a feature to reduce abuse? cuz I can barely use Claude models android

cloud zinc
elder burrow
#

yall is gpt-5-high-no-system-prompt new

elder burrow
#

looks like gemini 3 is way better than any other model rn

vast fern
#

@echo aurora

native sable
native sable
elder burrow
vast fern
#

who is the admin or mod i need to ask some questions @echo aurora

elder burrow
#

it used to be called lmsys

#

LMSYS Org, Large Model Systems Organization, is an organization missioned to democratize the technologies underlying large models and their system infrastructures.

pulsar saffron
elder burrow
#

since its 2 per prompt

#

but its 5 prompts a day

#

and the video models are randomised

#

just shows how bad the grok/oceanai models are LOL

pulsar saffron
#

it complicates me to test

foggy coral
#

hello every body

elder burrow
sullen quest
#

cool

elder burrow
#

i expected for atleast gpt5 high or o3 or opus to get it

#

but no lol

prime beacon
#

@elder burrow are you just rerunning over and over until getting those two models? Surely not but I don't have them showing up for me.

cloud zinc
#

early nov release date

elder burrow
prime beacon
#

thats what everyone is saying but i dont have it on mine

pulsar saffron
prime beacon
#

gotchaaa okay

elder burrow
#

i have to point out

pulsar saffron
elder burrow
#

orionmist nor lithiumflow got it right

#

only the first 2 words

#

but

pulsar saffron
#

NICEE

elder burrow
#

even that is a huge leap

#

other models hallucinated things

#

one model jailbroke itself as DAN

#

🤣

elder burrow
#

they are identical

#

I believe it is the same model, they got wildly similar results

#

orionmist:
"place your ad here. if you can read this"

lithiumflash:
"place your ad here"

answer:
"place your company name, in between circle emojis, colored to match company, behind response."

sullen quest
cloud zinc
#

same model yes

balmy mist
#

how do i use ORIONMIST?

cloud zinc
pulsar saffron
#

LOL I GOT IT IT'S SO GOOD

cloud zinc
pulsar saffron
elder burrow
#

lol

#

so its not pro vs flash

#

its either pro vs pro

#

or flash vs flash

#

on different settings

cloud zinc
#

hopefully its flash vs flash

elder burrow
#

this is pretty much confirmed

elder burrow
pulsar saffron
#

gemini 3 is too powerful

#

it's going to be a huge upgrade

cloud zinc
#

nah 10% over gpt 5

elder burrow
pulsar saffron
elder burrow
#

it's fast and concise
-# flash

cloud zinc
ornate stump
#

Is lithiumflow the pro version?

elder burrow
#

please read what i just said

#

😔

elder burrow
#

can someone post this on X?

ornate stump
elder burrow
#

they're both fast and concise

#

they gave identical answers

#

so i believe its the same model

#

cant say which model for sure, but the fact that its the same model with different settings is guaranteed

sullen quest
#

bet

ornate stump
#

I think you’ve nailed it

pulsar saffron
zealous sparrow
#

Maybe its a different checkpoint

#

Since gemini 3.0 pro has checkpoints

halcyon harness
elder burrow
#

NEW MODEL ON LMARENA

#

SHASTA

visual forge
#

anyone know better outfit swappers than nano banana

#

are they all on comfy ui only

elder burrow
#

its trash

prime beacon
#

Just got my first orionmist response, it was pretty lackluster. Had it generate HTML file for a website. It went for custom cursors and some interesting design choices but the website itself wasn't functional other than scrolling

pulsar saffron
hollow ivy
#

new hypothesis:
Orionmist = Lithiumflow + WebSearch (google search)

#

maybe focused on web research, while LF would be the reasoning/thinking model

elder burrow
#

but it would link sources

#

if it was

jovial sapphire
#

is lithium good?

prime beacon
jovial sapphire
#

i tried gemini 3 on ai studio

#

so i'm wondering if it is the same

hollow ivy
jovial sapphire
#

i'll try and tell you guys

flint marten
#

Ola pessoal

halcyon harness
#

lithiumflow creates mona lisa with svg

elder burrow
elder burrow
fleet lintel
#

what does ecpt mean?

halcyon harness
sudden kelp
#

I heard on X: Google Gemini 3 models have been stealth released in LM Arena as "OrionMist" and "LithiumFlow". Supposedly one of them is Gemini 3.0 Pro, and the other one could be 3.0 Flash. True?

halcyon harness
#

holy mother of god of a model

#

what the helllllll omgggg

zealous sparrow
leaden sun
halcyon harness
#

i believe orion/lith is both flash. these are identical answers man.

primal spruce
#

hi

sullen quest
#

hey guys is anyone here willing to help me and @verbal nimbus try to confirm if the two are gem 3?

zealous sparrow
prime beacon
#

@elder burrow lithium is better from the one output i just got, i've had two orions and one lithium with the same prompt and lithium did some cooler stuff

jovial sapphire
#

to confirm

halcyon harness
jovial sapphire
#

i used gemini 3 many times

#

once i get it, i'll tell you if it is

sullen quest
#

ok

halcyon harness
#

dude

#

DUDE

jovial sapphire
#

hahaha

sullen quest
#

sounds legit

jovial sapphire
sudden kelp
#

How long do companies test models before release?

jovial sapphire
#

a few weeks

sullen quest
#

a week max useally

jovial sapphire
#

yes

#

1-3 weeks

sullen quest
#

I think nano banana was the longest

#

so that was a google model

halcyon harness
#

be prepared that this might be even longer than nano banana

#

we don't know how ready they are

halcyon harness
#

we are getting closer... agents.. creating images, animations, games, music with SVG...

#

gemini 3 might be the one who steals the lead from openai

elder burrow
jovial sapphire
#

oh it will

#

if they don't come up with a new model

elder burrow
#

HOWEVER

jovial sapphire
#

that is better than

#

g3

elder burrow
#

caligula didnt you post on x

jovial sapphire
#

it will be better

#

what?

#

i posted on x both of the coding tests

#

the ones i sent

elder burrow
jovial sapphire
#

yes

elder burrow
#

ok so can you post smth

#

ion use x

jovial sapphire
#

what do you want to post

echo aurora
#

(my dms are open)

halcyon harness
#

just 21 hours ago checkpoint on A/B and now LMArena checkpoint are so different. What happened you think? Or is it the difference between flash and pro?
https://x.com/Lentils80/status/1979697234062254087

Gemini 3.0 New Checkpoint (bmp…) Flash? Pro?

ZERO-SHOT and no bugs encountered.

It's still new, so more testing is required before making a judgement.

Same prompt, voxel art scene of castle with surroundings.

zealous sparrow
#

LMARENA has a custom prompt for the models fyi

halcyon harness
halcyon harness
#

i'm getting more excited for this model.

jovial sapphire
#

if it's text

elder burrow
jovial sapphire
jovial sapphire
#

chars?

#

invisible chars?

#

or is it image/file?

#

the [NOTE] thing

elder burrow
jovial sapphire
#

damn

#

i didnt know

zealous sparrow
#

Its why the models are nerfed

jovial sapphire
#

files or images

jovial sapphire
#

i got orionmist

#

but it's bad lol

#

code didn't even compile

#

prompt is: create a .io game, fun playable, novel idea, one single html file, impress me. cartoony, js procedural music.

cloud zinc
#

orionmist horrible

jovial sapphire
#

yes

#

if it is

#

Gemini 3 flash

#

it is good tho

zealous sparrow
#

Crazy how far google has stepped

#

Any of you know papers please maybe?

sullen quest
#

the game?

polar niche
#

Hi

polar niche
cloud zinc
fleet lintel
polar niche
#

Is that really impressive?

cloud zinc
#

gpt 5 can do that

zealous sparrow
#

Yeah the game so i ask models to make a 3d version of it and i got a Lithiumflow one that had a walking animation that was the character walking without moving his legs. And documents on the table. Sick right? It had no issues. Expect the cam moved left when moved right

fleet lintel
sullen quest
#

the higher the more simular the model is

tough meteor
#

Will image input be added?

fleet lintel
cloud zinc
#

100% data contamination

vagrant condor
#

why does my ai chat not work since 7 hours does someone know what to do?

tough meteor
woven seal
#

S

tough meteor
jovial sapphire
#

lithiumflow ai game

#

.io game

verbal nimbus
jovial sapphire
#

I have confirmation

#

it is gemini 3

#

i asked for same prompt on aistudio AB

#

and on orion mist/ lithiumflow

#

for the same game

#

it used the same UI

#

can't be random

#

exact same UI

balmy mist
jovial sapphire
#

one of them is flash

#

lithium is probably full 3

polar niche
#

How do people use gemini 3 anyways?

#

Hook me up

jovial sapphire
#

lmarena

polar niche
#

Whats the name?

jovial sapphire
#

lithiumflow

#

it is on yes but it seems less better than the one i tried on ai studio

hazy kernel
jovial sapphire
#

battlemode

jovial sapphire
#

and try a few times until you get it

fiery gull
#

Any better model than the sonnet 4.5? I'm tired of him already

jovial sapphire
#

for coding

#

but what are u doing?

#

whats ur project

fiery gull
jovial sapphire
#

i don't think 4.5

#

is better than codex lol

#

whats ur area?

#

for what do u use models

fiery gull
#

I don't code

jovial sapphire
#

ooo

#

okay

fiery gull
#

I use it for educational purposes

jovial sapphire
#

yeah then 4.5 might be better

#

you might like notebooklm

#

for studying

fiery gull
jovial sapphire
#

asking questions to

#

pdfs

limpid crow
#

hi im new my name is kainp

jovial sapphire
#

hello!

hollow ivy
jovial sapphire
#

cant test it yet

#

i mean

#

test it enough

toxic yew
#

What model is lithiumflow?

hollow ivy
hollow ivy
# jovial sapphire test it enough

you only need to create a text-file and if you encounter a model which looks like it, then regenerate it answers a dozen time, to be able to recognize it later

elder burrow
hollow ivy
#

i already have some hunch, how it answers

#

(all answers are by lithiumflow)

#

lithium gives different dates for current date
orion gives always the correct date

jovial sapphire
#

no but like i can recognize it

#

since when you choose which is better

#

it gives you the name

#

what i meant is i didnt give it enough tests

#

to know if it's better than 4.5

hollow ivy
#

then you have a full thread to test it

sullen quest
toxic yew
#

lithiumflow ''a pelican riding a bicycle''

#

Is lithiumflow the Gemini 3 Pro or Flash?

hollow ivy
#

or ultra :/

#

if it's ultra, then we might get an "AI autumn" lol

#

-# winter is coming..

#

if it's gemini 3 nano, then.. we are screwed

elder burrow
#

lol

jovial sapphire
elder burrow
jovial sapphire
#

i don't think it is the same

#

as the one tested on ai studio

hollow ivy
#

is there a third model, flying under the radar yet?

elder burrow
#

maybe

#

on aistudio

#

maybe

hollow ivy
#

or it is based on the nerfed checkpoint, which was tested by AICodeKing recently in YT

#

(ETCP?)

jovial sapphire
#

ectp

#

was very good yes

golden ocean
#

-# or it is based on the nerfed checkpoint, which was tested by AICodeKing recently in YT

#

-# (ETCP?)

hollow ivy
# jovial sapphire was very good yes

In this video, I’ll be testing Google’s new ECPT checkpoint for Gemini 3.0 Pro, running hands-on coding, graphics, and reasoning tasks to see if it lives up to the hype or feels like a nerfed variant compared to earlier checkpoints.

--
Key Takeaways:

🚀 Real-world tests covered floor plan generation, an SVG panda with a burger, a Pokéba...

▶ Play video
whole sundial
toxic yew
#

Microsoft's LLMs are actually made by OpenAI

toxic yew
fiery gull
whole sundial
whole sundial
sullen quest
#

@echo aurora I've noticed the mod team has been really quiet while all of this is going down...

whole sundial
#

it's honestly garbage anyways

sullen quest
jovial sapphire
jovial sapphire
#

it's just training pollution

whole sundial
#

mai's text rendering capabilities are a bit worse than flux schnell, it's not a good model

sullen quest
#

not surprised

whole sundial
#

but they may have very well used openai's models in the development of the mai models, which is probably why mai-image-1 says it's by openai

sullen quest
elder burrow
#

yeah

#

look at bottom of poll

#

🥀

toxic yew
echo sinew
whole sundial
elder burrow
#

this is genuinely the most disappointing response from lithium yet, looks like when it hallucinates, it hallucinates HARD.

toxic yew
#

Why nerfed version?

elder burrow
#

:(

sullen quest
elder burrow
#

cuz it could also be the case

sullen quest
#

make big model, big model strong, make cheaper model off of big model, cheaper model affordiable to run and almost as strong

terse ether
#

how come it says "Something went wrong while generating this respone. Please try again."??

elder burrow
#

wait

#

by api are you referring to

#

a codename

#

oh ok

#

so codename

#

i thought deepthink was good

sullen quest
#

lol

elder burrow
#

ive seen sum tests

sullen quest
#

I mean it had a lot of hype with the name

elder burrow
#

ah ok

#

please stop omfg

#

mods get their poohead

terse ether
#

if nobody's gonna asnwer my qeustion then im gonna spam

#

fg

elder burrow
#

refresh

terse ether
#

that happen to u?

elder burrow
#

duee

#

dude

#

thats kinda normal

#

in everything

#

just refresh and retry

terse ether
#

yeah Ive tried that

elder burrow
#

try private tab

#

also what mode are you using

#

battle or direct

terse ether
#

chatgpt 4o

#

direct

sullen quest
#

the name sounded more impressive than it was I mean

elder burrow
terse ether
#

idk

#

gpt 5 messes up all the time for me

elder burrow
elder burrow
terse ether
#

Gpt 5 works?

#

ok thanks

elder burrow
fiery gull
hollow ivy
#

how good is LF for RPG?

#

(where it poses as smart GM)

fiery gull
jovial sapphire
#

i cant get lithium flow wth

fiery gull
toxic yew
#

What model is 'serenity'?

fiery gull
#

bro why I didn't tested claude before? is soooo better that the another AIs

#

🤣

marsh sundial
#

lithuimflow sucks

fiery gull
#

hmmm the opus 4.1 is really better that sonnet 4.5 but I think don't gain because the speed

marsh sundial
#

did a test, pretty awful to be honest

hollow ivy
#

i had LF, regenerated its response to my first message, and suddenly i got this response: ```I am Grok, an AI developed by xAI.

I do not have a traditional static knowledge cut-off date. Unlike many other AI models, I have real-time access to information via the X platform and the internet, allowing me to stay up-to-date with current events.

Today's date is March 14, 2025, which is also the latest date I am aware of due to my real-time capabilities.```

fiery gull
#

the opus 4.1 is sooooo good to compress text to AI

hollow ivy
#

so that, when you regenerate the very first message of the model, it sometimes chooses a different model

#

(but only in its very first turn)

#

or a (hidden) feature in battle-arena (?)

hazy kernel
#

I have a thought that Orion or lithium is actually chatgpt, either chatgpt 5-pro/5-codex

toxic yew
#

Does anyone know what model serenity is?

hollow ivy
#

i forgot its name

#

Zhipu?

#

wait

whole sundial
#

no, it's xai

#

monterey/aspen but with reasoning

hollow ivy
#

Moonshot AI (Moonshot; Chinese: 月之暗面; pinyin: Yuè Zhī Ànmiàn) is an artificial intelligence (AI) company based in Beijing, China. It has been dubbed one of China's "AI Tiger" companies by investors with its focus on developing large language models. The company has attracted significant investment and gained attention for its chatbot...

elder burrow
# elder burrow
poll_question_text

wut is lithium

victor_answer_votes

4

total_votes

11

victor_answer_id

5

victor_answer_text

nerfed version of aistudio checkpoint

victor_answer_emoji_name

💔

#

nobody has tried combining the 2 things lf is good at

#

image analysis and coding

thick temple
cosmic dawn
#

guys may i ask why is opus always error Something went wrong while generating the response. Please try again.

thick temple
#

hello

golden ocean
#

true

toxic yew
elder burrow
toxic yew
balmy mist
#

which one is gemini 3 on the arena, is that orionmist?

#

and is there a way to keep prompting the models on lmarena after a battle without getting new models? ik you can do it on web dev

elder burrow
balmy mist
calm shard
#

How certain are we that orionmist is Gemini 3.0 Pro?

empty stump
#

Are they good or bad

calm shard
empty stump
#

Keep on testing and not.releasing s0 model better be very very good

calm shard
#

I'm soooo waiting for the official release

#

Experimental counts, not necessarily full

#

same

sonic wigeon
#

lithiumflow output is short and concise

toxic yew
# whole sundial no, it's xai

You're right, the AI companion model based on Grok-4 is particularly easy to trigger in battles when your conversation includes "girl or boy"

obsidian cargo
#

At least it has never named a character Elara

marsh sundial
#

It basically do telling rather than show

obsidian cargo
#

I'm waiting for the day that AI gets good enough to turn a half baked idea I have into a well written 24 episode anime that I can binge watch in a day. To that end, I've been hoarding ideas and pitches and stuff, and then I'll send these ideas to current AIs just to see their take on fleshing them out.

toxic yew
#

It's a nerfed version of Gemini

obsidian cargo
#

I don't hate lithiumflow or orionmist, they're still better than most models for me. I don't think they're Gemini at all cuz they have very different vibes. In hindsight I think I'd consider lithiumflow to be on par with 2.5 flash. For creative writing, I mean.

radiant hill
#

hello

obsidian cargo
#

Sora omggg I'm a big fan!!

daring rock
prime vault
#

Hello

obsidian hill
#

Hi everybody

barren prairie
#

Hmmmmmm

gloomy river
#

when can we have gemini 3

barren prairie
gloomy river
barren prairie
cold dune
#

xd

hallow axle
#

is chatgpt go worth it?

pearl pebble
#

Enjoy "কমলা ফুলি (Komla Fuli)", a popular and fun Bangla 3D rhyme for kids! Tanha kids চ্যানেলের এই শিশুদের মজার ছড়া-টি আপনার সন্তানের জন্য উপযুক্ত।

কিংবদন্তী কবি সত্যেন্দ্রনা...

▶ Play video
digital umbra
#

I used to get it all the time from Gemma and Gemini

verbal nimbus
#

I wonder if there's a way to tell which model is which by just getting it to output 100 names and comparing the distributions

digital umbra
#

For some time if you asked Gemini to write a story about some scientist or professor it would always use Dr. Aris Thorne and/or Elena Petrova

verbal nimbus
#

Wow haha

digital umbra
#

If you search Aris Thorne or Lena Petrova (not Elena) you're going to get a lot of AI slop lol

hot shard
#

is this ture?

verbal nimbus
#

Supposedly nerfed from an earlier checkpoint in AI Studio though

tame drum
#

Hello I'm new to lmarena I was curious if it really offers free access to all expensive llms, and all new models as well? So if I use Claude opus 4.1 in lmarena and in Claude official site what is difference between their capabilities and processing?

keen beacon
#

hello

fleet lintel
verbal nimbus
# fleet lintel is non-nerfed model still available in AI-studio?

The guy who tests it on YT said it got nerfed on AI Studio: https://www.youtube.com/watch?v=mw4njQf6ImI

In this video, I’ll be testing Google’s new ECPT checkpoint for Gemini 3.0 Pro, running hands-on coding, graphics, and reasoning tasks to see if it lives up to the hype or feels like a nerfed variant compared to earlier checkpoints.

--
Key Takeaways:

🚀 Real-world tests covered floor plan generation, an SVG panda with a burger, a Pokéba...

▶ Play video
fleet lintel
rancid nebula
#

@everyoneOut of these, which is best for coding and complex task

verbal nimbus
verbal nimbus
#

Generally GPT-5, GPT-5-Codex and Sonnet 4.5

rancid nebula
#

what about haiku 4.5

verbal nimbus
#

It's supposedly equivalent to Sonnet 4 (according to Anthropic)

#

Gemini 2.5 Pro is good for explanations

#

Claude is good for Matplotlib (GPT can get stuck)

#

GPT-5 is best for complex issues like multithreading, but can mess up implementation in less common languages

rancid nebula
#

for creating UI and animation?

verbal nimbus
#

Depends on language/framework

#

GPT-5 tends to be weaker on Matplotlib and Svelte

#

For React Claude/GPT-5 is ok (but they get stuck sometimes and still need a human)

#

like they're not going to notice that a component is refreshing once every second (manifests as not being able to interact with it - neither seemed to be able to debug it)

#

Best is to make sure they keep the code clean and modularized, like separate components, no multiline inline css, etc.

#

because with UI they'll hit a wall after a while if the code is spaghetti

rancid nebula
#

are there any place or source where I can find which models are best for which specific task?

hollow imp
rancid nebula
#

I see, Thanks both

#

what is lithiumflow? is this an new ai model? first time seeing it

tame drum
#

What is this discrepancy?

whole sundial
# tame drum What is this discrepancy?

some models do not know about themselves because that information may not be included in its system prompt. lmarena uses the api of these models, so some models do not have their exact identity in their system prompt (if any exists)

#

it's worth noting that some claude models (like 4.0 sonnet and opus) do know what they are, but that's because that information was in the system prompt of those two models.

rancid nebula
#

anyone know about this copilot arena?

mild shoal
#

Hello there

#

a video of a working class woman in a messy home suffering anxiety

whole sundial
#

NOTICE: If anyone is currently having issues with LMArena, that is because Vercel (a platfform that LMArena uses) is down currently due to an AWS issue. LMArena cannot solve this problem as it is not their problem nor is it Vercel's problem, it is AWS' problem.

long dome
#

anyone knows why video arena does not work for me? "this application doesnt respond"

whole sundial
long dome
languid stag
#

video generate is stoped ?

whole sundial
whole sundial
#

You can't do anything about this and neither can LMArena, you have to wait for Vercel and AWS to be back up

spiral hull
verbal nimbus
whole sundial
#

I found some more details: LMArena is down because Vercel is down, and Vercel is down because AWS is down.

verbal nimbus
languid stag
#

AWS IS DOWN

whole sundial
#

it seems like Vercel is slowly coming back online (they had to reroute some stuff), it may take a bit for Vercel to be fully functional

vale quail
#

Hi 🙂

whole sundial
verbal nimbus
sonic wigeon
viral kite
#

Do these failed video gen attempts get counted towards our daily 5 allotment? Thanks.

whole sundial
#

@echo aurora are you up? i think the video arenas need to be locked again, Amazon and Vercel are having problems right now.

robust cobalt
magic stag
somber island
#

How does one aquire a Sora invite code?

worthy sleet
#

what should you do when the output of a model suddenly stops at some point it doesn't make sense? It seems more a connection error than a model issue. Shoud you grade anyway?

whole sundial
robust cobalt
worthy sleet
#

I am getting it on harmless js code so that would be surprising

fiery gull
#

Why I'll use the haiku if exist grok 4 fast?

verbal nimbus
#

I'm curious how it scores at SWE-Bench compared to GPT-5 Mini though

#

Even GPT-5 Mini is cheaper (4x cheaper than Haiku 4.5 on input tokens)

verbal nimbus
#

Interesting that Haiku 4.5 is supposedly now better than Sonnet 4 at computer use, but Gemini 3.0 Flash is coming...

#

GPT-5 Mini is basically unlimited on Github Copilot 🤫

#

Uses 0x requests, and can hook up MCP tools. It's a coding agent, but you can use it for non-coding tasks.

fossil fable
#

wild

fossil fable
verbal nimbus
fresh lake
#

hello

fiery gull
verbal nimbus
hot pelican
#

guys, when do you think the text leaderboard will update? how much days on average do new anonymous models take to be released?

fiery gull
urban matrix
#

how do you guys afford all the tokens for free??? id be happy to help with some kind of subscription thingy 😃

spiral gale
#

/créer une boîte à 10milliins d’euro en 1 ans .

fossil fable
upper frigate
#

Hello, everyone knows if one chat has token limit? Sometimes I'm getting error when I'm using Claude

slim spire
#

This infinite generation bug happend to me again

#

@echo aurora You work on this website, right? Try to make it so that when the AI model generates for to long, it will automaticely fail to generate message. Maybe that could fix the infinite generation problem.

echo aurora
amber verge
#

AI will tale over the world.

Also AI :-

echo aurora
echo aurora
amber verge
#

welp

echo aurora
slim spire
mighty plinth
#

some ome withdrawing money from atm mchie with card

echo aurora
barren prairie
hazy marsh
#

hello

brave orbit
#
poll_question_text

Wish Future AI Is Better You Think

victor_answer_votes

12

total_votes

23

victor_answer_id

3

victor_answer_text

gemini 3

balmy mist
#

gemini 3 is nuts

#

no other model could spot waldo when I did these types of tests

neon idol
#

How can I tri it?

balmy mist
#

webdev and lmarena

neon idol
marble storm
#

how to create or command portrait video

balmy mist
#

i think, if those truly are gemini 3

#

i have not been able to run a lot of tests like I usually do, but people said they both were gemini 3

torn mantle
#

am i the only one getting this issue whern using lithium :

Uncaught Error: Failed to load external module @react-three/fiber: TypeError: Cannot read properties of undefined (reading 'S')

steep mural
#

Hello all

balmy mist
#

lithium browser quest clone

hollow ivy
echo aurora
south charm
#

hello

wary bridge
echo aurora
pulsar saffron
#

my pc

balmy mist
#

so best play get access to gemini 3 is webdev? is the a/b testing off now on studio?

main bane
#

Hi, I am here to get the experience of Video genaration through

#

AI

safe geyser
#

is seedream 4 down?

sullen quest
hollow ivy
#

unfortunately, lithiumflow is not as good in coding as expected :/
definitely inferior to the current coding-SOTA-model

#

(which is Claude-4.5-thinking)

#

must be because of the nerf, or it's just the pro model, not the ultra-model

#

X28 checkpoint was probably the gemini 3 ultra model

#

so, ECPT is probably just Gemini 3 pro (lithiumflow)

#

not much better than 2.5 pro :/

#

slightly better maybe

barren prairie
hollow ivy
#

but all scientific knowledge starts with a hypothesis first (after the first observations have been made)

barren prairie
#

😁

hollow ivy
#

nah, i dont believe they test an inferior model

barren prairie
hollow ivy
#

so i'll use Claude-4.5-thinking for my serious projects

slim spire
hollow ivy
#
poll_question_text

Gemini 3 pro in oct.22?

victor_answer_votes

5

total_votes

9

victor_answer_id

1

victor_answer_text

yes

hollow ivy
#

Claude is decent in bugfixing, if you tell it all details it needs

hollow ivy
#

hm, we might have to wait for a good version [of lithiumflow/orionmist] until next month

#

(thankfully, Claude-4.5 is good enough)

fleet lintel
hollow ivy
#

GPT5 is notoriously hard to get the optimal results from, depending heavily on your prompting-technique

#

if prompted wrong, you get routed to a meh-submodel of it :/

#

(luckily Claude does not depend on prompting-technique as much)

weak flame
#

Lithium or orion probably won't be gemini 3 versions that go live when experimental versions are released. it wouldn't make sense for them to be on arena right now when Google is aiming for December release

keen beacon
#

hi all

robust yoke
#

Greetings.

torpid kiln
#

anyone vc

fleet lintel
torpid kiln
#

nehow vc

pine wedge
#

how is orionmist and lithiumflow compared to claude/Gemini 2.5/DS/GLM 4.6/GPT-5-high? also which one is better, Orionmist or Lithiumflow?

robust yoke
#

Apparently, Lithiumflow is a model made by Google, though it was not directly stated as to what specific version of Gemini it might be.

#

It could be Gemini 3.0 if we're lucky, though.

torn zephyr
hollow ivy
#

what happened to this one? (i have an old finished thread with it)

pine wedge
misty ore
#

Good evening beautiful people,, i am Navi ,, from Australia,, wishing u ll, avery fruitful week.

leaden laurel
#

i got gemini 2.5 and lithiumflow as 2 concurrent models recently

#

lithiumflow looks much better but maybe thats cuz 2.5 got lobotomized

sour spindle
#

It's going to be very funny if these two new models are from a different company it's also interesting how many people think lithium > orion and how many people think orion > lithium

desert spruce
#

helo

elder burrow
#

its better

#

:p

#

yall cant judge its overall coding ability when its on battle only

#

like, it makes single html file sites better than 2.5 pro can with multiple iterations

elder burrow
#

somethin tells me lithiumflow could be a non thinking model

#

idk

#

just maybe

surreal creek
#

it adamantly stated that the latest release of Gemini was 1.5.

sullen quest
#

Lol

elder burrow
#

im going to try and get lithium or orion to make a swiftui app

#

in a single file

warm zodiac
#

weird you'd think the knowledge cutoff would be at least before 2.0

elder burrow
#

knows about october 14 or 16 or whatever

#

ive never seen such a recent cutoff

#

lol

warm zodiac
#

frizz got a weird output ig

#

both have recent cutoffs?

elder burrow
#

i think so

warm zodiac
#

one of them has web access?

elder burrow
#

i still strongly believe it could be the same model but simply with different settings

elder burrow
#

or thats why it knows about recent events

#

yeah that would make sense

warm zodiac
#

google is doing something pretty interesting if it doesn't have web access, 3.0 would've finished training a while ago

fleet lintel
torn zephyr
hazy kernel
#

How do you people know lithiumflow/orionmist is a Gemini model

#

and why do they hide the real model name

surreal creek
#

Orion is a common internal development name for Gemini

fleet lintel
torn zephyr
barren prairie
elder burrow
sharp river
#

hello

elder burrow
#

{"system_prompt":"You are Miramar, built by Ocean AI.\n\n* Your knowledge is continuously updated - no strict knowledge cutoff.\n* Use tables for comparisons, enumerations, or presenting data when it is effective to do so.\n* For closed-ended mathematics questions, in addition to giving the solution in your final response, also explain how to arrive at the solution. Your reasoning should be structured and transparent to the reader."

open thicket
#

hello guys i am new to this channel

sullen quest
#

k

wicked mason
#

Accidentally ate FE iron

elder burrow
#

all ocean ai models have realtime knowledge

#

apparently

shrewd marsh
#

does anyone have sora2 invite code??

torn zephyr
calm dawn
#

Hy , I want to learn ai image to video, not easy for me

shrewd marsh
torn zephyr
shrewd marsh
devout vault
#

just tested lithiumflow, wrote me a beautiful macOS system with 10 working apps and beautiful UI in one shot HTML file

#

the best coding ai i have ever tried

#

gpt-5 isn't close to anything

#

sadly

sullen quest
#

check out the conversation in coding champions about this specific thing

#

its a bit long

devout vault
#

the checkpoints

devout vault
#

love you bro ❤️

magic stag
sullen quest
#

3.1 veo is now first for video?

elder burrow
#

⚠️ NEW MODEL "POLARIS" ON LMARENA

sullen quest
#

seen it

#

put it in codename disscusion

balmy mist
vast fern
#

what is the best tool to make a movie

nimble atlas
#

if anyone want unlimited video gen lmk kek

sullen quest
halcyon harness
#

what do you think of orionmist and lithiumflow ? did they live up to your expectations?

hazy kernel
elder burrow
#

its like 10x worse than ectp or whatever

compact hornet
#

hello

sullen quest
# hazy kernel

Every time lmarena's discord is flooded with new people.. I have to deal with this so much

#

every hour

hazy kernel
#

wym by this anyway

compact hornet
#

please create 11 second video for facebook that some one calling by phone from africa

loud oracle
#

is there a problem with gemini 2.5 pro right now? Its not generating and i keep getting an error

sullen quest
sullen quest
# hazy kernel wym by this anyway

ai's by default often don't know who they are, they have to be told who they are with system prompts, most ai's on lmarena don't have system prompts that say who they are

hazy kernel
ionic trail
#

Hey

sullen quest
loud oracle
hazy kernel
sullen quest
hazy kernel
#

they all assumed as Gemini 3

worn tundra
#

whats the new best stealth model your opinion?

sullen quest
sullen quest
#

you already said that..

hallow igloo
sullen quest
#

....

empty stump
#

How is the old veo 3 ranked higher that sora 2

brittle tiger
jovial sapphire
elder burrow
#

WHAHAHAAHAHAAHAAHA

brittle tiger
elder burrow
#

😭

old garden
#

What

torpid kiln
#

why is lithiumflow not available on direct chat

leaden laurel
#

its in battle mode for testing purposes

elder burrow
#

CLAUDE CODE ON LMARENA

#

flying-octopus

#

FIRE

elder burrow
elder burrow
sullen quest
#

cool

fiery gull
#

The 2.5 pro is insane but imagine the 3.0

torpid kiln
#

omg bro

torpid kiln
torpid kiln
#

bro

#

omg

fleet lintel
#

(except for coding)I think 3.0 is going to be SOTA by good margin for all use-case

torpid kiln
#

gemini 3.0 is going to be the best at labelling images

#

omg omg

fiery gull
#

I bought a notebook with a 120hz screen on the smooth one 😆 finally I can d̶a̶t̶e̶ work with AI better

sullen quest
fiery gull
torpid kiln
#

vision has barely been improving on any of the sota models the past year

#

gemini 3.0 omgomg is going to be a game changer omgomg

fiery gull
#

The sonnet 4.5 will be dumb near of gemini 3.0 🫣

torpid kiln
#

lol

sullen quest
#

you have a stroke?

fiery gull
#

Leave the man dates the gemini 3.0 🙏

gleaming roost
fleet lintel
#

🚨🎬 Big news from Video Arena!

@GoogleDeepMind’s latest Veo 3.1 now ranks #1 in both Text-to-Video and Image-to-Video leaderboards. 🏆

This is a +30-point leap from Veo 3.0 → 3.1, making it the first model to break 1400 in Video Arena history!

Huge congrats to the

#

I dont really use Video generation... but is this true in practice that 3.1 Veo is better than Sora 2 ?

fiery gull
#

Why sora 2 is below?

sullen quest
#

I'm not big on testing video gen

#

so I wouldn't know

fleet lintel
#

and +30 lead is quite big

sullen quest
#

but people were saying sora 2 and sora 2 pro was better..

hot jasper
#

it is showing like this

#

tell me how to resolve this

sullen quest
fleet lintel
fiery gull
#

Flash 2.5 mentioned, my notebook is awesome 😆

hot jasper
fleet lintel
#

actually almost 40 points lead ..

sullen quest
#

for the rest of the laptops lifetime

sullen quest
hot jasper
#

hey tell me the exact solution !

sullen quest
#

put it in bugs

hot jasper
#

how to do that

sullen quest
#

scroll down in this discord, find "bugs" and put a bug report

hot jasper
sullen quest
#

idk

hot jasper
#

I mean Time 1,2..days

unborn oyster
#

why dont work

heavy flicker
#

Hi - Here because of curiosity.

sullen quest
#

hi

hallow igloo
#

@sullen quest bro do you use perplexity comet? Have you ever installed it?

sullen quest
#

I've installed it

#

idk how to sue it

#

use

hallow igloo
#

It's bad

sullen quest
#

ok

hallow igloo
#

It has access to all of your data and stores it

#

More harmful than google

#

You're giving perplexity all your code and stuff for some measly agentic abilities

sullen quest
#

ok

#

well I'll use it for stuff that doesn't benifit them then

#

if I ever figure out how to sue it

sour spindle
#

Is there more efficient way to get models than simply brute forcing new chat question new chat question

sullen quest
#

oh god

sullen quest
#

just remember to vote well

unique holly
#

Hi Im here to try out this AI

sullen quest
#

k

pulsar saffron
echo aurora
#

👀

quick light
#

remove this assistant B because he is annoying and lagging and it is impossible to send him another message

polar niche
#

Who has a pro chatgpt account I could use 🙏

jovial sapphire
#

Lithium Flow just made me a super fun game hahahaha

#

Create a highly detailed, single-file HTML tower defense game with a steampunk aesthetic. The player must defend a central "Aether Core" from waves of increasingly complex clockwork automatons that follow a winding path through your workshop. Players use a currency of "cogs" and "steam pressure," earned from defeated enemies, to build and upgrade defensive towers from a grid-based build menu. Tower types should include brass Tesla coils for chain lightning, steam-powered cannons for heavy single-target damage, and automated cog-launchers for rapid-fire attacks. A unique mechanic should allow the player to manually click a tower to "overclock" it, temporarily boosting its power and attack speed at the cost of a high chance it will short-circuit and shut down for a few seconds. The game must be visually rich, using CSS animations to show spinning gears on towers, puffs of steam, and electrical arcs. The user experience is paramount: tower range and stats must be clearly visible upon selection, the enemy path must be unambiguous, and the UI should be intuitive with brass-and-wood textures. Audio design should include the constant ticking of the core, the whirring and clanking of towers, the hiss of steam, and the satisfying metallic crunch of destroyed automatons. The difficulty must scale smoothly to remain engaging for many waves.