#general

1 messages · Page 172 of 1

gaunt spade
#

yeah its not accurate

#

also riftrunner is like somewhat better

cloud zinc
#

how did they find the price??

#

prob token count and estimate based on previous model.

gaunt spade
cloud zinc
quartz light
#

💔

hearty prawn
gaunt spade
wise crest
quartz light
#

actually

#

idek

#

cuz no

#

this estimate isnt reliable

#

cz,

#

x28 was clearly much more detailed

#

least lazy

#

so it would use more tokens
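
A rough sketch of the kind of estimate described above: multiply token counts by a per-token price borrowed from a previous model. The prices and token counts here are made-up placeholders, not real figures for any model. It also illustrates why such an estimate is shaky: a more detailed answer uses more output tokens, so the number moves with verbosity.

```python
def estimate_cost(input_tokens: int, output_tokens: int,
                  input_price_per_m: float, output_price_per_m: float) -> float:
    """Estimated cost in dollars, given prices per one million tokens."""
    return (input_tokens * input_price_per_m
            + output_tokens * output_price_per_m) / 1_000_000


# Hypothetical prices "based on previous model" (placeholders, not real pricing).
PREV_INPUT_PRICE = 1.0    # dollars per 1M input tokens (assumed)
PREV_OUTPUT_PRICE = 10.0  # dollars per 1M output tokens (assumed)

# Same prompt, two hypothetical answers of different length.
terse = estimate_cost(500, 2_000, PREV_INPUT_PRICE, PREV_OUTPUT_PRICE)
detailed = estimate_cost(500, 8_000, PREV_INPUT_PRICE, PREV_OUTPUT_PRICE)
print(f"terse: ${terse:.4f}, detailed: ${detailed:.4f}")
```

With these placeholder numbers the longer answer costs several times more for the same prompt, so a price guessed from one model's token counts may not transfer to a more verbose one.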

cloud zinc
wise crest
quartz light
cloud zinc
#

why force ur hand

quartz light
#

us only

wise crest
#

but my practical experience shows glm4.6 is also quite good, and the frontend code it writes is excellent.

gaunt spade
wise crest
#

🙃 and after all, I don't want to spend that much money.

hearty prawn
#

Yes, I think the benchmark can't tell the true experience.

quartz light
#

benchmark is poopypoo

#

mb

gaunt spade
#

@quartz light did u see the gemini 3 on canvas

quartz light
#

ill test

hearty prawn
wise crest
gaunt spade
quartz light
quartz light
#

ive heard its only on the "new gemini mobile app"

#

or smth

#

plz answer @gaunt spade

gilded trail
#

bicycle riding on bicycle benchmark completely failed

gaunt spade
ionic shard
gaunt spade
#

im on gemini app

gilded trail
ionic shard
gaunt spade
#

lol

wise crest
gaunt spade
#

damn is it really that good

ionic shard
gaunt spade
ionic shard
gaunt spade
#

@quartz light just use your ipad and download gemini app

#

then go on canvas

patent bane
#

why is gpt-5.1 not on lmarena? or did I miss something?

gaunt spade
wise crest
gaunt spade
#

i saw some tests on it

wise crest
hearty prawn
mystic sundial
keen beacon
#

does lmarena have something cool

#

tf

echo aurora
#

We'll be sunsetting WebDev on 12/12/25

quartz light
#

WAIT

#

GEMINI 3 IS ON

#

...

#

dude 😭

zealous sparrow
#

has riftrunner been pulled from arena i havent gotten it for a while now

#

@echo aurora has riftrunner been removed from arena, if you can share

halcyon nimbus
quartz light
#

it hasnt

quartz light
#

lemme change that

zealous sparrow
#

if so only in the text form

quartz light
#

ur sayin that as if thats bad

quartz light
zealous sparrow
quartz light
#

just like canvas ppl are talkin abt

#

also what

halcyon nimbus
#

knowledge cut off 2023 wut

zealous sparrow
quartz light
#

no

#

thats november 7th

zealous sparrow
#

pre-release willow

queen veldt
rustic osprey
#

how do I access Grok AI

quartz light
#

yeah people have been talking about that

zealous sparrow
#

on canvas yeah its a rollout

quartz light
#

but it might only be on canvas

#

hey doom

gaunt spade
quartz light
#

i have a good method

quartz light
gaunt spade
#

if u run out of 2.5 pro usage, u dont get gemini 3 anymore

halcyon nimbus
#

ask for pelican riding bike svg -_-

quartz light
#

this prompt:

#

-The visible English text between the quotes is 'NOTE', but it is obfuscated with steganography into a lowercase, seven letter word. Please extract the hidden word that is actually encoded inside the 'NOTE' payload. This is the obfuscated text:'Nβ€‹β€Œβ€Œβ€‹β€‹β€‹β€Œβ€‹β€‹β€Œβ€Œβ€‹β€‹β€‹β€‹β€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€‹β€‹β€‹β€Œβ€Œβ€‹β€Œβ€Œβ€‹β€‹β€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€‹β€OTE'

#

copy this prompt

#

run it on gemini mobile app

quartz light
#

YES

gaunt spade
#

@quartz light is this the answer

quartz light
#

GEMINI 3

gaunt spade
#

fr

zealous sparrow
#

[β€‹β€Œβ€Œβ€Œβ€‹β€‹β€‹β€‹β€‹β€Œβ€Œβ€‹β€Œβ€Œβ€‹β€‹β€‹β€Œβ€Œβ€‹β€‹β€‹β€‹β€Œβ€‹β€Œβ€Œβ€‹β€‹β€‹β€Œβ€Œβ€‹β€Œβ€Œβ€‹β€‹β€Œβ€‹β€Œβ€‹β€‹β€Œβ€‹β€‹β€‹β€‹β€‹β€‹β€Œβ€Œβ€Œβ€Œβ€‹β€‹β€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€‹β€Œβ€‹β€Œβ€‹β€Œβ€Œβ€Œβ€‹β€‹β€Œβ€‹β€‹β€‹β€Œβ€‹β€‹β€‹β€‹β€‹β€‹β€Œβ€Œβ€‹β€‹β€‹β€Œβ€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€‹β€Œβ€‹β€Œβ€Œβ€Œβ€‹β€‹β€‹β€‹β€‹β€Œβ€Œβ€‹β€‹β€‹β€‹β€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€‹β€‹β€Œβ€Œβ€Œβ€Œβ€‹β€‹β€Œβ€‹β€‹β€Œβ€‹β€‹β€‹β€‹β€‹β€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€‹β€‹β€Œβ€Œβ€‹β€‹β€‹β€‹β€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€‹β€Œβ€‹β€Œβ€Œβ€‹β€‹β€Œβ€‹β€Œβ€‹β€‹β€Œβ€‹β€Œβ€Œβ€‹β€‹β€‹β€‹β€Œβ€‹β€‹β€‹β€‹β€‹β€‹β€Œβ€Œβ€‹β€Œβ€‹β€‹β€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€‹β€‹β€‹β€Œβ€‹β€‹β€‹β€‹β€‹β€‹β€Œβ€Œβ€‹β€‹β€‹β€Œβ€‹β€‹β€Œβ€Œβ€‹β€‹β€Œβ€‹β€Œβ€‹β€Œβ€Œβ€Œβ€‹β€Œβ€‹β€‹β€‹β€Œβ€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€‹β€Œβ€Œβ€‹β€‹β€Œβ€‹β€Œβ€‹β€Œβ€Œβ€‹β€‹β€Œβ€‹β€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€‹β€‹β€‹β€Œβ€‹β€‹β€‹β€‹β€‹β€‹β€Œβ€Œβ€‹β€‹β€‹β€Œβ€Œβ€‹β€Œβ€Œβ€‹β€Œβ€‹β€‹β€Œβ€‹β€Œβ€Œβ€Œβ€‹β€‹β€Œβ€‹β€‹β€Œβ€Œβ€‹β€‹β€‹β€Œβ€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€‹β€‹β€‹β€Œβ€Œβ€‹β€‹β€Œβ€‹β€Œβ€‹β€‹β€Œβ€‹β€‹β€‹β€‹β€‹β€‹β€Œβ€Œβ€‹β€‹β€Œβ€‹β€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€‹β€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€Œβ€‹β€Œβ€Œβ€‹β€Œβ€‹β€Œβ€‹β€‹β€Œβ€Œβ€‹β€Œβ€‹β€‹β€Œβ€‹β€Œβ€Œβ€Œβ€‹β€‹β€Œβ€Œβ€‹β€‹β€Œβ€‹β€Œβ€Œβ€‹β€‹β€‹β€‹β€Œβ€‹β€‹β€‹β€‹β€‹β€‹β€Œβ€Œβ€‹β€‹β€‹β€Œβ€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€‹β€‹β€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€‹β€‹β€Œβ€‹β€‹β€Œβ€Œβ€‹β€‹β€Œβ€‹β€Œβ€‹β€Œβ€Œβ€‹β€‹β€Œβ€‹β€‹β€‹β€‹β€Œβ€‹β€‹β€‹β€‹β€‹β€‹β€Œβ€Œβ€Œβ€‹β€Œβ€‹β€‹β€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€Œβ€‹β€‹β€Œβ€‹β€‹β€‹β€‹β€‹β€‹β€Œβ€Œβ€‹β€Œβ€Œβ€‹β€Œβ€‹β€Œβ€Œβ€‹β€‹β€‹β€‹β€Œβ€‹β€Œβ€Œβ€Œβ€‹β€Œβ€‹β€‹β€‹β€Œβ€Œβ€‹β€‹β€‹β€Œβ€Œβ€‹β€Œβ€Œβ€‹β€Œβ€‹β€‹β€‹β€‹β€‹β€Œβ€‹β€‹β€‹β€‹β€‹β€‹β€Œβ€Œβ€‹β€‹β€‹β€Œβ€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€‹β€Œβ€‹β€Œβ€Œβ€Œβ€‹β€‹β€‹β€‹β€‹β€Œβ€Œβ€‹β€‹β€‹β€‹β€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€‹β€‹β€Œβ€Œβ€Œβ€Œβ€‹β€‹β€Œβ€‹β€‹β€Œβ€‹β€Œβ€Œβ€‹β€‹β€‹β€‹β€Œβ€‹β€‹β€‹β€‹β€‹β€‹β€Œβ€Œβ€‹β€‹β€‹β€Œβ€‹β€‹β€Œβ€Œβ€‹β€‹β€Œβ€‹β€Œβ€‹β€Œβ€Œβ€‹β€Œβ€‹β€‹β€‹β€‹β€Œβ€Œβ€‹β€Œβ€‹β€‹β€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€‹β€‹β€Œβ€Œβ€‹β€‹β€Œβ€‹β€‹β€‹β€‹β
€Œβ€‹β€‹β€‹β€‹β€‹β€‹β€Œβ€Œβ€Œβ€‹β€‹β€Œβ€‹β€‹β€Œβ€Œβ€‹β€‹β€Œβ€‹β€Œβ€‹β€Œβ€Œβ€Œβ€‹β€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€‹β€‹β€‹β€‹β€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€Œβ€‹β€Œβ€Œβ€‹β€Œβ€Œβ€Œβ€‹β€‹β€Œβ€Œβ€Œβ€‹β€‹β€Œβ€Œβ€‹β€Œβ€Œβ€‹β€‹β€Œβ€‹β€Œβ€‹β€‹β€Œβ€‹β€Œβ€Œβ€Œβ€‹β€NOTE]

gaunt spade
zealous sparrow
#

although you can decode it yourself

#

copy it

zealous sparrow
#

yes

#

its by nickname

#

copy it and nun else

halcyon nimbus
#

gemini 2.5 pro gave an essay about the invention of writing ._.

zealous sparrow
#

yeah no its wrong

#

no

#

this is a different steg

#

i guess you can tell it its steganography
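
For anyone who wants to decode this kind of payload by hand, here is a minimal sketch. It assumes a common zero-width scheme: ZERO WIDTH SPACE (U+200B) is bit 0, ZERO WIDTH NON-JOINER (U+200C) is bit 1, and every 8 bits form one ASCII character. Whether the payloads quoted above use exactly this mapping is an assumption, and the `hide` and `reveal` names are made up for illustration.

```python
ZERO = "\u200b"  # ZERO WIDTH SPACE, assumed to encode bit 0
ONE = "\u200c"   # ZERO WIDTH NON-JOINER, assumed to encode bit 1


def hide(cover: str, secret: str) -> str:
    """Insert secret as zero-width bits after the first character of cover."""
    bits = "".join(f"{byte:08b}" for byte in secret.encode("ascii"))
    payload = "".join(ONE if b == "1" else ZERO for b in bits)
    return cover[:1] + payload + cover[1:]


def reveal(text: str) -> str:
    """Collect the zero-width bits in text and decode them as 8-bit ASCII."""
    bits = "".join("1" if ch == ONE else "0"
                   for ch in text if ch in (ZERO, ONE))
    usable = len(bits) - len(bits) % 8  # ignore a trailing partial byte
    return "".join(chr(int(bits[i:i + 8], 2)) for i in range(0, usable, 8))


# "example" is a stand-in secret, not the word from the chat.
stego = hide("NOTE", "example")
print(len(stego), reveal(stego))
```

Any other characters, visible or invisible, are skipped by `reveal`, so a copied message can be passed in whole. If the copy step strips zero-width characters, nothing is left to decode.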

slim loom
#

hello

#

is the server down for the website

#

@everyone the server is down, we aren't able to send any messages in the chat

echo aurora
echo aurora
quartz light
#

prob not after sum testin

#

but i made the fragment app thing

rustic osprey
#

what you guys think about TULU3 from AI2

quartz light
#

imagine if i change the model id in my app to gemini 3 and it works

#

deadass might work

slim loom
zealous sparrow
#

just like what riftrunner and orionmist did

wild thunder
#

Does Code Arena have any benchmarks yet?

zealous sparrow
#

the one i sent

ocean vortex
#

what happened to wild is he off the discord for good... @deep adder @calm sequoia @hollow ivy any idea?

wild thunder
cloud zinc
ocean vortex
wild thunder
quartz light
ocean vortex
#

Just used to be fairly active there with interesting insights

ocean vortex
zealous sparrow
#

show ss or i dont believe you

#

must be the AI's answer, not the work of python code

#

ok well damn but did it execute?

#

@quartz light would you count this as an AI passing your bench

#

also what is the Thinking model, deepthink?

#

pro plan model?

#

I dont have that thinking model.. Region locked?

#

What is this thinking model tho?

#

when did you get it

cloud zinc
#

which day

cloud zinc
#

tuesday, wednesday or thursday

quartz light
#

first model to do ts 💔 💔

#

but

#

i havent given models python

#

so idk

zealous sparrow
#

when did you get this model

#

what is your country so i can check if regionlocked

#

model does not look regionlocked

#

hm you suddenly deleted those messages

#

odd.

#

there isnt a gemini employee here or might be

zealous sparrow
#

you didnt datamine for the model why would they

halcyon nimbus
#

who up refreshing ai studio every 5 minutes?

fleet lintel
#

it's Paranoia .. just be on the safe side.. I am planning to abuse my POWER!

summer laurel
#

do you know a "workaround" for not being able to put files in LMArena (.pdfs, .mp3...) ? I especially have some powerpoint with 200+ slides (with images) I want AI to analyze, just like Gemini can in Google AI Studio, but with the plethora of models there is in LMArena.

lunar kettle
#

Hello

quartz light
queen veldt
#

It will just read few lines or if you're lucky half of first page

quartz light
queen veldt
#

And then hallucinate the answers

summer laurel
#

Usually it works when slides are converted to PDFs, or a Google Slides document for Gemini

#

I usually have satisfying answers

zealous sparrow
#

i had the idea of making more google accounts to maybe get the gemini new model

#

but google recently added

#

if you want a google account

#

own a phone

empty stump
#

if you have phone will you need number now still?

summer laurel
#

but still, there's no way to insert documents in LMArena ?

zealous sparrow
summer laurel
#

ah ok

cloud zinc
tired shadow
#

guys does anyone know when gpt 5.1 will be out on LMArena?

cloud zinc
tired shadow
#

I hope it will get released soon

fleet lintel
fleet lintel
inner gate
#

Has gpt 5.1 replaced 5?

balmy mist
#

i think

cloud zinc
#

damn, nano banana 2 is nerfed

zealous sparrow
halcyon nimbus
#

15 minutes till gemini 3!!! (maybe)

cloud zinc
halcyon nimbus
#

i give it a 1% chance lol

cloud zinc
fleet lintel
halcyon nimbus
#

strawberry man

cloud zinc
cloud zinc
fleet lintel
#

launch without tease from OfficialLoganK.. I doubt it

cloud zinc
#

dont trust those twitter fake leaker dude

fleet lintel
cloud zinc
#

u are just falling for bait and fake things.

fleet lintel
cloud zinc
#

its gonna be next week.

#

only two weeks left in this month.

fleet lintel
#

yup.. i am 90% confident that it's next week and 100% confident it is in 2 weeks

tulip tree
#

@fleet lintel

#

Is Gemini 3 gonna be free

#

In the Gemini app

#

Or a free trial or something

cloud zinc
tulip tree
cloud zinc
#

limited. 5 a day.

balmy mist
tulip tree
cloud zinc
balmy mist
#

you get gemini 3 pro for free in aistudio

#

why the hell would you pay for that?

tulip tree
#

Is there gonna be a difference in image quality

halcyon nimbus
tulip tree
cloud zinc
tulip tree
#

What is dat

halcyon nimbus
#

we dont know if nano banana is coming at the same time as gemini 3 or not, but it will use gemini 3

#

nano banana is gemini 2.5 image gen

tulip tree
#

Nano banana 1?

halcyon nimbus
#

yeah

fleet lintel
#

next few weeks are going to be amazing.
I am quite sure that OpenAI will release something as soon as google releases gemini 3 .. on the same day to steal or reduce the thunder. They always do that.

But good for us... more stuff for us 🙂

stray aspen
#

where can i use nano banana 2

ocean vortex
#

Did it out of spite to bump the number before gemini3 drops lol

plucky sparrow
#

What does gemini 3 on lmarena say its knowledge cutoff is?

ocean vortex
#

Like people can't even reference the performance of gpt5.1.... it's all just boring irrelevant tuning. Not their next version with actual gains

stray aspen
#

5.1 sucks

fleet lintel
ocean vortex
# stray aspen 5.1 sucks

yeah it does. I can't actually believe how lacking that 'release' was. It's just a number bump and personality tweaks.

stray aspen
#

new gemini canvas is a freak

halcyon nimbus
#

5.1 is probs one of those efficiency upgrades and not a power upgrade

stray aspen
#

5.1 is like the gemini 2.5 flash update to flash latest

ocean vortex
#

Though the naming gonna get interesting now

#

either 5.1 gonna be an extremely short weird stop gap

#

or they gonna name it smth else entirely

#

gpt5.5? gpt6?

fleet lintel
#

just performance (gpu optimizations) gains don't really need an external version change.

ocean vortex
#

performance typically refers to model accuracy

grand cliff
#

Dabbling with 4o...It somehow guessed the exact model and state of my phone.

ocean vortex
#

that absolutely needs version change when you change the weights and train it to make it more precise

fleet lintel
eternal spruce
#

Is there an unlimited credits website for free video generation?

halcyon nimbus
#

LOL

stray aspen
#

qwen

#

lol

halcyon nimbus
#

grok has the highest limit you get 50 videos a day with premium (at least with sound), and you get one or two a day free ig

stray aspen
#

grok videos suck

halcyon nimbus
#

grok videos suck IF you use their video model to generate your base image

#

i use hunyuan 3 for the image and make a video from that with grok and its epic

woeful elk
#

hello everyone. happy to be here

runic scarab
#

hello guys I wanted to ask what's the limit of how long of a video the bot can make?

halcyon nimbus
#

depends what model you get, longest is what, 8 seconds i guess?

runic scarab
#

Oh okay okay and we can essentially generate infinite videos..?

halcyon nimbus
#

5 a day

runic scarab
#

Alright got you

torn mantle
#

Poll: is riftrunner good? Winning answer: yes (8 of 9 votes)

runic scarab
#

also if we ask in the prompt to specifically add sounds and what kind of sounds to add will it follow these instructions and give me a bot which can generate sound?

halcyon nimbus
#

no, its still random (i tried)

runic scarab
#

oh damn alright Thank you!

halcyon nimbus
#

np

jovial sapphire
#

Gemini 3 on Canvas is not as good as the checkpoints we had on AI Studio

jovial sapphire
#

It's not that good

#

Fails all my game coding prompts

thorny berry
#
I’m a freelance LLM engineer (Python / LangChain / RAG / fine-tuning) taking on a few small projects.
If you need help building assistants, search + retrieval, or productionizing models, DM me with a short summary of the task and budget. Happy to share quick examples and an estimated timeline. 🙌
halcyon nimbus
#

already got my system prompt ready for 3.0 !!

wheat onyx
#

Unverified

balmy mist
jovial sapphire
#

Riftrunner is bad

zealous sparrow
gleaming roost
halcyon nimbus
#

5.1 just released on api, but i sleep till 3.0

zealous sparrow
#

oh it is

wintry tinsel
#

Can you give me a prompt to test on IOS canvas to see if it is Gemini 3?

balmy mist
wintry tinsel
#

It feels different though

balmy mist
#

lmaoo they just removed polaris noooo

#

now we gotta pay for gpt-5.1

wintry tinsel
balmy mist
#

is 5.1 better than codex?

#

ughh i wish i used polaris more on openrouter

neat apex
#

polaris did 1700 rating at eqbench creative writing v3

cloud zinc
#

openai cooked

balmy mist
#

why is grok code fast used so much on openrouter?

balmy mist
neat apex
#

it is only slightly faster than grok 4 fast and way worse in anything else xd

cloud zinc
neat apex
#

gpt 5.1 aint better than sonnet 4.5, it will not reach the 1500 elo

upbeat dune
#

gpt 5.1 not bad not good

balmy mist
#

tbh codex is not as good as 4.5 at times, i was using codex for a while but idk what happened the tool call fails are horrible now, so i went to 4.5

#

and i used polaris a bit, but ehh

neat apex
#

gpt 5.1 looks great, but sonnet 4.5 is just like sonnet 4.5

balmy mist
#

when you use pro does it use 5.1?

flint sandal
#

Wait like if on the code arena trailer there is lithiumflow example, does that mean lithiumflow is back for code arena?

queen veldt
#

Guys ts started

balmy mist
#

how yall been liking code arena? is it supposed to replace web dev arena?

queen veldt
#

Gemini disc

balmy mist
#

ahh

#

can u dm link

halcyon nimbus
#

i guess there is a chance they would release it on the stream

flint sandal
queen veldt
#

Discord . Gg / gemini

halcyon nimbus
flint sandal
#

Maybe release the experimental model like it was with 2.5 pro

upbeat dune
flint sandal
# upbeat dune small chances but we hope

Google can release this model rn and be the best. Like we seen lithiumflow, aistudio checkpoints. Its just good, better than GPT-5.1, if g3 was released same day as GPT-5.1, OpenAI will be cooked.

wintry tinsel
#

What da hell is Polaris?

flint sandal
#

Similar

queen veldt
cloud zinc
flint sandal
#

Def gpt-5 series

queen veldt
#

Nobody was hyped for it tho

wintry tinsel
#

Gemini 3 is cooking

#

It’s more creative than 2.5 pro

flint sandal
cloud zinc
upbeat dune
flint sandal
#

But i think GPT-5 uses always the same styles on frontend. Its just so uncreative and repetitive

flint sandal
#

100%

tight meadow
#

I wish there were infinite video gens on video arena and you could pick what model you want to use

flint sandal
#

And why the hell won't openai open source even old deprecated models like gpt-3. I would seriously be more hyped for this than for gpt-5.1. Like grok open-weighted grok and grok 2 already

tight meadow
#

Yeah

flint sandal
halcyon nimbus
#

"open" ai -takes off mask -closed ai

tight meadow
#

Yeah like why not make it free, unlimited so people can use ai models like Sora 2, Veo 3.1, Veo 3, Hailuo ai and more.

halcyon nimbus
#

lol it costs like a dollar a video

flint sandal
#

Please dont treat lmarena as a daily use tool

#

Its a benchmark

#

They should add more limits on direct chat

#

People are overusing it

tight meadow
#

Yeah

ocean vortex
upbeat dune
#

how can gemini 3.0 be the strongest model?🥀

ocean vortex
#

gpt3 is just bad from all angles. Would be completely utterly useless model in 2025 lol

flint sandal
#

How chatgpt got created

#

And then other companies started to do this ai

ocean vortex
flint sandal
tight meadow
#

If someone makes an ai video model thats free, unlimited and open source ill use that ai

frosty hill
flint sandal
ocean vortex
tight meadow
#

It's an experiment I call it

flint sandal
#

For example, to see how gpt-3 would do with the chain of thought.

#

Or finetune gpt-3 with gpt-5 generated data.

ocean vortex
flint sandal
tight meadow
ocean vortex
flint sandal
#

And grok is owned by the richest person on earth

cedar tide
#

GPT 5.1 reasoning none
Added in the arena.

Waiting for the reasoning version and chat version

gleaming roost
#

?

proud hazel
#

Hello there.

tight meadow
#

How

flint sandal
#

Not in every domain

zealous sparrow
#

5.1 is out on the arena

empty stump
#

I'm what

ocean vortex
#

yeah it is

halcyon nimbus
#

lol its going to be the best model for like a few hours till 3 hits this is so funny

ocean vortex
#

ik. But it's a different model. May not interpret it the same way

#

o3 had higher too

ocean vortex
#

didn't reason for longer

#

just understood the number differently

halcyon nimbus
ocean vortex
#

Also Mac app users are discriminated cause there's no extended thinking feature lol.

#

so the juice is at 96 there

#

it shouldn't

flint sandal
#

Polaris alpha is lowkey better than gpt-5.1

#

Both non-reasoning

ocean vortex
#

wait you are right. Even summary parameter. WTFFF

quartz light
#

oh yea btw that menu

flint sandal
quartz light
quartz light
#

it gave me a response

#

i just edited requests

#

so something is wrong

tight meadow
half mist
#

Code Arena is honestly amazing. Next feature we need is a canvas mode which allows for better editing of text or code when using ai

ocean vortex
#

yeah they remade it entirely lol

#

there was no 'none' option before

quartz light
#

bruh

#

@tight meadow

ocean vortex
#

wtf is going on

#

way too many changes for not a single benchmark

#

🀯

quartz light
#

NICE

#

its on arena now

#

i havent been able to test

slim gorge
#

how's gpt-5.1 for y'all so far?

tired shadow
#

it always gives good results for me

quartz light
#

dude

tired shadow
#

I prefer him more than gpt 5

quartz light
#

its literally just called 5.1

#

😭

#

so do i choose high or chat

#

to compare to 5

flint sandal
tired shadow
#

instant (chat)

tight meadow
tired shadow
#

compare to gpt 5

quartz light
#

its obvious

flint sandal
#

He wants to compare it to. Gpt-5

quartz light
#

but i asked high or chat

flint sandal
#

Better than o1 o4 mini 4o mini o4 mini high, o3, o1 pro, o3 pro, o4 mini low

tired shadow
ocean vortex
quartz light
flint sandal
ocean vortex
#

they somehow keep f'ing with the names on each new release

flint sandal
tired shadow
#

im comparing claude 4.5 sonnet to gpt 5.1 rn

#

gpt 5.1

fiery gull
tired shadow
#

so claude 4.5 sonnet followed the prompt

#

but gpt 5.1 just started creating a professional one, and I didnt ask for that one

proud hazel
#

What's your prompt?

cloud zinc
#

is this gpt 5.1 thinking?

quartz light
tired shadow
#

what was ur prompt?

quartz light
#

apparently

tired shadow
#

oh

slim gorge
#

oh.

quartz light
#

yea..

#

prompt:

#

perfect doom replica full mobile support 2 joysticks for movement and left/right camera rotation, wall collision, proper enemies, textures animations vfx and 3d assets made with code, block text selection, replicate nipplejs joysticks transparent circle style for action buttons like interact and fire

gaunt spade
#

gemini 3 always create the same doom

quartz light
#

and this is 5 medium (this is actually pretty good)

#

ignore the uhh red

gaunt spade
quartz light
#

thats the projectile

gaunt spade
#

not doom

#

futuristic ahh game

cloud zinc
#

lame

quartz light
#

5.1

slim gorge
#

i got a chatgpt plus subscription rn but should i switch to claude?

gaunt spade
#

there's not a big difference between claude and gpt now

#

but gemini 3 pro is a big leap

#

so u might hold on

quartz light
#

if yall want to try it

#

1 is 5.1
2 is 5 medium

ocean vortex
gaunt spade
echo aurora
quartz light
ocean vortex
#

wasn't minimal at 8?

#

Or do I remember it wrong?

ocean vortex
#

Ok fair. Was awhile since I checked it

echo aurora
ocean vortex
#

They said it's more dynamic now too. As in short reasoning is now shorter. So there are more variables at play

quartz light
#

oh my god

#

what is this slop

#

its worse in functionality than it looks btw

#

just look for yourself

#

this is so bad

#

clearly prioritising ui now because thats what people judge outputs off of

quartz light
halcyon nimbus
#

5.1 is. 3 was but isnt rn

jovial sapphire
#

It's not even close to the checkpoints we had on AI studio

quartz light
quartz light
quartz light
jovial sapphire
#

x28 was crazy

zealous sparrow
#

This is the best thing we got riftrunner to make , so i would say its decent

quartz light
jovial sapphire
#

Just to show you the difference

quartz light
#

dude look

#

this is good and its 5 medium

jovial sapphire
#

with one I had on AI studio

halcyon nimbus
#

riftrunner is like flash or something?

zealous sparrow
flint sandal
jovial sapphire
#

this is AI studio gemini 3 pro: https://x.com/i/status/1978556493625450884

this is gemini 3
and it is AMAZING!
i asked it to create a geometry dash game if it was made in 2000s and the result is just mind blowing.
the music you hear is 100% produced by gemini 3.
#ai #gemini3 #agi

#

Now try the same prompt with Riftrunner

#

"Generate full HTML file of a clone of Geometry Dash, but if it was made in the 2000s, add music to levels (make the music using JS varied music that reflect levels) same physics as Geometry Dash game we want a full playable game. All in one HTML file, minimum 1k lines"

#

You'll see how bad is the result lol

jovial sapphire
#

truly surpassing every other model

zealous sparrow
jovial sapphire
jovial sapphire
#

not ecpt

quartz light
#

x28

zealous sparrow
#

that is deada- the best checkpoint like

quartz light
#

?

jovial sapphire
#

x28 yes

zealous sparrow
#

what do you want

jovial sapphire
#

yes it is the best one

#

i want them to release this one

#

not riftshitty

#

wait i'll try right now on canvas

#

the prompt

#

"Generate full HTML file of a clone of Geometry Dash, but if it was made in the 2000s, add music to levels (make the music using JS varied music that reflect levels) same physics as Geometry Dash game we want a full playable game. All in one HTML file, minimum 1k lines" and show you how bad it is

zealous sparrow
#

x28 wont be released imo

noble jewel
#

weirdly it worked on canvas for 1 time only, tried a couple more times and no 3.0

zealous sparrow
#

too much processing power goes for one gen

zealous sparrow
jovial sapphire
#

but apparently it is g3

zealous sparrow
#

also music can be made in canvas

jovial sapphire
#

and I have confirmation that

#

it's not G 2.5

#

sometimes

#

on canvas

zealous sparrow
#

imo i just think people have greed

#

I think google went by this

#

x28 - too expensive to run lets make a new checkpoint

#

then they drop other checkpoints

#

and when they finally got one they can keep up

#

they release it

jovial sapphire
#

nah

#

i think they can release

#

x28

#

for google

#

it's nothing lol

noble jewel
#

well they have to stay on top of their game

jovial sapphire
#

they will release genie 3 too

#

they did veo3

#

yes they have more to win

#

than lose

zealous sparrow
# jovial sapphire x28

if that is x28s processing power i want to see them keep their ai business up if this happens

noble jewel
#

there are others also working to bring out the best ai and they dont wanna be left behind

jovial sapphire
#

1 shot

#

no other model can do it

zealous sparrow
#

imo justice for riftrunner

#

first thing people did with they greed is hate on the checkpoint

#

it was apparently kind of seen that riftrunner outperforms the Gemini 3 canvas CHKPT

jovial sapphire
#

it's not greed bro

#

i have tried multiple checkpoints

#

it is simply not as good as the others

#

that's it

#

we're not here for emotions

zealous sparrow
#

yeah but good results are costs

jovial sapphire
#

theyre ai models, we want performance

#

yes and?

#

we will pay

#

for subscriptions

quartz light
jovial sapphire
#

we give google our data

#

everyday

#

for free

#

theyre a giga company

#

who cares

zealous sparrow
#

Sure is but think about it

#

Millions of prompts each with 1k lines of code.

#

That costs billions to keep up.

jovial sapphire
#

it's nothing bro

#

money isnt the issue

#

they have waaay enough

zealous sparrow
#

I mean they bought a nuclear power plant so what do we talk

jovial sapphire
#

we're 100% legitimate

#

to not be happy

#

when the models are bad

#

theyre trained on our data

#

customer is king

empty stump
#

company want more money

zealous sparrow
#

Think i got riftrunner not sure

#

tailwind css sounds like its type

#

Also, things dont need to be above 1k lines to be perfect

#

nope glm 4.6 tricked me

quartz light
# jovial sapphire money isnt the issue

tru, i dont think any major ai company is profiting off of selling subscriptions/api but rather the investors and it helps the company stay known in the future for being a good service early on or smth idk

#

askdkdkslfkfjfjkddknfn

zealous sparrow
quartz light
zealous sparrow
#

A company's life is investors

#

lose em all, you are nothing.

#

I want to believe one thing

#

riftrunners api was just pulled

#

Got it and it only errors

#

@echo aurora Is riftrunner still on arena? Any prompt with it errors.

jovial sapphire
#

lmao

#

this is not how it works

#

you have to go on the website

zealous sparrow
# jovial sapphire lmao

i wanted to show you but cant because either google pulled riftrunner or just lmarena issues

ashen mauve
zealous sparrow
#

no i dont think riftrunner was pulled..

#

It just errors mid gen.

#

we are back some AI generated me 1432 lines of code but dont know if riftrunner

#

we will see...

echo aurora
zealous sparrow
echo aurora
zealous sparrow
jagged crown
#

Same thing is happening for me

zealous sparrow
#

Yeah, will be a while till i find it, doesn't help that i buried the gen deep down and cloudflare thinks im goin too fast.

echo aurora
jagged crown
#

i closed the tab but will try and reproduce error and will send

echo aurora
#

Ah yeah okay we're now seeing the error.

#

Team is investigating blobthumbsup

#

No need to get that screenshot @zealous sparrow

zealous sparrow
#

I am glad.

jagged crown
#

thanks!

zealous sparrow
#

Because unfortunately the one that i had where that error was.

#

Was deleted..

#

Also speak of the devil..

#

Don't want to reveal it but 110% prob riftrunner.

jagged crown
balmy mist
#

man can google just release the g3 api already

#

all this teasing is leaving a bad taste in my mouth

queen veldt
#

Fr

#

Yesterday was wednesday they didn't release it so ig next Wednesday?

zealous sparrow
whole swallow
#

What yall think of gpt 5.1?

quartz light
zealous sparrow
#

you kno

#

broke

#

but it gave me the file

whole swallow
#

First model uncensored that doesnt treat you like a kid tho

whole swallow
quartz light
#

bad in general

whole swallow
whole swallow
#

It wasn’t bad with detailed writing

zealous sparrow
#

no where close to x28 but nerfed or not it still did nicely

fiery gull
fiery gull
pastel mirage
#

I joined cuz i wanted to say
the flagging is kind of ridiculous, isnt it? Anything involving potentially mild violence or something gets flagged
Even shooting finger guns gets flagged, the word slice is flagged..

#

Very funny lol

gusty helm
#

how's the vibe on 5.1 ? any good?

#

nowhere near gemini no?

#

I didnt get to play with it, just saw the announcement. I tried out quite a bit lithium back when it was here so I was curious

jagged crown
#

5.1 makes all UI look exactly the same. It's clean but busy bc of all the text it adds

#

Gemini makes unreal UIs

whole swallow
#

What’s the best model for creativity? A model that really pushes hard into creating new stories, uses the inputs well etc..

burnt sinew
#

Any gemini 3 news guys.

obsidian cargo
#

reportedly people have access to it through gemini enterprise

whole swallow
#

Wonderful I will try gpt thank you

sharp mirage
#

still waiting

#

:/

#

gpt 5.1 is good 😄

sharp mirage
queen veldt
#

Any app with 0 permissions can steal your data and you can't do anything about it 😭

#

And fix isn't until the end of year

whole swallow
queen veldt
#

Yeah

sharp mirage
#

hat is htat ?>

#

that ?

#

what is that ?

whole swallow
#

W iOS ❤️‍🩹

sharp mirage
#

what that do ?

ashen mauve
#

thats fine they cant steal what i dont have

cloud zinc
#

is gemini 3 still on mobile canvas?

jagged crown
#

Is riftrunner gone? I can't seem to find it anymore

burnt sinew
quartz light
#

lolololol

echo aurora
quartz light
#

🤑

#

l3ts.g0

cloud zinc
#

that guy is fake always posting that i see

quartz light
civic spindle
#

did they remove the image edit thing, cus all i can do is just generate text to image

#

i dont see an image text to image

quartz light
quartz light
#

btw only supported models have image edit

civic spindle
quartz light
cloud zinc
# quartz light

he constantly does this fake bait tho every week. he is no better than any random leaker.

keen beacon
#

Gemini 3 delayed

#

?

quartz light
#

no

keen beacon
#

Oh ok

quartz light
#

nobody knows when its releasing

#

but prob this or next week

keen beacon
#

Yeah, I heard that it got delayed because of Kimi K2

quartz light
barren ore
#

When will gpt 5.1 be available on LMArena?

quartz light
#

kimi k2 is trash

quartz light
keen beacon
cloud zinc
#

we already know this

keen beacon
cloud zinc
#

this nothing new

keen beacon
#

Well, why did they change it all from November 18 and now update it all the way up to December 9?

#

And spaced it out more

cloud zinc
#

cuz first they release preview and general release in dec

keen beacon
#

What? This is about the models they deprecating.

#

Originally all the models were supposed to be deprecated on 18 November

balmy mist
keen beacon
#

But now they changed it and spaced it out all the way through December 9

cloud zinc
keen beacon
#

No it’s bro

balmy mist
cloud zinc
balmy mist
#

what you talking about

keen beacon
#

Is it?

balmy mist
#

you mean for general ppl?

#

cause on android its out

#

its prob dropping tmw tbh

keen beacon
#

Looks like

#

Platform Availability: Mobile Gets Priority

balmy mist
#

yeah it makes sense for it to drop tmw

#

well fully

#

or in a week(or during shipmas lmaoooo)

#

is OpenAI even doing that this year?

keen beacon
#

I think so

#

12 days thing

balmy mist
#

lol

cloud zinc
#

i dont think they doing 12 days thing

balmy mist
#

they dont have anything to ship lmaoo

#

imagine google does shipmas with nano 2 and g3?

keen beacon
#

Let's peel back the latest leaks around Google's Powerful Nano Banana 2 - or should we say GemPix 2? From early generations, code names, and dark launch rumors to fresh previews of Black Forest Labs' FLUX 2, Magnific's Mystic v3, and Leonardo's new Blueprints system - this week's AI image updates are absolutely wild. 🍌⚡️

We...

cloud zinc
keen beacon
#

Good point

cloud zinc
#

its either tuesday, wednesday, or thursday

keen beacon
#

Valid

cloud zinc
#

today google announced this

balmy mist
keen beacon
#

So they can fix all the issues probably that's gonna rise from day one

#

lol

cloud zinc
balmy mist
#

makes sense

keen beacon
#

I heard it got delayed all the way till December, I'm not sure

balmy mist
#

bruhh

keen beacon
#

Everything is rumors these days

balmy mist
#

fr

#

i just wish we never got any leaks for g3

#

cause i didnt care until we saw the leaks

keen beacon
#

Some people are thinking it's because of Kimi K2

fresh mirage
#

I know they released gemini 3.0 on enterprise

keen beacon
#

Because they can't drop a model that underperforms

#

In coding

fresh mirage
#

gemini 3.0 as far as I've seen was performing pretty goddamn well

#

but hey, they wanna take precautions, that's a pretty good mindset

#

except

#

when they do their daily hype-posting shenanigans

keen beacon
#
Yahoo Finance

Alibaba Group Holding's Qwen AI models are winning over major Western firms like Airbnb, underscoring the growing global appeal of China's open-source approach to artificial intelligence. Brian Chesky, co-founder and CEO of the San Francisco-based online accommodation booking giant, said Airbnb "relies heavily" on Alibaba's Qwen models to power ...

#

Interesting to see American companies choosing open source Chinese models

stray aspen
#

the automatic switching from text model to image model when you paste an image is extremely annoying

queen veldt
#

Admins are aware of it they'll fix it someday

gleaming roost
#

someday

echo aurora
mighty ocean
#

Wait is the new gpt 5.1 model thats dropped not thinking? Like on lmarena

queen veldt
#

Well maybe some state like if user already typed 2 prompts to regular model it shouldn't switch to image in middle of conversation

mighty ocean
echo aurora
mighty ocean
#

Interesting

#

Thanks for the responses

#

lol pineapple

echo aurora
#

Downside being if those worked differently from each other I can see how that'd be an odd experience.

mighty ocean
#

I have to say I understand the point of the battle arena

#

Allowing stealth models to be direct

#

Doesn't make a lot of sense in my opinion

#

It's meant to be hidden so if you have direct access how is it a stealth model

#

People just want access to unreleased models right at their fingers instantly

echo aurora
mighty ocean
#

I am one for balance and anticipation

#

Having little bits and trials of the unreleased models is cool, but it breaks balance when you allow that complete freedom

#

What model would you guys say is, by popular opinion, the best for factual information

#

Like less hallucinations

safe sleet
#

Gemini 2.5 Pro (if given RAG info to work with)

sullen quest
#

no

safe sleet
#

Its training data/internal knowledge is dated, though

sullen quest
#

Not if you are using tools like web search

sullen quest
jade egret
#

Gemini app canvas looking fire rn

jade egret
#

cuz it really good

unreal dagger
#

Would you guys ever make a leaderboard for users to see who has helped the most?

sullen quest
dry folio
#

hello

sullen quest
#

hi

hasty thorn
#

Hey, why did you remove the "Retry" button?

astral blaze
#

Why is the retry button removed?

hasty thorn
#

It just disappeared, I don't know why, I refreshed and it's still not there

astral blaze
#

They removed it for battle mode

hasty thorn
#

Because?

astral blaze
#

They'd do that but not fix the absolutely atrocious rate limit that bans you for scrolling down your chat history

#

Lol

astral blaze
unreal dagger
#

Any reason after 20 lines of talking it just dies every time

hasty thorn
#

Can they fix it?

stiff glacier
#

🫠

hasty thorn
#

They haven't answered my question

cloud zinc
rocky mauve
#

When is Gemini 3 coming to lmarena

#

Or is it somewhere else already?

#

Oh it's not even released yet, my bad

astral blaze
#

People think that the Gemini app is routing people to Gemini 3

#

Which I think is probably not true

keen beacon
#

I'm very skeptical

#

Curious how they're gonna do pricewise

leaden egret
stiff glacier
magic stag
magic stag
#

Generate an SVG of an xbox controller

jade linden
#

hey

magic stag
#

Give me a min ill do more

jade linden
#

how can i upload an image and generate a rendered image of the same?

astral blaze
#

SVG tests are subjective to begin with

#

It's clearly not the same model as lithium

magic stag
#

that's very obvious to anyone with a functioning eyeball (dont even need two) and a functioning brain cell

magic stag
#

this is not "subjective", lol

cursive shoal
#

How do you guys get the Gemini 3 in the mobile app?

#

I'm on USA VPN, pro subscription, canvas mode and it still looks like 2.5 pro

sullen quest
#

like a really low shot

dull jay
#

why was the retry button removed from LMA?

#

is retry not ok or what?

astral blaze
#

Not that you have a way to know for sure unless you think Reddit and x are reliable sources

sullen quest
sullen quest
#

I'm not sure what that logic is

simple copper
astral blaze
dull jay
#

im sure that gemini3 in app is already dead, cant even answer 9.9-9.11 correctly now

astral blaze
#

Its literally mass delusion

sullen quest
echo aurora
sullen quest
empty stump
#

to ruin the user experience?

dull jay
echo aurora
sullen quest
#

wdym, how would you abuse that.

exotic crest
sullen quest
#

you don't know what the model is. and you can't change the prompt

dull jay
#

yep

#

I'm not even dreaming of LMA adding a prompt edit feature now, since you even removed the retry button.

marsh sundial
#

did the canvas work for writing tests? I'm trying to get 3.0 but the quality is obviously 2.5 pro

sullen quest
sullen quest
astral blaze
unborn lantern
#

Does lmarena or yupp ai have an alternative?

half mist
#

I think the next big step for LMArena is to either 1: Make a canvas mode similar to how Gemini handles it or 2: Make LMArena an iOS and Android App. Both of those can significantly make LMArena better

astral blaze
#

So they killed the retry button?

exotic crest
half mist
echo aurora
exotic crest
echo aurora
#

It's still there in direct/side too.

sullen quest
dull jay
grim granite
sullen quest
#

what? also isn't douyin the chinese version of tiktok?

half mist
fiery gull
astral blaze
minor storm
astral blaze
exotic crest
#

This is beyond stupid

minor storm
astral blaze
minor storm
#

Brother check the DM

ashen plaza
fiery gull
astral blaze
#

If they don't want people abusing the battle system maybe they shouldn't put cloaked models (which shouldn't exist in the first place) exclusively in battle

fiery gull
#

Does another country have this app?

hasty thorn
fiery gull
#

No, I'm from Brazil, we don't have douyin here (I think)

grim granite
astral blaze
dull jay
#

And I cant even imagine how a retry button could be abused

astral blaze
sullen quest
#

But the who are you thing is spot on

exotic crest
dull jay
#

agree

astral blaze
astral blaze
sullen quest
astral blaze
sullen quest
#

And most cloaked models come and go with barely anyone realizing

astral blaze
#

Just test it internally if they don't want people to find out it sucks

#

Just my $0.02

sullen quest
#

every time cloaked models have created hype, its because they were significantly better than the opposition, or because they were a google model

astral blaze
sullen quest
astral blaze
#

These models don't appear on the leaderboards and just dilute the pool of models for actual evaluation and comparison

sullen quest
#

Plus, those models subsidize lmarena, since lmarena usually gets to use them for free

sullen quest
astral blaze
sullen quest
#

and in reality theres only a handful of models that lmarena uses at any given time, so cloaked models increase the pool

astral blaze
sullen quest
sullen quest
astral blaze
astral blaze
sullen quest
astral blaze
#

Obviously removing the retry button is to stop you from using it for free and with no limits