#general

elder solar
#

Now it has some sort of stronger filter

storm dust
#

russian vs captchas

gray isle
#

sonnet better be not bugging

#

Yikes, if only Opus are available through Sundays lol

chrome goblet
#

And Gemini 3.1

gray isle
#

or just 4.5 thinking with File Upload. i don't need vision at all

#

Like not again

#

that's 7th time now

#

the code is about to end, and it just crash like that

#

tf?

chrome goblet
#

?

gray isle
# chrome goblet ?

i've been asking sonnet a very difficult (Difficulty: Closer to being Impossible) simple prompt. where it would create me an HTML, somehow it ends up like that, and for the 7th time in a row, (in 5 hours) it ended up like that again

broken sundial
#

hi, does anybody know how long a session goes until I hit the session limit?

gray isle
#

been investigative here though, some people who suggests ideas here, doesn't reappear or neither they come back after producing an idea. (which comes into fruition, that it became a reality) (people who basically joined for one time, and doesn't chat back, like are u a bot or something)

vale quest
#

Is me @empty sky because my account is at risk and I accidentally phone locked it

heavy latch
gray isle
#

anthropic naming Mythos as Mythos, and saying they're not gonna release it in the public, makes me think, it's just a myth and it's just Opus 4.7 lol

#

after all mythos means myth lol

#

after all every hyped model isn't really as good as they say it would be lol.

#

(and i just used the word "after" and "all" two times and three times as i just typed this)

#

i would like my Side by side be like that lol

undone scarab
#

oh its not removed nvm

undone saffron
#

Too slow

gray isle
hidden elk
atomic lagoon
gray isle
# hidden elk Sometimes, it didn't respond

mine would not be muse at all, but either GLM (which at this time, i may gave a praise, since my opinion changed, but it just lack something i need, which is file upload) or sonnet

gray isle
undone saffron
atomic lagoon
undone saffron
#

Something about GraphQL of ig

hidden elk
#

Hey
If I use only for roleplay. Which model I should use?

#

I usually use Glm 5.1, sonnet models. I want to try out other models as well.

undone saffron
#

@burnt sinew
Why did you set the arena's current state as your profile pic?

#

Literally 📉

undone saffron
tranquil badge
#

what the hell is this recaptcha bruh

#

fix yall site

#

this is so buns the recaptcha aint working ive been 30min hitting everything good and still tweakin

#

fix yall recaptcha

#

sh ai

undone saffron
gray isle
#

i don't even complain about recaptcha, i complain about this goddamn error lol

#

that's the thing people should be complaining about

#

why the hell does it randomly stop out of nowhere

#

so it's a random battle mode?

brisk turret
#

the models are outdated! this data is misleading to people who look at it and think gemma3 is still best (gemma 4 is out)... also the BLEND ratio should not be fixed

#

there should be a slider for the blend ratio, not fixed at 3:1

gray isle
undone scarab
#

for Chinese models are cheaper api

ripe moss
#

Hello guys

sterile wagon
#

Dude, what's with trying to add points when the site isn't even working properly? Logging in is a nightmare because it keeps glitching 🙄

#

Stop adding Windows 11-style features for the love of God

undone saffron
blissful knot
#

why is the opus model in the arena gone now?

red chasm
#

@blissful knot

#

that's the reason why there are no models

#

Well, for about a week now

#

We're waiting for it to be returned

#

were removed from April 3

blissful vapor
#

What's the worst possible model on arena? My friend is asking for ai to help him with code.

boreal bough
#

im just generating image

blissful knot
# red chasm

@red chasm can that model be obtained indirectly in "battle mode"?

red chasm
#

but I haven't tried

#

Although I really miss GPT 5.4

blissful knot
gray isle
tulip flume
#

the image models are gone again after login

gray isle
#

and it was good friday at the same time

tulip parcel
#

Hey guys, quick question. I have a ChatGPT plus subscription right now, and I want to ask if it's worth it to cancel it and get a Claude subscription? I mainly use AI for uni projects and stuff like that. My only concern is Claude having certain token limits and ChatGPT being unlimited. What do you think?

wary nacelle
wary nacelle
#

Claude is way better for exactly Optimized and good code

#

When GPT is for thinking and other planning kind of tasks

#

GPT rn is not the best at coding

undone saffron
wary nacelle
wary nacelle
#

but he says pineapple will kill him

#

so yeah rn it only fixes client side stuff

#

like chat stuck

#

Captcha

#

Skip button not appearing when forced comparison in direct chat

#

Copy Buttons at the bottom of your prompt so you can copy easier

wary nacelle
#

LMarena old theme-

#

and ofc my fav Enhance Prompt which is ultra useful to me

undone saffron
#

The classic arena theme was the best

wary nacelle
undone saffron
wary nacelle
#

Save whole Chat History Context

#

as a one prompt

#

which is also good

#

incase LMArena chat breaks so hard

#

that extension cant even fix it

#

-# which kinda happens very often on claude-

undone saffron
wary nacelle
#

uhh we already have that

#

this button is universal

#

all it does is....

#

ignore that chat is already generating a response

#

and send a new prompt

#

how is it good?

#

FIXES ALL ISSUES
even errors

tulip parcel
wary nacelle
undone saffron
wary nacelle
#

cuz Chrome Web store wants 5$ to upload extension 😔

wary nacelle
wary nacelle
#

yes

#

😔

#

And... the thing is Chrome Web Store doesnt accept my currency

#

either @light siren 's

undone saffron
#

I've created a twitch extension to automate everything

#

But I haven't gotten around to making one like the one you made for arena

wary nacelle
#

cuz if you did you would lose some huge amount of braincells just trying to understand how tf Arena Ai even functions on such unstable code

#

on hopes and prayers ig

#

Literally cuz LMArena is financially supported by big companies

#

they dont rely on us... Users

#

thats why their User Experience is ass

#

Even claude has better user experience

undone saffron
light sleet
# wary nacelle

Basically what they changed is background color and the name and the font, and removed the thing that was saying "Find The Perfect AI For You" thing with the ai logos

undone saffron
wary nacelle
undone saffron
light sleet
#

Yea but ngl I liked old LMARENA background color better

#

New fonts aren't that good too

wary nacelle
#

they be trying to make themselves look more PREMIUM while they cant just fix a damn copy button to be at THE BOTTOM of the user message

#

instead of the middle

#

Like is it actually that hard?

#

why is it important?

wary nacelle
#

cuz if you send a long prompt

#

goodluck finding the copy button

light sleet
wary nacelle
#

Max

#

goofiest ai router

undone saffron
#

@wary nacelle
Add an option to completely disable the math markdowns
When AI processes a complex regex, that function breaks the regex visually and you can't copy it

wary nacelle
#

even i can make better one

#

hmmm

#

good idea

light sleet
#

😔

wary nacelle
#

just realized it had a highlight

light sleet
#

lmarena was better

#

arena kinda lags too

#

I didnt lag in lmarena it was more simpler and better

wary nacelle
#

oh its a simple reason

#

smth happened to their Senior Full Stack Dev

#

and new one is kinda bad

#

okay now how tf do i upload it to chrome web store..

#

without 5$ commission fee

vale quest
#

Huh

silent tree
#

like rolls out in others first then u can think what to do for chrome web

#

like edge

wary nacelle
#

Microsoft Edge Webstore & Opera Webstore are free yes but they are completely different frameworks and do not work on Chrome

#

AND

#

Chrome webstore was specifically made

#

to be dominant

#

so google can be greedy

#

and earn money

silent tree
#

Oh

wary nacelle
#

and sharing the extension source code is a bad idea

silent tree
#

people could modify and professional ones would probably bypass arena stuff by modifying stuff

silent tree
wary nacelle
#

since yall already have this

#

image

#

my wants yall to find other icons

#

idk

#

he needs reference to perfect the extension Bring Back LMArena Theme thing

#

idk @light siren

light siren
silent tree
light siren
#

nope

#

it's a really expensive model

#

unless u wanna use anthropic's website directly

silent tree
#

but lemurs better and non laggier and specifically made for extensions too.

light siren
#

I've used it myself once

silent tree
untold rune
inland quest
#

where and how extension can be downloaded?

#

or its private rn?

inner relic
#

are they offering claude opus for free

#

bankrupt is real

silent tree
#

yeah cuz that's a edited arena lol

silent tree
#

prove it

#

take a video.

inner relic
#

which platform is this

#

I am interested if it offers mimo for free

silent tree
#

then its fake lmao

#

its fake dont believe it

inner relic
#

It doesnt look like fake.

silent tree
#

gatekeep your shi

silent tree
#

lmao

#

💀

inner relic
#

uhh okay. I will stay quiet

silent tree
#

spammers the one that r mad

inner relic
#

Might be fake dude

#

If they offered claude opus 4.6 for free

#

gg

#

they are bankrupt

#

what ads are useful for? if they dont have 1k member

red chasm
#

What is this site?

light sleet
#

Top example of being slow

red chasm
#

Is everything more or less free there?

#

I don't give a damn about advertising

#

I need a website

#

haha

silent tree
#

<@&1349916362595635286> advertising and misinformation

inner relic
#

I noticed this guy is an alt

#

suspicious

#

No generous providers offer claude opus 4.6 for free

#

👍

light siren
#

yes

light sleet
#

nobody wants your fake edited arena

#

take your advertising to someone else

#

LMFAO

#

😭 😭

#

he's actually slow

#

sad

wary nacelle
#

@light siren And Users

#

what do you guys think about

#

making our ( My and Liam) extension

#

an App?

light siren
light sleet
wary nacelle
#

soo..

#

Yk Electron-Vite @light siren

#

as i said its basically Browser

#

but made as an app

light siren
wary nacelle
#

yep

#

with my extension

light siren
ocean venture
#

what is this platform?

wary nacelle
#

?

light siren
wary nacelle
#

sdfgfsd

wary nacelle
#

*forgetting

light siren
# wary nacelle ?

releasing on any official platforms would be an issue in that case

#

cause it's basically arena but with fixes

#

lets stick to the extension

#

he is right yk

#

like nobody asked

light sleet
#

ff

#

fr

wary nacelle
#

Puter.js

#

whatever lets block him

light siren
#

we do have our own solutions to use opus

wary nacelle
#

so he doesnt flood our chat

light siren
wary nacelle
#

so where were we-

light sleet
#

rule 3 and 4 mogs him anyways lol

#

blocked

inland quest
#

i want back my Gemini 3.1 pro + Opus 4.6 thinking combo 😭

light sleet
wary nacelle
#

yeah right

thorn nebula
#

Ain't no way one guy ragebaited all of u 😭 💀 "zs" bro's a menace

wary nacelle
#

so why it wasnt a good idea make LMArena app?

light sleet
wary nacelle
#

with built in extension

gray isle
light sleet
#

It's his main account

light siren
light siren
gray isle
#

all? ahh... i just joined this conversation... (baliw yarn)

light sleet
#

anyways the app

wary nacelle
#

yes app in sizes will be lil bit heavy like 100-200 mb but even faster than LMArena

#

i think

light sleet
wary nacelle
#

prob 7 idk

silent tree
#

would it be available for Android too 😔

light siren
#

@wary nacelle can we first make auto fixes option so the app just fixes automatically

wary nacelle
#

i only make frontend

light siren
wary nacelle
#

peak Auto Moderation

light siren
#

ha

wary nacelle
#

Oh RIGHT

#

on prompts

#

lemme give you my current worked Extension zip

#

dms

#

i wish this server had proper voice channel

feral bloom
#

I know the platform

whole sundial
#

it used to but they are gone now, they must've just removed them

spring oar
#

for study i think chatgpt is the best

whole sundial
# light siren ?

there were two voice channels, "research lounge" and another one

light siren
#

insane

#

good job

#

lmao

wary nacelle
#

should we lowkey rename extension to "Arena Pro Max" 💀

#

or nah?

light siren
#

oh hell nah

#

but we can change to smthn that makes sense

wary nacelle
#

Well-

sterile tartan
#

Lol is this real?

#

Or lie

wary nacelle
#

real probably but kinda the website itself is made with ai-

light siren
sterile tartan
#

I see

wary nacelle
#

HOLY SHIT-

#

wait

#

@light siren i have a pretty DECENT idea

sterile tartan
#

Proxy?

wary nacelle
#

yes-

sterile tartan
#

Knew it

wary nacelle
#

well not proxy

light siren
wary nacelle
#

i can get the api directly

sterile tartan
#

I just call it Proxy

wary nacelle
#

then just plug in Opus 4.6 and other peak loved ais we all love

silent tree
#

Or Arena Extra

silent tree
wary nacelle
#

Arena Tools?

silent tree
#

Arena Tools is good

#

too

wary nacelle
#

@light siren

light siren
silent tree
#

Arena+

wary nacelle
light siren
#

sure

sterile tartan
#

Long prompts don't work

light siren
light sleet
#

yall getting fans now

sterile tartan
#

no.

light sleet
#

lol

light siren
#

like wat u wanna help

#

not happening

feral bloom
#

Nah

light siren
#

then

silent tree
#

They're making a chicken named arena tools, it tastes delicious

light siren
light sleet
#

nah it tastes like heaven

light siren
silent tree
#

better with the side of LMArena Sauce

feral bloom
#

You are welcome

light siren
silent tree
light sleet
#

ChickenArena 🔥 🔥

wary nacelle
#

KFC Arena

#

nice new Chat Mode name-

#

wait but fr-

#

i already have Cooking Ai

#

not bad idea to implement it into Arena

#

lowkey just make LMarena power with ai response

#

and the framework of cooking

#

and stuff

#

handled by extension

red chasm
#

Thanks bro

#

I wish there were more kind people like you

red chasm
tulip parcel
#

You get like 2 prompts on that website

burnt sinew
#

Meta actually made something good what?

inner relic
wary nacelle
#

For Arena Fixes extension

light siren
#

help us find the old lmarena ui yall

wary nacelle
#

Prompt Enhancer preview

light sleet
wary nacelle
#

Oh its only the start

#

wait till you see i add to it Ultra Fast Web Search

#
  • Chat context extraction
#

so incase chat breaks

#

you can start a new one

#

@light siren is actually cooking too

#

but smth else

light sleet
#

there's one here

past socket
#

goly sht

#

they got rid of 2.5 pro?? srsly???

#

are you fk ing kidding me

hollow mulch
#

They should change the name to Arena Captcha AI, cause I keep getting a captcha on every prompt

light siren
hollow mulch
wary nacelle
light sleet
worldly plume
light siren
grave panther
#

where is video generation option?

wary nacelle
hollow mulch
wary nacelle
hollow mulch
wary nacelle
#

performance was fast

#

no forced comparisons

#

No captchas

#

*Recaptchas

hollow mulch
#

Im hate captcha

#

Is alright when you need one or two verify

#

But then more after 4 or 5 image choose keep going, even misunderstanding wrong for that

grave panther
#

which tool Lmarena use for video generation?

gray isle
wary nacelle
hollow mulch
#

@hollow mulch i can tag myself lol

#

So who make arena ai?

#

Even the site get to known more from tik tok

gray isle
hollow mulch
#

But why arena have to add captcha? Because some guy create new acount to using limited model?

gray isle
#

well after all, the Extension feels like a gift from an angel.

hollow mulch
#

Guy did yupp ai closed or the site still there?

wary nacelle
toxic mulch
#

hi im new

hollow mulch
toxic mulch
#

why is the ai crashing so much

#

the smart models

wary nacelle
hollow mulch
toxic mulch
#

ah

gray isle
hollow mulch
toxic mulch
hollow mulch
toxic mulch
#

i am wondering how the website still exists

hollow mulch
#

Here

#

Im think they got sponsor or smth:/

gray isle
#

that's my current bug now

wary nacelle
#

wanna know why?

#

CUZ EVERY SINGLE PROMPt you send

#

EVERY SINGLE DROP

#

of data

#

is used for their data

#

and models training

#

basically no privacy

gray isle
worldly plume
wary nacelle
#

our projects are just training data

gray isle
#

whatever data that we input, the output would become the AI's knowledge? am i right?

#

because that's been a theory for a long time

hollow ivy
light siren
gray isle
light siren
wary nacelle
hollow mulch
#

So we are just like hamster in lab?

#

Holy shocking right now

outer flicker
#

why still have claude opus in leaderboard lol 🙂

wary nacelle
#

and it has 1504 user opinions

outer flicker
#

possive opus so much

#

even me fr

obsidian cargo
outer flicker
#

yep no one can't pass peak

gray isle
gray isle
cursive timber
#

i used to have np 20-30 anki cards

#

summary of yt video doing very good

#

but now i have only sonnet grok or this muse spark or something

#

before i used to use claude opus 4,5 and gemini 3.0 pro

gray isle
#

now it's reduced into two

cursive timber
#

i discovered arena in october 2025

#

before i just using chatgpt

#

and randomly gemini

gray isle
#

me at nov 2025

#

actually shocked that something like this existed

cursive timber
gray isle
#

but didn't know i can actually integrate LaTeX coding until January

#

then i joined this DC server because of errors of GPT 5.2 and Opus Thinking

#

at February

#

and most of those errors were caused by prompts and the results of my LaTeX

cursive timber
#

i joined because opus disappeared

#

and other models

#

i still have chats with opus 4,6 and 4,6 thinking

cursive timber
#

and some others

outer flicker
#

how brooo

#

????

cursive timber
#

i created account

gray isle
#

i discovered this through a sharing subreddit

#

i still benefit in that subreddit, because randomly i would find some post, that there would be 60% in some certain cinema

#

like a promo like this, it's a sharing subreddit after all, idk if America has a version of that

gray isle
#

that would be 3.35 usd

#

cheap for americans, not for us tho

sterile tartan
#

Economy Differences

#

And that save 96 is just a gimmick

#

Just increase the price by extra and label it as discount

rigid pasture
#

Add one more to the list

gray isle
#

J-co is popular here though compared to krispy kreme

#

you would found random reseller across a street.

gray isle
sterile tartan
#

There are always cheaper sellers

#

Just gotta find them

gray isle
#

because it gets sold out in 8 hours

#

scalpers as you call them

sterile tartan
#

💀

#

Well that's inconvenient

spring oar
#

Opus 4.6 is nerfed ?

spring oar
#

like the sota model ?

outer flicker
spring oar
gray isle
#

like it's gone?

#

or the quality is gone?

spring oar
#

with opus

gaunt falcon
#

Helo

worldly plume
modest topaz
# spring oar

Heavily nerfed and hallucinating a lot. The Api is still performing well, but not like it used to. Ask it a simple question like I want to wash my car and the car wash is 50 meters away, should I walk or drive? it says to walk to the car wash. Earlier, it would say drive since you want to wash your car. Opus 4.5 is doing well, use that instead.

modest topaz
#

Api in antigravity.

rose jackal
spring oar
modest topaz
spring oar
#

but opus 4.5 is less good than sonnet 4.6 ?

modest topaz
modest topaz
floral canyon
#

how are they gonna make muse spark free long term

spring oar
#

sadly opus 4.6 he can be the best

dusty hazel
#

Am I the only one who won't see Opus, 3+ Pro, what else. Was I banned from trying top llms 😕

outer flicker
gray isle
uncut plume
#

how can i get out of this loop?

spring oar
# spring oar

Poll: Opus 4.6 nerfed in claude chat ?

Winning answer: Yes (option 1), 14 of 14 votes

gray isle
#

although GLM is just like a knock off version of Sonnet (100% of the time)

#

but it compares itself to Opus

dusty hazel
#

It achieves the result I want much faster than Opus or 3.1 for me for more real-life (not hard problem coding) tasks

gray isle
#

okay i wonder what's this, a scam or what

#

oh i thought it's a bot

dusty hazel
#

Glm's agent mode is very cool. Chinese AI impresses me a lot

gray isle
#

like no. 2 ai you can suggest

#

(I didn't say no. 1, because no. 1 is disputable)

dusty hazel
#

Glm is better than 3.1 Pro and Opus as I said in what I said it's better, and it's not bad. Meaning it can achieve everything

#

Deepseek is far behind, but its MAI (Microsoft's) version is impressive

#

Benchmarks agree on all this

#

I believe it's still free on openrouter btw

#

(mai)

#

(the best of deepseeks)

uncut plume
#

i was making a website on here lol

outer flicker
#

bye

azure drum
#

@echo aurora where is mythos bro?

exotic tartan
azure drum
gray isle
dusty hazel
#

I dunno

gray isle
#

me opus? are u serious by that word

dusty hazel
gray isle
#

i'm not technically correcting, but, and but. "Cause they took my Opus" would be a better term than "Cause they took me opus" like what does me opus mean? the hell.

wary nacelle
#

Progress of Arena Fixes so far

#

@light siren has been working on LMArena OG theme

#

waiting till he send his part so we can cook

#

(this is Enhance Prompt button)

gray isle
#

wuss dattt

#

april 26 of whom chatbot?

azure drum
dusty hazel
#

Like, stay 2 weeks behind plebs

gray isle
#

it might be smashing something, but i do not believe the "bleep" out of it..., something is cooking yall

storm dust
#

💀

gray isle
#

how the hell does it win

#

the site itself has file upload, WHILE IN ARENA IT DOESN'T HAVE ONE, SO WATTT EN DA INTAYR WUORLD ES GUENG ON

storm dust
#

crash out

worldly plume
# worldly plume

Poll: Random question: do you think lmarena will remove Gemini 2.5 pro?

Winning answer: Yes (option 1), 11 of 22 votes

dusty hazel
#

Glms are really seriously good if you're fine with like two extra fixes after the first prompt

#

I only don't code with glms now, but for basic tasks they're my go-to now

storm dust
#

well for coding glm is definitely good

#

but not good for long files

#

because it usually makes typos

dusty hazel
#

Yeah, actually glm-5+ are really, really good at coding

#

Solve everything for me. 3.1 Pro just does it slightly faster and prettier for me usually

wary nacelle
# wary nacelle

dude making a system prompt is so hard 😭
it keeps finding a way to bypass it and use it to JUST CASUALLY CHAT WITH THE USER INSTEAD OF ENHANCING PROMPT

dusty hazel
#

What I found really bad about Geminis is that 2.5 Flash Lite with a good system prompt works much better than 3.1 Flash Lite. Why?

#

2.5 Pro is often also more fitting even though obviously less capable and a bit unnatural

dusty hazel
gray isle
#

like in my head right now it's saying:

Did he read the announcement?

Was he actually banned?

What the hell is going on and why did that happen to him?

Oh lord... have mercy

dusty hazel
#

Which announcement?

gray isle
#

like i can't use opus too

#

the fact, this app gave me a disappointing birthday present, something that will make me pissed.

dusty hazel
#

Oh wow they're back but only in my old chats

#

Yeah Opus been unavailable for quite a long time already,

gray isle
dusty hazel
#

But other top llms aren't there for side by side for me

#

Since like two days ago

dusty hazel
#

I see sonnet 4.5, 2.5 Pro, 5.2 high and recent 5.4 mini high

#

Oh, 4.6 sonnet is also there. And 3 flash

#

These are the top for me. No better

#

Are the providers removing their llms from arena? I see better models on screenshots in the announcements here!

#

Maybe used too much without voting much?

#

What stopped me HARD from voting more often is captchas, so annoying, every time

#

I wish I also knew what is counted: only the last reply or the whole conversation starting with the last vote

#

Cuz it's getting harder to vote immediately this last year

gray isle
dusty hazel
#

Happened pretty much all of a sudden btw, I only made like four text prompts that week, and then after a couple of days...

gray isle
dusty hazel
#

Half of the time actually, esp for image generation

gray isle
#

for me it's coding generation

dusty hazel
#

For text, not half of the time)

gray isle
#

specifically my favourite request

#

LaTeX application

light sleet
#

ever since I joined arena, I've been SO invested in AI bro.

gray isle
#

i use Opus for LaTeX after all

light sleet
gray isle
#

but perplexity sucks

surreal zephyr
#

Slop

#

Shlurp

#

Slop slop

#

Shlurp shlurp

silent tree
#

no way u can js ask an AI in arena to make u a web with the Opensource Model link and it can install everything for you and load the AI.

#

Crazy

#

Found about it rn

#

😭

dusty hazel
#

Blizzard must create Slopcraft the real-time vibe coding strategy

silent tree
gray isle
# gray isle me who's only been using 4 AIs before

Ai Studio (found it in 2023), Chatgpt (before it became limited this year, disappointingly), Bing (a very good idea in the past, it just downgraded actually, and that's the actual problem, although Microsoft can say that they own OpenAI), then some independent random AIs or meta ai whatever

surreal zephyr
#

Dang sora truly was the only good video generator

dusty hazel
silent tree
#

me personally, I am waiting for GPT Image 2 and excited for it.

surreal zephyr
silent tree
surreal zephyr
silent tree
#

opus 4.6 nerfed SO BAD.

#

Anthropic is giving themselves a fallout.

gray isle
surreal zephyr
#

🤣

#

Which makes it more creative but garbage to work with

dusty hazel
#

No way that github misposting isn't a part of their deliberate rollout strategy to cover the expenses of Trump's attack

gray isle
silent tree
gray isle
#

after all it's just Opus 4.7 to me lol

surreal zephyr
surreal zephyr
#

Opus 4.7 will be mythos but quantized and actually reliable

gray isle
#

me having selective amnesia. and i can't pronounce words oh yeah

surreal zephyr
gray isle
surreal zephyr
#

So mythos is basically a hype bait

gray isle
#

freaking out, saying it's good

dusty hazel
#

Opus is good for being objective and not agreeing to any crap you write. Besides this, I don't know why ppl actually choose Anthropic. Their tasks aren't probably hard or demanding stability maybe

gray isle
#

the name itself, already makes it look like a fool. like "Mythos" are u saying it's a myth, like you should just named it Omega

#

it's so corny

dusty hazel
#

Wanna be rockstars. OpenAI did it better with o for omni

dusty hazel
#

The next day after I came up with choosing omni for a lot on arena btw 👀

gray isle
#

is that even the right spelling, from what i just wrote

dusty hazel
#

I like how Jan Assmann uses the word "mythomotor"

#

In essence, everyone's drivers that embody the essence of current collective memory

light siren
#

who wants it

storm dust
#

i want you 😏

light siren
storm dust
#

ban

astral cobalt
#

bro why all llm gone

dusty hazel
#

There was some paper that claimed that for most tasks "instant" simply gives better results

storm dust
#

is it just me or did people start to use gifs from giphy more often?

astral cobalt
#

all ia gone

gray isle
light garnet
gray isle
#

like what model

gray isle
astral cobalt
#

high

sterile storm
#

PLEASE OPUS 4.6

gray isle
#

don't you guys understand budget cuts?

#

like haven't you experienced financial problems....

dusty hazel
#

Have you noticed Geminis became much faster in aistudio recently?

#

2.5 Pro begins answering within like two seconds for me very often

light siren
surreal zephyr
light siren
#

its our extension

surreal zephyr
light siren
golden ocean
gray isle
dusty hazel
surreal zephyr
gray isle
#

if it's cheaper to prompt. it should have been here now, and named as. Daddy's wishes.

storm dust
#

ampro mythos was mainly supposed to be a coding model if i am right so that makes sense

surreal zephyr
gray isle
#

being milked by EA

surreal zephyr
#

With no guardrails

#

And its still bad

#

🥀

storm dust
#

uhh

abstract hinge
#

Did they remove Claude Sonnet as well? Are even the free models going to start getting removed?

storm dust
#

im not sure about that one

#

you were the one saying that gpt 6 is coming this week

#

i dont see it

surreal zephyr
#

This week was the plan

#

100$

gray isle
surreal zephyr
#

Gpt 6 was delayed from last week because mythos wasnt released

storm dust
#

my eyes are wide open so lets see

surreal zephyr
#

Gpt 6 spud was supposed to come same day as mythos to literally make fun of mythos

gray isle
surreal zephyr
#

But mythos was released.... as private

#

Which ruined openai plans

abstract hinge
#

Claude Sonnet 4.5 and other versions disappeared for me.

surreal zephyr
#

So instead they are pushing the plan (and superapp) first

#

Gpt 6 spud next thursday if stuff go well

surreal zephyr
#

Or in codex but you pay via api

#

It costs 10x of normal 5.4 xhigh

storm dust
#

i have no idea where you got that information from

abstract hinge
storm dust
#

but for me that seems veryy unlikely

#

people tend to believe that it will release in june

surreal zephyr
#

😭

storm dust
#

what the

light siren
storm dust
#

pro confirmed?

surreal zephyr
#

Thats out wym

storm dust
#

no i think you're pro confirmed

#

bro is pro

surreal zephyr
storm dust
#

i never do lol

surreal zephyr
#

Its all there, obvious

storm dust
#

i never used those sites

#

seems too much pressure and seems pointless

surreal zephyr
#

But yeah, i had a screenshot somewhere, tibo literally was surprised mythos is private only too

#

Cuz they wanted 6.0 as counter for it

storm dust
#

ok but is gpt 6 a generation leap or not?

surreal zephyr
storm dust
#

because i have been seeing it is

surreal zephyr
#

And thats MASSIVE

echo aurora
storm dust
#

oh thats awesome

surreal zephyr
#

Its new base but its going to be more opus like prolly

#

Because its 6.0 not 6.5

#

Its like fresh start again

#

More creative, less reliable

storm dust
#

but it definitely is not a big leap to agi 😭

surreal zephyr
#

Unless they fine tuned it strongly with the bonus time they had

surreal zephyr
#

Agi is literally moving goalposts

storm dust
#

artificial general intelligence

#

is that correct

surreal zephyr
#

The original definition was "better than average human at most tasks"

storm dust
#

well i really dont know how to define it because i dont really know what agi means

surreal zephyr
storm dust
#

im just saying for some reason some certain people say that it is a big leap to agi

surreal zephyr
#

They dont know nothing lol

#

Agi = average human

storm dust
#

it seems very unlikely

surreal zephyr
#

Can average human write working code at 200 tps?

#

Can average human solve frontiermath problems?

storm dust
#

there's no way something thats a big leap to agi is going to release this week or next week

surreal zephyr
#

Arc agi 1 was agi

#

Arc agi 3 is literally "more than asi"

#

Lol

storm dust
#

well i guess i definitely dont know what i am talking about

#

or maybe im wrong

#

or maybe im confused

surreal zephyr
#

Guess which model is "most agi" rn?

#

Because the answer is funny and unexpected

storm dust
#

no idea

surreal zephyr
#

Why? Because its multimodal properly

#

Yes its stupid

#

But its closest to agi

#

Because its multimodal

storm dust
#

ban

#

call the mods

surreal zephyr
#

Gpt 5.4 and opus 4.6 are both way better at coding than gemini 3.1

storm dust
echo aurora
surreal zephyr
light siren
#

yall is mercury 2 good

storm dust
#

so basically gpt is going to beat gemini's latest model?

surreal zephyr
#

Thats agi

storm dust
#

yeah

surreal zephyr
#

For agi you need good vision and decent smarts

#

Not "very smart", very smart is ASI

golden ocean
#

world models

surreal zephyr
storm dust
#

i thought that after AI comes ASI

#

lol

#

makes sense

surreal zephyr
#

Asi means "better than humans at everything"

storm dust
#

from what i understood

#

AI = mimicking humans
AGI = acting like humans?
ASI = overpowering humans?

surreal zephyr
#

Its different thing

storm dust
#

ok

surreal zephyr
#

AI = does anything at all without needing to teach

#

Ai is like a fly or mosquito

#

Agi is like a dumb factory worker

#

Asi is like Einstein

light siren
storm dust
#

i see

surreal zephyr
#

Use gemini 3.1 flash lite instead

#

Mercury loses to qwen 27b

light siren
surreal zephyr
storm dust
#

liam you're mine

surreal zephyr
#

Like literally

storm dust
#

😏

surreal zephyr
#

It uses outdated architecture

light siren
light siren
hollow mulch
#

Im getting 6 question from captcha and still get wrong, why is time is have too much?

#

*this

storm dust
#

this might be grok's nightmare

dusty hazel
#

Went completely unable to give responses in the right structure I requested

#

This was a shocker for me

fiery gull
dusty hazel
#

I wonder how they trained them and what changed

fiery gull
#

The flash 2.5 is better lol

dusty hazel
surreal zephyr
storm dust
#

immediate ban

light sleet
#

Mods get him

#

@echo aurora When is Super Pineapple arriving? It's already Sunday.

kind rivet
#

There is no latest gpt high, no claude, no gemini, so why did you post in the announcement group that every model is present there?

#

Claude opus 4.6

echo aurora
shrewd citrus
#
Poll: More reliable for Legal work?
Winner: Opus 4.6 Thinking (15 of 20 votes)

storm dust
light sleet
#

<@&1349916362595635286>

light sleet
#

these hacked bots GOTTA stop now.

#

What is discord doing 😭

surreal zephyr
#

opus by far. or gemini

#

@echo aurora delete pls.
cant you set up autofilter for when those exact 4 images are posted?

toxic mulch
#

whats this

wary nacelle
# surreal zephyr Speed and price. And i literally never said its a good model i said its better t...

Dude the hell you mean Mercury 2 is worse than Gemini lite 3 cuz of old architecture

Dude mercury 2 is the one that uses a new architecture: instead of sequential token generation it does parallel generation based on diffusion

Basically that means it can even fix its own previously written tokens and be very fast while saving computing power. Just give that model proper training like Gemini or GPT and it literally beats anything

#

All ais till now were just expanding their training data

#

Thinking if they keep expanding it will get better

#

That's not how it works

#

Mercury 2 was the first one to change the architecture and try smth else

#

And guess what Google is stealing that architecture

#

To make new Gemini 4

silent tree
wary nacelle
silent tree
#

They said no

#

😔

wary nacelle
#

Then wait till proper obfuscation of code

#

So that we just give the source

#

And y'all put it manually as developer mode extension

wary nacelle
#

Y'all could find someone

wary nacelle
#

Who has registered a Google web store developer account with paid 5$

#

So we can upload

#

Or smth

#

Idk

#

We are broke 😔

#
our currency isn't accepted by google
surreal zephyr
wary nacelle
surreal zephyr
#

google

#

or D3PM (Discrete Denoising Diffusion Probabilistic Models) by Austin et al., published in 2021

#

diffusion is old as hell and way worse than seq

#

thats why it was abandoned in the first place

#

its like "yo lets make wheels square again and call it innovative"

#

Diffusion-LM (Li et al., May 2022)

wary nacelle
#

This is Diffusion LLM

surreal zephyr
#

D3PM proved the concept but performed poorly compared to autoregressive models, about 2–3× worse than GPT-2 on language modeling benchmarks

surreal zephyr
uncut plume
#

how to fix or at least redo

surreal zephyr
#

diffusion is faster but dumb so its useless

#

diffusion is not even as good as NON REASONING autoreg models

wary nacelle
# surreal zephyr yes those all are

Second of all, inception uses diffusion (aka parallel) only for token generation. That means they can still train the ai on the same data as Google or Claude and get the same results but faster. Also it has fewer hallucinations because it can fix its previously written tokens

#

All that is different is the way ai generates tokens

surreal zephyr
#

ask claude or gpt if you dont understand it

wary nacelle
#

So saying diffusion was dumber than gpt back then, when it was trained on less data than gpt

#

Is same as comparing grok to gemini

surreal zephyr
wary nacelle
#

Obviously cuz it wasn't trained properly no wonder

#

When Gemini Diffusion releases then you will see its potential

surreal zephyr
surreal zephyr
wary nacelle
#

Second of all you are comparing a fish to a bird

surreal zephyr
wary nacelle
#

Diffusion model purpose of mercury 2 is completely different

surreal zephyr
wary nacelle
#

U are mistaking knowledge with intelligence

#

All ais are just memorizing stuff and mixing it together

surreal zephyr
surreal zephyr
#

ok you have 0 idea how ai works. blocked

#

enjoy your mercury while i enjoy gpt 5.4 pro

#

i WONDER which one is better

wary nacelle
#

And yet you keep comparing a jet with an air fryer

#

Look if you are building a project like idk code refactoring ai or Ai Voice Assistant

#

U obviously would use Mercury 2

#

Cuz of its speed

#

And for example voice Assistant speed really matters

#

And for simple stuff like telling voice Assistant to disable room lights or smth with proper system prompt

#

U would still use mercury 2

#

Cuz who needs it to be smarter when I want it to do simple tasks but faster

#

U are comparing GPU with CPU practically

#

GPU can do tasks at massive scale but not complex ones

#

While CPU does complex tasks but not at massive scale

wary nacelle
#

Cuz it's literally made so integrating ai into projects can be a thing

#

And a fast thing, not waiting 19 mins until the ai decides to turn on the lights in ur room or smth

#

By ur logic coders now are basically stupid at milking the cows at the farm

#

It's simple task

#

Yet a farmer does it better

#

That's ur logic

#

And second of all, to your statement "diffusion models are 3x dumber", let me clarify: a diffusion model's intelligence, like any other ai's, is based on training

#

And diffusion models are just harder to train

#

Because they are more complex

#

They literally think differently

thick blade
#

Chud

wary nacelle
#

Whatever, trying to make ppl like u understand that comparing a motorcycle to a plane is dumb is useless

wary nacelle
thick blade
#

Are u a chud

wary nacelle
#

What is chud

thick blade
#

U dont know in big 2026?

wary nacelle
#

I am not Twitter or reddit guy so nah

wary nacelle
thick blade
#

Idk

vernal raft
#

is there any way i can extract a chat from arena?

indigo rampart
#

Bro, has anyone seen the model muse spark in arena?