#general

1 messages · Page 303 of 1

strong ferry
#

They won't. At least not for a year. They're stubborn.

outer flicker
#

did opus really comfort you guy?

hollow ivy
#

comforting? not really, but entertaining and intelligent

strong ferry
hollow ivy
#

Opus-4.6 was one of the very few models, which had an actual (long-term) understanding

outer flicker
#

just like blade runner 2049 k lost joi, we lost opus 😄

hollow ivy
#

at least, we still can use Sonnet-4.6 for free
Let's be thankful for that!

#

and also GPT-5.3, 5.2 and 5.1

#

and Grok-4.20

#

and Gemini-3-flash

#

and Gemini-3.1-flash-lite

outer flicker
#

yeah

hollow ivy
#

sonnet isn't bad

#

and GLM-5.1 and Qwen-3.6

strong ferry
outer flicker
#

gork and gpt know comfort us is just officer K

hollow ivy
#

oh

#

but in battle, it is available

#

grok not gork lol

outer flicker
#

why they don"t add onther searching model?

fast dew
#

It's available there too

limber dock
#

so it wasn't just me then haha. Opus is honestly amazing though, and if it were just a tiny bit cheaper, I totally would've gotten it 🥹🥹

strong ferry
hollow ivy
#

and there's also Minimax, Kimi and Mimo

strong ferry
hollow ivy
#

what about Gemini-2.5-pro ? that is still available

#

i used it last year for RPGs, was quite good

#

(if prompted well)

strong ferry
hollow ivy
#

using special syntax, etc

#

have you guys tried out new Qwen (3.6) and new GLM (5.1) for roleplaying?

sand loom
#

Anyone testet sonnet 4.6 vs Qwen 3.6 Plus in coding? Can it compete?

rustic gale
#

Tbf, gemini was bad with long outputs even when 2.5, at least when it came to creative writing. Which is honestly defendable from the devs' POV, but it is what it is

strong ferry
limber dock
rustic gale
hollow ivy
#

but what if you set the temperature above 1.0 for 2.5pro (in ai studio) ?

#

for roleplaying/creative writing

rustic gale
#

(also perhaps the reason why it costed astronomical values for arena. 😄 Imagine how much it costs in API; every single output is like plus 10k tokens)

strong ferry
rustic gale
strong ferry
hollow ivy
#

what about Mistral?

#

(for RPGs/sandbox & creative writing)

normal abyss
hollow ivy
#

and Command (by Cohere) ?

#

and Seed (by ByteDance) ?

strong ferry
#

Not sure how it delivers, but I don't usually do creative writing these days. I ask for long-form essays based on movies and stories well known. Any other model besides the latest Gemini pro and Opus always hallucinates the stories and end up being like 90% false. It gives me a headache.

fast dew
limber dock
rustic gale
#

Command is a coding model last time I checked. And both are kind of third echelon. Because tiny. Which isn't inherently bad, there's just the league of models you can run on a 4090 and the league of those you can't

hollow ivy
strong ferry
rustic gale
# fast dew Lol is the 4o situation happening again

4o was a machine which was unhingedly trying to do stuff in the 'sound human' department. It was so-bad-it's-good category. It won over so many hearts because at the time it was the only one who could do that. Opus does this without the excesses

fast dew
#

I don't get all the emotional stuff but i agree with that

hollow ivy
#

"Life finds a way." - Dr. Ian Malcolm (Jurassic Park)

rustic gale
fast dew
#

For its time probably

#

Yh

#

I remembered the first time i got 4o

rustic gale
# fast dew Mehhh

There was a time people were developing stuff with -13b hosted locally. And don't you all forget this.

fast dew
#

I cant lie though I didn't like it

strong ferry
#

Opus also had a way of deeply connecting plot threads too. You could go on for a while and it will try to avoid being repetitive too. Man, this sucks.

fast dew
#

And we thought 32k was too much

hollow ivy
#

Sonnet-4.6-thinking?

#

does it have a thinking-variant?

light siren
strong ferry
#

Hold on, testing Sonnet now. Wish we had thinking but will have to test the normal one first

mild spade
strong ferry
fast dew
#

Lol from what i've seen like 95% of the community r using opus for creative writing

limber dock
mild spade
strong ferry
#

Oh snap! So Sonnet 4.6 gave me a 15K word response 😲 The ONLY problem is the slight lack of emotion. So the best way to compare the two is

Opus - The Author
Sonnet - The Assistant

fast dew
mild spade
fast dew
#

Where u posting them

mild spade
#

Ao3

fast dew
#

Wattpad?

#

oh

mild spade
#

people are amazed

#

opus is just really natural

mild spade
hollow ivy
#

(but no opus there, but other models, which might be interesting)

mild spade
#

I miss opus

fast dew
#

U probs gonna have to wait for a new model

mild spade
#

I just trust

strong ferry
limber dock
mild spade
fast dew
mild spade
fast dew
#

O

mild spade
#

😁😁😁

#

Trust

hollow ivy
#

(so maybe 1 message to opus-4.6 there)

fast dew
strong ferry
fast dew
#

Releasing GPT-5.5 next week

small swan
#

Is there a way I can create videos using Seedance for free for testing purpose ?

hollow ivy
#

it seems Google-Deepmind has a model-release soon®©

outer flicker
#

why you edited so much lol

hollow ivy
#

lol :)

#

because my fingers suck ^^

#

(and the keyboard)

outer flicker
#

still im use lm for chatting help task, make code, and roleplay

hollow ivy
#

which model do you now use mostly, in arena.ai?

outer flicker
hollow ivy
#

maybe we need to eat more micro-nutrients?

#

(broccoli ^^, actually yumm if spiced correctly)

outer flicker
hollow ivy
#

have you tried out sonnet-4.6 in coding?

#

is it decent?

outer flicker
#

everytime im try is pop up 'something went wrong'

hollow ivy
#

in Python it should still rock

#

as that is the easiest language for AI

outer flicker
#

but im make driffent ai work together to make a appliction all time zone of all country

#

in coding mode

hollow ivy
#

second-easiest might by Java or javaScript/TypeScript

#

java is good, if python is too slow

#

but C# is faster (and of course, Rust/C/++/Go also are faster)

#

also Nim and Zig (both are faster than java)

#

of all really fast languages, C might be the best for AI

#

(Go is slightly slower than C, but is good for multi-threaded apps)

outer flicker
#

are you try coding in arena is good?

hollow ivy
#

i only used text-chat mode

#

and compiled/tested it myself

#

with Opus-4.6-thinking, that's a breeze, of course

outer flicker
#

yo im like coding but each model do driffent task

hollow ivy
#

have you ever coded with GPT-5.4-high?

#

is it decent?

outer flicker
#

is qwen good at chatting? they say the model is 1m token

outer flicker
hollow ivy
#

Qwen is not as good as Gemini-3.1-pro, but probably better than g2.5-flash

#

i wonder, which chinese model is the best for

  • coding & debugging
  • creative writing & world-building
  • GMing (intelligent gamemaster-ing)
outer flicker
#

oppen ai the first ai oppen AI era

hollow ivy
#

lol?

#

closedAI now

outer flicker
#

yeah before

#

cause more good onther model now push gpt to down

#

im found grok imagine not free anymore 🙁

hollow ivy
#

grok-imager is gone?

#

i wonder, what "spud" can do..

outer flicker
#

yeah go check it

hollow ivy
#

the new OpenAI model

outer flicker
#

can make some not strict too much like onther ai

hollow ivy
#

i have the feeling as if this month another model-update will come out…

#

yeah, Grok is not as censored in imaging

#

and also in roleplaying

outer flicker
#

even grok have ani

hollow ivy
outer flicker
hollow ivy
#

it is not as strict, so could be fun for certain people

#

it allows more freedom in your RPGs

#

but not as intelligent as Gemini-3.1-pro

#

GLM might also allow more

#

(And Mistral)

outer flicker
#

im will try

radiant pewter
#

My chat of me direct message with claude kept loading infinitely does not give an answer, and it does not help to refresh the page or log out, someone else with the same problem?

modern wedge
#

So when they will bring back the Opus? (I writing goddam peak fiction but now it has too pause)

hollow ivy
#

but then, there will be better models

modern wedge
#

Hell nah

outer flicker
#

anthropic leak their new model name capybara

#

yeahhahah

molten cargo
#

how did you all even use opus in arena? I used to hit the rate limit after 4-5 responses, surely that's not enough for writing stories with it or whatever the use case was

hollow ivy
#

i wonder if Capybara/Mythos-5 really were april fool's jokes (according to some people, they were)

radiant pewter
molten cargo
rustic gale
hollow ivy
rustic gale
#

And yeah, we're not talking about chatting about life. Creative writing can easily mean something like 'I send a prompt of 2k tokens and would like to have it expanded by you'

sour spear
#

I paid 10$ extra on top of my subscription, and got 6 replies out of the model

hollow ivy
#

there's a new chip for that

#

an ASIC-based one, iirc

sour spear
whole knot
#

when is opus coming back ?

hollow ivy
#

but Cerebras also had a good approach

hollow ivy
#

depends on inference chips available worldwide

native flame
#

Hi guys , since opus is no longer available, do you know another site where one can use it??

#

I used to use it for japanese language learning:'v
It was the better of the three

rustic gale
heavy blaze
#

so quick question, if i have an already made website, how do i upload it to arena so i can use the models there to edit it.

native flame
heavy blaze
rustic gale
native flame
heavy blaze
vivid coral
#

for those asking why opus is gone, just look at twitter and the controversy around Anthropic right now. They are heavily limiting the $200 users now. They are PISSED. They screwed everybody that paid big for it. It's the classic bait-switch we see from some of these bad actors. Promise to be the only "safe AI", "moral AI", "pro-Black", "pro-LGBTQ", "anti-capitalism", blah blah blah.....Then once they bait you in, they switch up on you, and now only very rich and elite "nepo-babys" can afford and use it, who are 99.99% white and straight. The same kind of "bait and switch" many dictators have used to gain power. They ran the playbook perfectly, step by step. And now, you are seeing the result.

heavy blaze
native flame
vivid coral
robust jay
#

Hi

#

If anyone can test it, please rate it and tag me with @robust jay

heady sky
#

is it virus??

civic plaza
#

I really hope they add opus back. The other AI’s are okay, but they are getting on my nerves.

robust jay
#

It's not a virus.

signal pelican
#

oh sure, if it was, why would the person ever admit it anyway?

inner relic
#

e

robust jay
#

It's Google AI Studio, but I made the app for mobile myself.

rapid merlin
signal pelican
#

right... you can't just paste the file here

rapid merlin
#

the usage limits on Claude are absolutely ridiculous tho, yeah

#

I only got the 20 dollar plan but bro

heady sky
#

Claude is bs, it got leaked and within 1-2 weeks all chinese app will be same level

rapid merlin
#

I got usage capped within TWO MESSAGES

#

and the second one didnt even finish

robust jay
signal pelican
#

no thanks

robust jay
#

Use an antivirus program so you can see.

signal pelican
#

no

heady sky
#

@signal pelican stop being a menace wtf

signal pelican
#

you're free to test that for him then

heady sky
signal pelican
#

and there it is...

vivid coral
#

Google AI Studio's limits are pretty bad now too, better off just using it here

robust jay
#

No, it's basically Google AI Studio but as a mobile app, and simply because I had nothing to do and also because I don't have a PC to use it on.

icy frost
heady sky
rapid merlin
#

if it's fun why not

#

tho the 20 dollar plan would be useless if it was really needed for like anything practical

heady sky
robust jay
#

Someone brave enough to test the app I made, please, and for the love of God, give it a rating and say it's not a virus.

heady sky
rapid merlin
#

The quarter daily limits are absolutely horrid

heady sky
#

i use it in my work a lot

rapid merlin
#

you use gpt?

heady sky
#

nah it sucks

rapid merlin
#

I was talking about claude

heady sky
#

i use gemini

slow axle
#

Is video arena working?

robust jay
heady sky
slow axle
#

I think its broken

robust jay
#

Out of sheer spite, I'm going to make an LMArena app for mobile phones.

slow axle
#

Im not asking for that, but in the website I can't create videos

fast dew
slow axle
#

I think its a bug

storm dust
#

why did bro remove the screenshot

proper quiver
#

i finally vibe coded a roblox system since sonnet 4.6 worked

storm dust
#

"working hard"

#

i didnt even had to watch the video

#

i worked smarter

#

i understood everything from the name

rapid merlin
#

is gpt image 2 still in battle mode btw?

severe canopy
#

realease gpt2 right now oks !

dusty wagon
#

Wait so why was opus removed again? I first saw it was about due to financial issues, and then i saw another reason

pastel ember
#

Looks like the guys from Grok paid to have other top-tier competitors like Opus removed. 🤣

whole sundial
crimson ivy
#

When are the opus models going to come back?

#

Did they remove them because of the error issues?

sharp marsh
#

Dude

#

Opus 4.6 got removed

#

?????

analog owl
#

Hi @echo aurora , how are you? We talked in private a few months ago IIRC, about the captcha of death. Just so you know, Im getting the captcha of death again. Hope everything is alright on your side. Also and of course, happy easter to you, your family and loved ones as well.
Thank you very much for arena.ai and everything you do to make our lives better and easier.

normal abyss
kindred sorrel
#

I saw on YouTube that videos can be generated in a Discord server, but I can't find it. Can someone help me?

kindred sorrel
#

Bro, I saw on YouTube that you can create videos directly from a Discord server.

normal abyss
kindred sorrel
normal abyss
#

if your on that

sharp marsh
#

Gemini on aistudio was nerfed a LOT and I suspect grok to be the same

normal abyss
sharp marsh
#

I was able to do like hundreds of requests to gemini without ratelimiting, now it's limited to 5-10 a day

sharp marsh
sharp marsh
#

Yup

surreal zephyr
normal abyss
# sharp marsh Yes

i think its around 20, i never hit rate limits so i never really find out (could be much more or much less)

kindred sorrel
# normal abyss yeah thats from way way back

I want to create videos on a website, and I’ve made some, but there are a lot of options missing. On many websites, there are options like 10 seconds, 5 seconds, 4K, 8K, etc., but I don’t get those. When I click on the video option, it only shows an option to select an image and nothing else.

surreal zephyr
#

gemini had 100 requests they now nerfed to 20

sharp marsh
#

I was able to spam 7 hours straight in aistudio coding, gemini 2.5 and 3.0
When 3.1 released all models ratelimits were nerfed

sharp marsh
#

Also not 100, but 1000

kindred sorrel
sharp marsh
#

It got nerfed to 500, then 250, then 100, then 10

sharp marsh
rustic gale
#

And next step is zero because why would there be free cookies

normal abyss
surreal zephyr
sharp marsh
kindred sorrel
normal abyss
kindred sorrel
sharp marsh
#

Anthropic really killed free users huh

#

Jesus christ

normal abyss
kindred sorrel
#

🙃

normal abyss
normal abyss
kindred sorrel
#

Alright bro, can you tell me any free websites where I can generate videos in really high quality? If you can, please tell me.

normal abyss
# kindred sorrel Alright bro, can you tell me any free websites where I can generate videos in re...

this might not be what your looking for exactly but this has a few free Veo 3.1 fast generations https://tryveo3.ai/features/v3

#

not sure if thats ultra high quality though

fickle pumice
#

image 2

pastel ember
normal abyss
#

with free video generations its pretty impossible to find a fully free high quality option with settings and model selection icl

kindred sorrel
#

I’ve tried ten times, but it’s not working.

normal abyss
sharp marsh
#

How good is coding there

kindred sorrel
pastel ember
sharp marsh
#

Also what model were you using of grok?

#

Is it the thinking mode

kindred sorrel
#

Thanks bro

pastel ember
light sleet
#

48 Hours, I'm in uae and not scared but cautious.

sharp marsh
normal abyss
light sleet
#

Seedance 2 is free in capcut

sharp marsh
#

In reddit there were people saying thinking models have a 16 requests every 13 hours limit

#

Which sucks

normal abyss
#

also grok imagine is on here

sharp marsh
#

And anyways gemini 3.1 pro is better than grok 4 so I'd rather use that

#

1 million context too

normal abyss
pastel ember
#

Gemini feels overly corporate and heavily censored

quaint lily
#

how to stop beign forced using battle mode on direct since battle mode is broken

pastel ember
quaint lily
#

it's broken

sharp marsh
#

In terms of censorship

golden ocean
#

fr

#

where is bro using gemini from

#

use google ai studio

sharp marsh
#

^^

golden ocean
#

it gladly helps u destroy the world even

sharp marsh
#

And if it doesn't cooperate, gaslight it

#

99% chance it will let you do whatever

rustic gale
sharp marsh
#

Gemini 2.5 can be jailbroken

#

Extremely easy too

kindred sorrel
rustic gale
#

So is 3.1, but the question is whether simple galighting, as you put it, is enough, without resorting to specialized JB techniques, unless you consider gaslight as JB

sharp marsh
#

Oh nah 3.1 cannot be jailbroken

#

Neither 3.0

#

I tried

rustic gale
#

See? As I said, you have a pretty tame definition of 'whatever'. They most certainly can, but much fewer people have the know-how which they're also ready to share. Some exist

sharp marsh
#

You said 3.1 can be JB

kindred sorrel
#

@normal abyss 🤦🏻

sharp marsh
#

I said no

kindred sorrel
#

@normal abyss hello bro

sharp marsh
#

I didn't say whatever is perverted stuff either

#

If you're into that it's your problem

rustic gale
#

While getting NSFW (even of tame and vanilla variety, really) is probably the most often employed usecase of jailbreaking, it's not the only one. And yet again, with gemini 3.1, those can be done. Now that I think of it, perhaps there can be some additional problems with 3.1 pro, but definitely fewer with 3.1 flash.

normal abyss
#

you get 10 free credits

regal girder
#

I must be having some issues with tokens, because on Gemini 3.1 Pro after 5 prompts it displays an error and asks me to create a chat 😥

quaint lily
#

is there an option to remove battles from direct completly?

quaint lily
rustic gale
rustic gale
#

Well, that limitation was not present in the initial query. Restart. 🤷‍♂️ Remember that direct chat is basically charity since the site needs those juicy comarisons in addition to all your prompts

golden ocean
#

transformer

strong ferry
#

OH NO! Now I'm getting stupid token usage limits!!! WTF

golden ocean
#

LOL

regal girder
#

the same problem 😥

bright shard
#

@echo aurora I have a problem; the second message I send in code mode I get this message. And then it won't let me continue with the project, it's quite annoying.

light siren
bright shard
light siren
#

gemini is bad at coding anyway

#

why are u even using it

dense fern
#

Is Claude Opus missing for everyone? What happened to it?

sour spear
#

Anthropic do have the best models, but they're using way more compute than Gemini or others to achieve this, hence the obscene pricing. 😉

sour spear
wicked talon
#

Microsoft copilot is actually valid

#

It uses Claude and gpt

prime karma
#

@sullen quest I haven't seen the Gamma 4 and Quik 3.6 on my leaderboard yet. Have they arrived yet or have they arrived?

#

@oak python @quasi atlas @light siren I haven't seen the Gamma 4 and Quik 3.6 on my leaderboard yet. Have they arrived yet or have they arrived?

light siren
#

u mean qwen

prime karma
prime karma
normal abyss
light siren
#

about gemma, it's already here

light siren
#

lowkey I need to get my extension approved

storm dust
#

oh hey liam

light siren
#

can u do that too

storm dust
#

any news about the extension?

#

me?

light siren
light siren
storm dust
#

i cant dm him

#

i need to friend him

light siren
#

his dms is closed

#

that wont work

storm dust
#

@echo aurora im sorry for pinging you but liam wants to talk with you

light siren
prime karma
light siren
#

if u see him speaking here tell him liam asked if he can respond to the modmail

#

havent seen him myself all day

#

wonder what happened

rustic gale
#

As in, this... place is in 6 digits member-wise. And it represents a corporate entity. Something tells me being a community manager here is a job. Which is paid and which has at least some semblance of work hours. 😄

light siren
cedar citrus
#

which is bad

cedar citrus
rustic gale
stuck orchid
#

Freedom for Claude Opus!

#

We won’t let Claude Opus models be mistreated on LmArena or pushed out by others!

#

Claude Opus models have every right to be on LmArena!

#

They are excellent models

#

They shouldn’t be so underestimated, as if they don’t belong here at all

hollow ivy
#

you just have to be smart and patient, then you can use Opus-4.6 (sometimes)

storm dust
hollow ivy
#

(i can understand, why pineapple was not here, when it happened)

#

we forgot, that they spend money for us

sweet tinsel
#

Honestly Opus is overkill for most chatting tasks and only really excels at coding.

hollow ivy
#

and they still do it, and we can use over hundred of models - still for free

stuck orchid
rustic gale
# storm dust even though some or most people mistreat them 🤣

Mistreat, lol. Were there limits? No, or at least those barely understood and often circumvented. Was there comms about what is the correct way to use them? No. Maybe about how much? Also no. Gas occupied all volume available to it; so does human nature when seeing a freebie. And then omg, abuse, creative writing for 5k words per single output (source: me, doing that all the time, for months). What was expected, really? That a model known that it's very good at something will be used for something else?

sweet tinsel
rustic gale
stuck orchid
rustic gale
hollow ivy
stuck orchid
#

Hmm, It’s strange. Very strange. There must be a specific reason why the Opus models were removed. It’s unlikely that it’s just LmArena’s own attitude toward these models

hollow ivy
sweet tinsel
stuck orchid
hollow ivy
stuck orchid
hollow ivy
#

(how many were there? opus-4.1, opus-4.5, opus-4.6; was there opus-4.0 ?)
(also their thinking-variants, which are even more resource-hungry)

sweet tinsel
rustic gale
#

(spoiler alert; there are no trees. In case it wasn't clear)

sweet tinsel
#

Also there is not really an incentive to operate a direct chat other than to lure users in to vote in battle someday. Direct chats don't have an effect on ELO rating, which is the product that they essentially provide to the labs.

hollow ivy
#

and if 100k users used an Opus each day, we get $100-200k expenses just for Opus-usage per day

#

or even $500k per day
that is $15 million, just for Opus, each month

stuck orchid
#

The Claude models already seem to be the most popular ones right now. Their GPU costs are likely fully covered, given how expensive their models are

rustic gale
sweet tinsel
#

This video explains it fairly well: https://youtu.be/j_kJNYLI6Tw?is=arvbc6RYyeKW9zQY

Anthropic just made the limits on the Claude Max/Pro plans a lot worse...

Thank you Depot for sponsoring! Check them out at: https://soydev.link/depot

SOURCES
https://x.com/trq212/status/2032916661452595648
https://x.com/trq212/status/2037254607001559305
https://x.com/Pranit/status/2037353721047491047

Want to sponsor a video? Learn more here:...

▶ Play video
static heart
#

Is the Opus 4.6 trough Arena and the official claude.ai website the same? i bought the Claude Pro Subscription today and Opus 4.6 on Claude.ai directly seems different to the one on arena. i tested prompts i prompted before in arena and it responded verry differently

hollow ivy
loud verge
#

I remember some announcement about models being removed from direct chat.

opaque maple
#

hey can i ask you guys smth

hollow ivy
#

and/or different P-top values

opaque maple
#

i know its ai slop but

loud verge
#

Is that actually happening? If so, when?

cedar citrus
#

bro what is this delete this bradar this is completely unrelated bro

#

and its kinda mid idk

#

also the ai companys give acess to lm arena the ai models for free because they will get more visibility and free datasets

#

free datasets is the most valuable thing

static heart
hollow ivy
#

i thought, Anthropic had something similar

cedar citrus
sweet tinsel
#

Not really. He could use the API although.

hollow ivy
#

(i cant use it, because they demand a phone number)

cedar citrus
sour spear
hollow ivy
cedar citrus
#

simple

hollow ivy
#

(and poe.com only offers 1 message per day; opus-4.6 included)

cedar citrus
#

free data is 100x more valuable than the costs

unreal hatch
#

its say something inappropriate in that bottle

static heart
cedar citrus
hollow ivy
sweet tinsel
hollow ivy
#

no daily credits

sweet tinsel
#

They don't get the Data for RLHF from the direct chat.

cedar citrus
#

still

#

they should only remove from direct

sweet tinsel
#

And it's not worth it if people never go over to the battle mode and just use Opus directly.

hollow ivy
sweet tinsel
unreal hatch
winter cypress
#

i want create video

cedar citrus
#

i coud literally use 100t tokens

#

and they arent charging a single cent

#

i could recreate 100 node js and they arent charging me a single cent more

sweet tinsel
hollow ivy
cedar citrus
#

so for me it would be 1200 reais

slate hare
#

Hi everyone I just want to inquire about Claud Opus Removed

winter cypress
#

i want create video

cedar citrus
sweet tinsel
slate hare
sweet tinsel
light siren
hollow ivy
#

(but low, so one has to be very patient)

cedar citrus
cedar citrus
#

like me

slate hare
light siren
cedar citrus
compact comet
#

poe.com is actually cheaper than regular claude? their rates seem reasonable

light siren
sweet tinsel
cedar citrus
compact comet
#

no of course not

cedar citrus
compact comet
#

it's 75k points per 100k output tokens

cedar citrus
#

api costs are higher than subscription costs honestly

rustic gale
cedar citrus
sweet tinsel
hollow ivy
compact comet
#

and they give you 1M points per month

cedar citrus
sweet tinsel
cedar citrus
cedar citrus
sweet tinsel
cedar citrus
slate hare
hollow ivy
sweet tinsel
hollow ivy
#

(…at least, that is very unlikely)

slate hare
sweet tinsel
#

But note that it's still available in battle mode. The actual product.

cedar citrus
#

i wish anthropic lower costs of their products

compact comet
#

or leechers stop abusing their services

sweet tinsel
cedar citrus
slate hare
cedar citrus
#

with anthropic blocking openclaw with the subscription usage the usage limit will rise prob

compact comet
cedar citrus
cedar citrus
sweet tinsel
cedar citrus
compact comet
#

massive ai models are expensive to run and reasoning takes 10x the cost, more news at 11

hollow ivy
slate hare
#

What About Gemini 3Pro

hollow ivy
#

(aka GPT-5.5)

cedar citrus
hollow ivy
sweet tinsel
compact comet
#

nobody offers their strongest models to free users, that would be utterly insane

cedar citrus
slate hare
#

He doesn't exist.

slate hare
hollow ivy
cedar citrus
#

3x less

slate hare
#

How much does it cost to subscribe to anthropic using Opus

compact comet
#

claude limits for opus are ridiculous

west lodge
#

alot

compact comet
#

the API is cheaper

cedar citrus
compact comet
#

unless you go for the max subscription, in which case the limits are reasonable

cedar citrus
#

oh okay yeah

sweet tinsel
cedar citrus
#

that i agree

cedar citrus
sweet tinsel
#

Distillation isn't lossless.

west lodge
#

shill

sweet tinsel
cedar citrus
sweet tinsel
#

You're better off using GPT and Gemini models.

cedar citrus
#

because entrepise gang will pay alot money

#

and people will subscribe to claude

#

monthly money

sweet tinsel
cedar citrus
#

this will be some sort of industrial revolution

#

im sure

#

its like with pc parts

#

a part from 2006 at the same price as a part from 2026 the 2026 version will perform way more

#

will happen with ai

compact comet
#

the inevitable crash of the insanely inflated AI race will lower prices but don't count on it

cedar citrus
#

mainly because of robots

west lodge
#

im hoping for corporate/closed AI To crash and burn

cedar citrus
west lodge
#

and for open weights models to reign supreme

cedar citrus
hollow ivy
cedar citrus
#

count my word

west lodge
#

yeah ive noticed

compact comet
west lodge
#

im just looking at how i can get a cheap gpu with actually good amounts of vram to run the decent gemma modes

compact comet
#

stop consuming corporate slop

west lodge
#

because 12gb isnt enough

cedar citrus
#

do you prefer

#

writing code for a software in three months

#

getting 1 second of sleep

west lodge
#

i CAN run the MoE model with some offload but its kinda slow compared to the cloud options and i cant give it much context so

cedar citrus
#

or do you prefer doing it instatly getting the same money

west lodge
#

not really a good replacement for gemini at

#

m

compact comet
#

what is blud talking about

sweet tinsel
compact comet
#

AI is very useful yes, it is still crashing and burning in its current state

west lodge
#

woah not that big

#

im talking about 31B

#

its decent enough

compact comet
#

it's not that hard to understand that it's not sustainable

cedar citrus
sweet tinsel
cedar citrus
#

intel arc is 1000 dollars

sweet tinsel
#

You're better off renting an H100 from runpod or such.

cedar citrus
west lodge
#

if i can get a good GPU taht can keep gemma 4 31b in full vram with a good context window im set

sweet tinsel
west lodge
#

ik benches arent anything to trust but

cedar citrus
west lodge
#

this is kindof a win

sweet tinsel
#

Have you actually tried running it?

west lodge
#

especially if i use heretic

cedar citrus
sweet tinsel
cedar citrus
west lodge
#

if 31b actually does better than g3 flash in real life then im set

sweet tinsel
#

Already quantitised.

cedar citrus
#

i have an i7 9700 rx 550 and 64 gb of ram

cedar citrus
hollow ivy
#

with that machine, you are not poor

west lodge
#

wtf ishtis bro

west lodge
cedar citrus
# sweet tinsel It eats WHOLE 101GBs with Q8.

bro the 101GB figure is for BF16 unquantized at FULL 256K context, nobody runs it like that locally 💀

Q8 31B fits on a 32GB GPU. Q4 31B fits on a 24GB GPU (RTX 3090). that's documented on unsloth's page and confirmed by people actually running it in llama.cpp

also the 26B-A4B MoE is probably the better pick anyway — only 4B active params per forward pass, way faster, and a 3090 runs the full 256K context on it with room to spare

so yeah, 3090 is the move

sweet tinsel
rich sable
#

I'm new here, does anyone know where the Claude Opus model has gone?

cedar citrus
# sweet tinsel I used Q8.

what GPU were you running it on and what tool did you use?

because there's a known LM Studio bug that shows inflated numbers for gemma 4 31B — someone on the unsloth HF page literally posted about it and closed the issue after confirming it was wrong. in llama.cpp the actual VRAM usage at Q8 is way lower than 101GB

also if you were counting system RAM + VRAM together during CPU offload that'd explain it. that's not the model "eating" 101GB, that's just your tool reporting total memory across RAM + VRAM

hollow ivy
#

(was too expensive for arena.ai to sustain)
but it still exists in battlemode

sweet tinsel
west lodge
#

damn this would be a great offerr (if it wasnt like mined on or smth) if it wasnt like a whole channel away from me

cedar citrus
west lodge
#

how fast would a 31b model even run on a 3090

compact comet
#

it probably wouldn't even run ngl

west lodge
#

wym

#

it would run fine if you just quantize it

rich sable
#

life hack

compact comet
#

not for any reasonable definition of "running"

cedar citrus
#

give me a sec

sweet tinsel
compact comet
#

yea sure but it's like 1 token per 5 billion years 😭

west lodge
#

great

cedar citrus
# west lodge how fast would a 31b model even run on a 3090

31B Q4 on a 3090 sits around 30–34 tok/s which is totally usable

but honestly the 26B-A4B MoE is the smarter pick — it hits 64–119 tok/s on a 3090 and runs the full 256K context with room to spare, quality is only slightly below 31B

so if you want speed + big context, go MoE. if you want max quality and don’t mind slower, go 31B

west lodge
#

cuz atm i only hit like 15tok/s on my 4070 😭

#

with the MoE

rich sable
#

soo real life hack?

cedar citrus
cedar citrus
west lodge
#

uhm yeah

cedar citrus
#

guys i have a question

west lodge
#

why are you getting claude to generate your responses btw

#

that did NOT warrant a prompt

sweet tinsel
#

You know what, I'll spin up a VPS real quick equipped with an A100 to see rq if it works with Q8 quants tomorrow. I'll message you @cedar citrus.

cedar citrus
sweet tinsel
rich sable
sweet tinsel
#

And I want some sleep on those festive days.

hollow ivy
#

how good is the Gemma 4 E4B SFP8(8bit) in roleplaying?
(That is the biggest of gemma, which i might barely be able to run here, lol)

west lodge
#

truly who gaf

rich sable
cedar citrus
sweet tinsel
rich sable
#

If you want to reuse Claude Opus 4.6, go to a chat where you've already spoken with him, and there you go, you can reuse Claude.

cedar citrus
west lodge
#

google bro

#

you dont need claude to wear your skin

#

just use that brain of yours

cedar citrus
# rich sable

how are you getting acess to claude opus 4.6 in lm arena dawg

west lodge
#

aipac

#

bro trusdt

cedar citrus
west lodge
cedar citrus
#

its gemma 31b q4 vs gemma 26b a4b MoE

west lodge
#

oh wait

#

gemma 31b was compared against 3 flash non-reasoning

cedar citrus
#

lol

west lodge
#

yeah makes more sense now

stuck orchid
sweet tinsel
west lodge
#

but still

#

kinda rough

#

better off just living with the slightly slower speeds of a full dense model

#

or just

#

use qwen

#

LOL

#

itll probably run even faster than gemma cuz less params

#

still dense so itll be slower but

#

worth it

stuck orchid
#

Gemma - Top Model?

west lodge
#

nope

sweet tinsel
#

It's just good for local use.

#

And excels at its domain, being edge deployment.

stuck orchid
#

They should have released Gemini 4 instead

west lodge
#

if you're coding with local models just uhh

#

use qwen3.5

#

its somehow better

hollow ivy
sweet tinsel
west lodge
#

and doesn't start repeating tool calls as quickly

west lodge
#

not coding

sweet tinsel
stuck orchid
#

In AiStudio, you can see Gemma thinking, without any generalizations

west lodge
#

well yeah

#

why would they summarize an open weight model's reasoning

stuck orchid
#

It's strange that Glm-5.1 isn't available on LmArena

hollow ivy
#

(i also have 16GB main ram)

west lodge
#

if you dont mind offloading then E4B

#

if you DO then E2B

#

E4B should still be fast enough even when offloading since its low params

#

(effectively)

sweet tinsel
sweet tinsel
hollow ivy
#

is Qwen-3.6 better than all gemma models and also better than Qwen-3.5?
(at their website)

west lodge
#

it could if you offload

#

and run quantized

west lodge
#

so we cant really tell;

hollow ivy
#

is Qwen-3.6 also better than GLM-5.1?

sweet tinsel
sweet tinsel
west lodge
#

usually .1 increments (which everyone seems to be doing after openai started doing it) don't really provide a big ste pup

#

its more refinement or more RL

hollow ivy
#

so, GLM-5.1 is currently the best chinese model for coding?

west lodge
#

.5 or +1 increments usually have the best improvements (like with gemini, where the flash models outperform last gen's pro models)

sweet tinsel
west lodge
#

son what

#

i mark the sata port as hotplug and i can do this

hollow ivy
#

and which model would you recommend for roleplaying and realistic sandbox games, where the AI has to be an intelligent GM?
(apart from opus)

sweet tinsel
#

Minimax 2.7 works also pretty well but I believe that GLM 5.1 has an edge over it also in raw code output, even without a harness.

hollow ivy
#

(thanks for your answers so far!)

sweet tinsel
#

Genuinely the creative writing outputs of large models are very good. GPT 4.5 had a very great writing style and that falls a bit under the same category.

sweet tinsel
hollow ivy
#

i'm also in the other discord (lmsys)

sweet tinsel
hollow ivy
#

i used sonnet in lmsys in summer 2024

#

i think that was during the big Claude-Sonnet-3.5-moment

sweet tinsel
hollow ivy
#

nice, thanks for remembering me

#

rabbits are indeed cute

sweet tinsel
hollow ivy
#

(and i know of a boardgame, where they play a role ^^)

#

when did you start to "vibe code"?

#

or AI-code

hollow ivy
#

could one vibecode with GPT4-32k?

sweet tinsel
#

I tried it out over time again and again always with miserable results. I to this day don't really completely vibe around but recently I noticed that the models are actually capable now, especially with harnesses like OpenCode, so I've been doing a bit more testing over the last month.

hollow ivy
#

(i never really tried that out)
(or maybe i did , but just with bash script? i cant recall)

sweet tinsel
#

GPT 5.4 for Backend and Gemini 3.1 Pro for front-end work really well.

#

I mostly keep Opus and Anthropic models out of my life.

#

I've been a Plus subscriber since the founding days of ChatGPT and it serves me well with my GitHub Copilot Subscription that I recently got.

hollow ivy
#

what do you think about Grok?

sweet tinsel
sweet tinsel
#

The speed to quality ratio was just great with multiple agents.

hollow ivy
#

Grok-4.20?

#

their 4-agent system?

sweet tinsel
#

Sometimes I am stuck waiting for my GPT 5.4 to finish thinking, for up to 45 minutes.

sweet tinsel
#

And the normal Grok 4.2 Thinking also serves me correctly.

hollow ivy
#

so, which current AI system in direct chat of arena.ai is the best for pure vibe-coding?

#

(of all existing ones)

#

if one can not code, but takes 100% of the AI code face value

sweet tinsel
#

You can also get the GitHub Copilot Student Subscription for free if you're in that age range.

#

But for direct you could use Models like GLM 5 I guess.

hollow ivy
#

even for harder projects such as letting it write an AI for niche boardgames?

sweet tinsel
#

If you want Website slop don't use Loveable or such. That's just a nightmare to host outside their ecosystem, use AI Studio or v0 for that.

#

v0 gets you limited free access to Opus 4.6 btw

hollow ivy
#

i mean, for offline apps

#

not web-coding

sweet tinsel
#

5 dollars of inference monthly

hollow ivy
#

(i use Linux Mint 21.3)

#

i dont have paypal and i wont ever use a credit card online

#

there's always the danger of getting hacked/leeched

#

so it has to be free

sweet tinsel
#

For apps you'd need some way to build them actually. Or you could just run straight Phyton as an example.

#

For offline apps I really wouldn't use Arena.

hollow ivy
#

i can build apps in C/C++ here, also Java and Python

#

and Rust (but this is a bit harder)

sweet tinsel
#

Use something like the free agent from GLM, which has generous limits or really just run Anti-gravity and OpenCode locally.

hollow ivy
#

can i use Sonnet-4.6 for free, using that harnesses?

sweet tinsel
#

With Anti-gravity you can do that with usage limits which are restrictive although.

#

You can also use Opus there as a free user, but with limits that won't get you far.

hollow ivy
#

1 message per week? (for opus)

sweet tinsel
cedar citrus
#

when are glm 5.1 dropping its api

hollow ivy
#

ok, that does make more sense, ofc

cedar citrus
hollow ivy
#

isn't it already available in their chat?

cedar citrus
sweet tinsel
cedar citrus
#

if its comparisons against opus

#

are true

hollow ivy
#

do you think GLM-5.1 will be better than Deepseek V4?

cedar citrus
#

deepseek fell behind

#

if glm is right of the benchmarks being compared to opus then boom

hollow ivy
#

also better than GPT-5.4-high and Grok-4.20?

cedar citrus
#

idk

#

its what they say

#

we cant know for sure until they release to the public

hollow ivy
#

i think, i already chatted with GLM-5.1 at their website..

#

but i'm not 100% sure

#

maybe they do a partial rollout?

sweet tinsel
#

I don't think it's going to be this great.

#

There is something called "Benchmaxxing"

#

Common practice, ask Meta.

hollow ivy
#

yeah, unfortunately

#

they especially train for the benchmark, which defeats the purpose

cedar citrus
cedar citrus
sweet tinsel
cedar citrus
sweet tinsel
cedar citrus
#

anthropic right now is unreachable

hollow ivy
sweet tinsel
#

In the interface for coding.

cedar citrus
#

guys

#

did you guys get acess to the video api in openrouter

#

i actually got acess

#

but its expensive as heck

hollow ivy
sweet tinsel
cedar citrus
cedar citrus
#

i hope they cooks

#

i love their pricing

sweet tinsel
#

And their main selling point will be their lost reliance from Nvidia as they use Huawei for Inference and such.

cedar citrus
#

its gonna be great for me for my api uses

#

but i am using nvidia nim api

sweet tinsel
#

1 percent of battery left and it's midnight. I think I'll head out for now.

hollow ivy
#

do you guys think, chinese models could permanently claim the second place behind Anthropic?

#

and ahead of Google deepmind, OpenAI and xAI?

cedar citrus
#

i think they will be ahead of gpt and xai right now

#

anyways im heading out

#

im going to eat

#

im sending you a message

#

so i dont lose a contact

#

eh whatever

#

its blocekd

hollow ivy
cedar citrus
#

add me if you want

slate hare
#

Gemini 3 Pro or Opus 4.6 is better

hollow ivy
#

(due to past discord shenanigans, i had to do that)

cedar citrus
cedar citrus
#

you can add me if you want

#

im nice

#

anyways byee

hollow ivy
#

(or here, in general)

sweet tinsel
#

That's Backend. For the love of god trust the autistic GPT 5.4 and 5.3 Codex with that kind of task, it really is great at it.

#

That message disappeared now I guess.

hollow ivy
sweet tinsel
hollow ivy
#

and neither is 5.3

hollow ivy
#

but 5.5 should be

sweet tinsel
hollow ivy
#

ah, ok

sweet tinsel
#

Others share this opinion from what I saw.

hollow ivy
#

even in sizable C/C++ programs?

sweet tinsel
#

Most likely yeah, although I don't really code in that language.

sweet tinsel
hollow ivy
#

no, i bet not

sweet tinsel
hollow ivy
#

maybe better than Sonnet-4.6, but never Opus-4.6

sweet tinsel
hollow ivy
#

less hallucinations?

#

better understanding of the code?

#

and better understanding of the user intent?

slate hare
slate hare
hollow ivy
#

so we await these models:

  • Grok 4.5 or 5
  • GPT-5.5 ("Spud")
  • Claude Mythos 5
  • GLM-5.1
  • Deepseek V4
  • Gemini-3.2 or 3.5
slate hare
hollow ivy
#

Spud might be second

sweet tinsel
hollow ivy
#

which is best of these? and which comes second?

  • (current) Minimax
  • (current) Kimi
  • (current) Mimo
  • (current) Qwen
  • (current) Deepseek
sweet tinsel
slate hare
#

1)Kimi/2)qwen/3)deepseek...

hollow ivy
sweet tinsel
slate hare
sweet tinsel
#

I would say

  1. GLM
  2. Minimax (2.7)
  3. Kimi
  4. Mimo
  5. Qwen
  6. Deepseek
hollow ivy
#

wow, i didn't know, Minimax was that good

#

and GLM rivals Gemini 3.1 pro? or maybe even Sonnet?

sweet tinsel
sweet tinsel
hollow ivy
#

and back-end coding is harder for AI, right?

sweet tinsel
#

Maybe GLM could be better than Backend than Gemini with 5.1

hollow ivy
#

as it can not be done as fast in python

restive crypt
#

yo everone

#

wsp

sweet tinsel
#

Codex is the worst at front-end.

restive crypt
#

why isnt opus 4.6 misiing

hollow ivy
#

is Python the best language for front-ends?

restive crypt
#

on web

sweet tinsel
restive crypt
#

nah i cant see it on website

hollow ivy
#

opus shy

sweet tinsel
hollow ivy
#

GUIs yeah

sweet tinsel
restive crypt
#

forver??

hollow ivy
#

probably, yeah :/

#

to save cost

restive crypt
#

nooooooooo

#

tf

restive crypt
sweet tinsel
# hollow ivy GUIs yeah

Ooooff, something that I rarely use AI for, I used Opus 4.5 for that last time and it worked fairly well, I have honestly nothing more to say.

restive crypt
#

broo

#

chat isnt giving that felling that it was giving before

sweet tinsel
hollow ivy