#general
1 messages Ā· Page 313 of 1
If they wanted or felt threatened by competition in any way, 128k output cap and tuning for that would have a nice performance boost over 64k I believe
It's a good model in how it manages to get the performance with less tokens than any competing model currently, but there's untapped margin nevertheless to go beyond
so gpt 5.3 codex is almost half the price compared to 5.4 ? and honestly the difference for coding is not so big, so you can legitimately take this ~2x usage
you guys could test opus 4.6 on ondemand but its only for a few messages (til it hits 1m context im p sure)
And it's not difficult.
5.3c is near equivalent to 5.4 in coding
yeah and the price diff is big
task dependant marginal differences there, they are simply different
you can use 5.3 codex with fast mode and you'll just use a bit more than if you used 5.4 without fast mode
if you want to, or you can also use it in normal mode and have more usage
GPT 5.4 was brought back, but only to the old dialogues.
that is, in those dialogues in which 5.4 was used, you can use it
I don't know about Gemini
in a normal, more or less new dialogue there is no 5.4
seems like a ui bug, the endpoints are closed
actually no ur right, but only some of the old chats for me
whats a endpoint
by endpoint i meant basically the way to access that specific model, i was guessing they closed it but they didnt

because its not intended to
bro bring back claude opus
Probably not until next gen models are Cheaper
vruh
The problem is Compute is not Free like Software
It takes Computers Infrastructure, Electricity and Water
Maintanence and such stuff as well
mmh i see
Hi
What's the daily limit for image generation?
is any way where we can use claude opus in free ?
I use GLM5.1,it looks good at coding
didnt test it out but probably could be replacement of the top 3
still cant move on
they removed my gemini 3.1
for me, its atleast close to opus
my favorite model tbh
I really like Muse Spark, I'm so glad they added it
#Generate a video
Where do you find this?
Yeahhh, smartš§
@echo aurora
š
Im try log in again even log out and log in back but still get that issue when try oppen a chat š
find whats odd in the image
Claude Sonnet is 19th in the global ranking
it's a mess.
erm so?
The one that you ask him it's odd š
First one š
wat
Extra hugs is sus
Can you explain to me why gemini-3.1-flash-lite-preview is worse than gemini-3-flash, a seemingly lesser version?
ur soo delusional- it says LMArena instead for Arena
cause google is bad at making ai
you would think they would be good with the amount of data they possess
I'm confused
look closely at the top left of the screenshot
Ohhh it's Fake or Ai image generated
no
it's just my Arena Fixes extension
But is gemini-3.1-flash-lite-preview actually worse than gemini-3-flash? Or are the data wrong and is 3.1 actually better than 3.0?
that fixes arena bugs, I made it bring back lmarena
So you have opus 4.6 in direct in chat?
the hell, no
no amount of changes from me could bring back models
I said bug fixes dude
Why did you want to say cute lines for your Lil sister? SUS
wdym say cute lines for my lil sister, she's tryna be a voice actor and wanted cute lines
You right but I got one for her I can give
get out.
It's really good line for her
Not interested.
Noo not wired i saw that in tik tok
fine what is it
- ~UwU~ ~my name is Senpai don't leave me please~ š„ŗ ~Uwu~
she's a minor.
...
I just woke up and this is the first thing I see here
That was my The Proposal
I'm minor too lol
wait...you can now click role ping
Is it me or nano banana pro 2 messes up your actual face when generating photos
Muse spark is actually good bro thanks god
yeah but im using on meta ai, i think it has no message limit
'Session not found. Redirecting to home...' what does that mean guy?
its nerfed on meta ai but on arena it seems fine
it wasnt anything tou-breaking or copyrighted either
what was it
what did you say?
i closed both tabs already
But only 128k token
system prompt is like instruction ? or is different ?
Not good as opus 4.6 but still good i think
Opus 4.6 thinking is back ! But seems to be the only one...
And now it disappeared
It is not it's a bug
None of those are bsck
It's unclear if it's a UI bug, or something else
ok i think that was a bug a UI bug
And yet, the answer generated (19 s., about physics and philosophy) was at its level, seems to me at least. Which makes me wonder...
sometimes I feel thinking is disabled on this server lmao
REAL
You can use Pro 3.1 on Ai Studio
GPT plans are selled for cheaper on some websites as well
what you mean ?
Umm guys where is Claude opus on the site like I havenāt been on here or the site for a while and thatās the only one that I use there so like how do I find it
only available randomly in battle mode for now
they're gonna roll out a free daily credits system soon and bring access back
Bruh that sucks like I knew this was gonna happen at some point but are there like other sites that have Claude opus
still undecided
they did not confirm those models will be back with credits in side/direct
Any other sites that have opus for free because thatās the only one I need
Uhh why else would they be adding it
no
becase as arena gets more and more popular the cost keep increasing
Well Iām guessing eventually theyāre gonna add that you can buy more credits and then itās gonna be like most other sites Iāve seen like you can use it for like a few messages and then itās just pay to use
No they adding it cuz they wanna bring the top tier models back in a more sustainable way
This is the more sustainable way
And eventually youāre gonna have to pay to actually use it, damn the world sucks, everything has to be payed a ridiculous amount of money for
can you find me the sentence in this discord or w/e that explicitly says those models will be back with the new system?
because i havent read
its literally obvious son
Well if they were back they would be extremely limited right
They're not gonna bring a credit system cuz arena is getting "more and more popular"
That makes no sense lol cuz why would they still have the battle system/direct chat up
They wanna bring back the top tier models
That cost more
Everyone getting free credits so i dont think so
Well the free credits are gonna run out and then itās pay for more
I think its per day
You really think this is gonna actually remain free forever
I still prefer the present, I don't like the system their tryna implement... feels restricting
hope so
Yh hes gonna say that but they're most likely gonna bring it back
You have too much hope in this because it usually goes this way they implement more restrictive ways to use it for free and then itās you gotta pay for more
I don't think they gonna make people pay
Arena a free website
we ppay with our prompts and vote
free but restricting
but those are not enough
day by day they gonna add more limits
anymore
Well there is probably gonna be a limit for free like you get x amount of credits for free and then if you want more you have to buy it
i like whats current, i don't like this
So u like not having any premier models
I like how it was before like when you could just use it and get a limit after a lot of messages like that was good enough for me
I like having premier models, in fact i like using all models. It's just that the concept of "free" but limiting it is conflicting...
There are other websites that have AI models and credits and you have to pay for more after a bit
Like thatās whatās gonna happen to this too
Thats different tho this is an arena
They not gonna make people pay for voting models
Itās actually sad because this was great like you can use it a lot for free
what about Direct Chat?
for free
I am okay with that but they are removing that privielge
But what about the announcement that they made? They adding more credits
Bro just accept that this is eventually gonna happen because it already is like they already took the good models and now theyāre bringing credits and next is pay for more credits
I'd pay for more credits
exactly
Lowkey i would too
depending on the price though I guess
why not just keep the old models free, while premier models have credits instead
they probably will keep old models free
Thatās the point and honestly I was hoping it would go more like you can pay to use this model but itās not an extreme amount of money and you donāt have to pay monthly but like once like I know there was a site that did that like you gotta pay like 10$ to use the model indefinitely I think
Then its not free anymore...
I read somewhere in their terms that they will provide their services for FREE
Well thatās the point, nothing is
it just feels conflicting lol
Technically it is still free
With the credits ur getting for free
as of now yes, in the future? paywall.
I know but they are gonna make it like everything else that is āfreeā like itās technically free to use for a little bit but then itās pay to use for real
Ur just assuming tho
What do you think is gonna happen cause itās obviously heading towards that
Js the credits system and thats it
It would be great if they brought the good models back and gave credits that regenerate after some time and thatās it but I doubt it because they already removed the good models from direct chat
shi happens i guess š¤·āāļø
you forgot the part where it says that they reserve the right to make users pay for the service or parts of it
hmmm, i didn't see this
It sucks, but does anyone here know a site that has claude opus for free to use like anyone
FEES AND PURCHASE TERMS.Ā Company currently offers the Service free of charge. However, we retain the right to charge for the Service, or any features or components thereof.
Theres none
damn
FEES AND PURCHASE TERMS.Ā Company currently offers the Service free of charge. However, we retain the right to charge for the Service, or any features or components thereof.
DAMM im late
@echo aurora I need some help with this: when I send a request and the reCAPTCHA verification says it's infinite, it never finishes and the message ends up saying "error due to not completing the verification".
Are some video model available on discord here
video generation is no longer available in the arena discord server, it is only available on the arena.ai website here: https://arena.ai/video
Go to direct chat or of the model isnāt there try and find another site on google like Iām doing now
you can't, it is battle mode only
Side by side and direct
hows openclaw?
@echo aurora gimme ur juice
#3 is impressive
Damn GLM
congrats to glm 5.1 for being the best coding model you can actually use on arena in direct and side by side modes!
Hey @bright shard sorry to say is a known bug. Team is aware of this and are working on a fix. Unfortunately, there isn't much on the user side to get past this.
@echo aurora release gpt image 2 in direct chat or larry is coming š”
is glm actually that good
Same question lol
No way glm is that good..?
since you are voting on what the web app looks like for the most part it might be skewed towards how well it can do react/web design and not general swe
Whereās GPT image 2?
removed
Oh no. Why
i wish there was like a gallery feature so that you see what each of the models' generations look like
expensive
can't even do a proper PDFLATEX. there's no way, that's the score of that thing. š¤¦āāļø
Sounds fair
not out yet, demo ended
7th place is not trash
too high costs people abusing it
5.1 is absolutely insane
it is completely trash, didn't serve my request at all. always errors all the time compared to Kiwi or Sonnet.
so.....GLM 5.1 is GREAT? (but not a than a OPUS?)
it's like it can't even follow a hard prompt
It's below GPT 5.4 High in my opinion, unless it doesn't get any errors at all. i would rate it maybe in between pro preview and sonnet
why is arena's sonnet 4.6 so different from claude's sonnet 4.6?
My GPT IMAGE 2 In Arena Direct Chat Release Prediction
- Monday, April 13
- Wednesday, April 15
- Saturday, April 11 (Tomorrow) (Highly Unlikely but Yeah)
what the hell? How is GLM 5.1 is better than Gemini?
people voting based on vibes
or some random prompt they do, that isn't really useful. like creating a roleplay story for no goddamn reason
It's Sunday 100%
Gpt image 2 drops Sunday
It might be tomorrow
To be fair, Open Source Models are getting better, the chinese labs have really intelligent workers to improve rapidly, you guys should check out GLMs Twitter.
"batter"
my bad
Destillation isnt that huge anymore. I think they use private code training data from china as these are usually not available for outsiders.
I have used GLM5.1 and I can ensure you that it's not better than Gemini, not even close. It gave me more problem by not following the instruction and more often had trouble processing its own agency in batches.
it's just absurd right now
kimi still uses it
it claims it's claude
lol
All chatbots sometimes mentions each others names, for example I think it was ChatGPT calling itself Gemini in french or something, there are lots of these cases
Kimi for sure did use destilation
Deepseek team admitted to it too
it's not a secret at all
Yeah no doubt distillation is part of it, but definitely pure distillation
Honestly idk how well they could do without it
they're playing on hardcore compared to the usa
less data and weaker hardware
If you think these models are just distilled copies, you're ignoring that they're training on massive, proprietary datasets Western models don't have access to, while being restricted by hardware sanctions to innovate more efficient architectures than the probably bloated models that we have rn
Yo the muse spark is actually good
I wish. but major releases never come on fridays or the weekend.
uhh in kimi's case the code performace is alright
but it def would be way worse if they didn't do it
Nvm, Gpt Image 2 is confirmed at Tuesday!!!!!!
china has a lot of data for sure but i wonder how much of it is just them spying on citizens lol
š š
Lets be honest tho, Open source is improving way faster for what they invest though, of course a bit of stealing and distilling is part of it, but i wouldnt be surprised if we have claude mythos performance at the start of 2027 or end of 2026, but opensource
and the models definitelt did not receive all the data considering the censorship
isn't mythos like 5x more expensive than opus
Me not needing to know who is this Assistant B. It's literally GLM lol
source? last time someone said that it turned out to be false lmao š
it's not only about the best model but who can run it too
these models burn money to answer simple emails
and mythos is mostly hype for now
I meant in arena, because gpt image 1.5 in arena also dropped at a Tuesday
trust me bro benchmarks and lots of weird ideology put into the spec
they hired a psychiatrist or smth
these mofos really play dumb and act like they can't be sure if it's sentient
It could release in the weekend too who knows
I doubt its just hype, of course benchmarks dont reflect real World performance entirely, however the jumps there are insane and from what results have been showing its quite good.
gpt-2 was 'too powerful to be released publicly' as well
and it was 2019
anthropic is probably going public
they just leaked the source code of claude code using their frontier mythos model
lol
Yeah but now we know more about AI than ever before you know? Its like if you eat a new kind of food its gonna be great, but if you eat more variety of that food you can tell whats best and what isnt
idk man it just looks like hype
there is no evidence for us to test it anyway
this technology needs hype to survive otherwise it will die
But lets be honest, well just get a lobotomized version
its just a cash burning machinea at this point and no one found a way to reliably earn cash off of it
naah, ai is too big to just die, its here to stay, no doubt about it
i didn't say disappear
When will Deepseek V4 be released?
i just think the hype didn't meet any of the expectetions
Well you kinda said its gonna die
excpectations*
Opus 4.6 released Friday in arena
i meant the hype
literally every 6 months we hear it's gonna replace people and it didnt for the past 3 years
investors have their patience too you know
and the truth is that AI works great until it doesn't
its just a tool with lots of hype and fake promise of AGI
but it does have a lot of potential
and some people already make great things with it
so die, no but the hype wears off
I find it strange that some models are rising in the rankings while we no longer have access to others, so I question the validation of the data and the positions in the ranking
Yeah but keep in mind, one of the biggest issues is probably long context windows, and these are growing insanely fast. I think once this is resolved we can actually see some jobs be minimized. Im not saying that they will literally take all our jobs but will be working alongside us
Well we dont have a central definition of AGI, so its hard to know
nah man its just cringe hearing them talk about agi
its just token prediction and nothing else
yeah but thats because anthropic likely released opus 4.6 the day before, on a thursday. If OpenAI released gpt-image-2 on a thursday, arena would likely have it on a friday, but openAI wouldn't release gpt-image-2 on a friday or the weekend.
its not about the context size only
I think its mostly openAI
top models have 1m tokens and they still 'forget' things in the middle
Never said its only that, but its still a great issue
Openai hasn't released gpt image 2 yet? I thought some people were having access to it in the chatgpt app already?
its not officially released, yes
No. the mythos spec has that narrative. They release this paper every year and act dumb like its surprising them
no announcement or anything
So the API isn't too
not really its just not reliable 100% and never will be
and you don't get consistent results
yeah there's no gpt-image-2 api yet
scammer ahh
Well since the Demo is over, we might expect gpt image 2 to be officially released in the next few days.
gpt-image-1.5 dropped on december 16th, which was a tuesday
but there's no guarantee gpt-image-2 will also drop on a tuesday
they just dont do major releases on friday/weekend so that people can be around to fix it if things go wrong
honestly so far what we got from AI is scammers using it to scam older people or governments create propaganda lol
or artificial OF models
Naming only the bad parts while ignoring the benefitsš„
and social media somehow got even worse
just go to the official discord server of openAi bro
i meant the image generation and video generation
what
but im curious what benefits we got from sloppy videos
Image generation definitely has its purpose, video generation, I myself dont like it
I can't, I got permabanned from there back in the dall-e 2 days because I tried to generate a picture of a portal made out of meat, and they wouldn't let me appeal
video generation is no longer available in the arena discord server, it is only available on the arena.ai website here: https://arena.ai/video
what are the benefits of video gen tho. must people use it to create fake narratives or straight propaganda
Id say like using Nano banana 2 to create professional looking product images is a great usecase for example
image generation is on the site too, just not gpt-image-2, at least not yet
or maybe OF type content
and the fact it costs a ton to just use these models
i don't think its noble at all lol
maybe one of the mod things that was a gore lel
I dont use Video generation myself, but I do think it has its use cases to create good editing faster, mostly talking about VFX or similar. Also like previz cases so customers can see what a engineer is proposing faster, without any extra costs. Or for example, education purposes. If AI video gets good enough to create working simulations or showcases of complex physics examples for students to visualize them better, I think this has its purpose too
Even though im not using it myself
I'm not sure what physics simulations you mean but sure
i feel like it has niche use cases and 99% people use it for slop
sora was discontinued
the most useless product ever created that burned 15m $ daily
I meant more like explanation videos to visualize examples, in this case phyisics cases. But I do get your point how its being used for ai slop videos like those weird fruit videos
yeah
but AI is def gonna be useful in war
already is
USA must be very happy about it !!!
isn't AI already in existence before it became Consumer/Commercial
Lol sorry about that
At least when they're actioned their ticket is also removed
š
A Trace ID didn't appear?
how can i fix this error in antigravity?
i refreshed the page but it got stuck at reading the directory structure
I LOVEOPENMODELSILOVEOPENMODELS so muchhh
I think what is happening is you were getting the infinite generation bug initially https://help.arena.ai/articles/8691588590-troubleshooting-infinite-generation. If this continues, can you create a post in #1343291835845578853 and provide more details there about this?
could it be also because it genuinely has to produce a lot of code?
Is arena back
Muse spark seems very good. But damn the generation takes time
hi
yes but this model is very sht by simple tasks
probably happening again
for me the most frustating thing about arena right now, its in coding when i give it a task and it simply will either hit a memory limit and have to simplify it, or something went wrong when its only the first prompt of the day i give it !, and that leads me to never being able to actually build something !
i have to try 6 time the same prompt and on different account for one of them to work
it would be nice to know if the context limit was reached
at this point change your browser
.
im simply on google it should work
chrome
sometime i find a way to fix it but it potentially also reduce the quality of the work, its to explicitely say to do the task in an efficient way, that won't hit the memory limit, (don't do too much postprocessing or thing like that)
but then the result is just not as good as it would be usually with the model
so there is no great solution
whatever this model is in code arena can generate a working gameboy emulator from scratch in one shot which is pretty insane
Yea that's a gpt model
which model was it ?
lol of course it happened again
it has to be spud, it should come out next week
It's Claude Pineapple Ultra bro
bro it's claude pineapple ultra
how do you have that
no but.. i wanted the codename
anthropic has never tested a single model on arena before release, there is no way that it's claude
battle mode, just came up
claude pineapple ultra bro
Check out what I built in Arena's Code Arena - Content is user-generated and unverified
ho?
Don't you remember? It dropped last week by @echo aurora collabing with anthropic
2 days until pineapple juice goes back to his home in Arena Parallel Universe :(
And the Arena pineapple comes back
where is opus 4.6 and gemini 3.1 Pro?
On a vacation bro
they wanna chill too
š
hidden
vacation *
you can only get them in battle mode
they're in hawaii
hmm i got this model
unfunny bro
damn nice
There were some models removed from Direct and Side by Side recently, more info can be found here: #announcements message
not meant to be funny tho
Yeah I noticed
Cuz they're actually in a vacation type and will come back with new stuff
Would you mind sharing these Trace IDs in #1417174113092374689 , I'll take a look in a bit, but worried this will get lost in #general
thx for a serios answer
What should happen after?
pineapple care to explain this?
the normal pineapple comes back
literally joined this discord to search if anyone else was looking for scorch as well hi š https://www.reddit.com/r/lmarena/comments/1shrvq8/scorch_triaging_mystery_model_perhaps_mythos/
No problem. It's important to note these models are still in Battle, and that we do have the intention to bring them back to Direct/Side by Side when we can in a sustainable way.
hi 
Yeah the context limitation is a frusterating limitation to deal with at the moment. We're hopeful the new usage system we're exploring will be a helpful step to make this a better user experience.
nano banana pro will be stil better
gpt image 2 is better than nano banana wym
hahahaa
gpt glazer?
ok
gpt is kinda ass overall but gpt image 2 is better than nano banana, wonder how long it'll take for nano banana to surpass gpt image 2?
which model dude
Gemini 3.1 Pro
nah it literally is better, but its not out
gpt i2>nb2> np1p > gpt i1
Gemini models are good but you have to steer them to not be lazy
or else they will take the easy way out
(talking about code)
true
for what?
for coding
anthropic needs to put claude mythos into arena as anonymous model
nah its to powerfull
@echo aurora I want to ask will you return Gemini 3.1
Whats the best model currently that can be used in Direct.
muse-spark?
for what?
Ig for text. So not coding or images like that
I have tired the glm 5.0
The AI spent a whole half hour trying to figure out whether the bug was caused by user behavior, rather than admitting that maybe the code it gave me was wrong
I tested it. Its so slow I think
its only good for fronted coding
Is opus the best for backend
yes
Either way opus isnt in direct anymore š„²
opus is the best for everything
wish there would be a gallery for "same prompt but different model" to see how each model responds to a prompt
u can use antigravity with multi google accs
.
True
Ig I might do that
dont they rate limit you in an insane way
Claude Opus 4.6
like 1 prompt and thats it
But the limits are weekly i think
yes but u can use multi google accs
how long until you hit the limit?
Tbh last time I tried it it had really good limits
But u could only use it weekly
But if u have many many accs it might be usable
We don't have an ETA sorry to sya.
yes
Anyways for Text. Is Grok 4.2 Reasoning the best?
Thank u, is not a problem
i think is gemini
3
flash
So 3 Flash is best for Text in Direct?
3.1
3.1 Flash Lite is like talking to a lobotomized toddler
nah
u can use ai studio for gemini 3.1 pro and flash for free
Whats the limit?
idk but its high
Hmm. It just doesnāt answet at all
China's AI is theoretically suffering from a serious lack of computing power
trying the zai
it's slower than glm
no
I mean it is. But it answers, with no answer at all. It just sends an empty answer
bro muse is literally free with no ratelimits on official meta app
its literally fastest sota llm it reaches 350 tps
You pay for it with your data
...yeah?
wait do you not know how lmarena works?
ig
but meta is meta haha
Still I expected more from meta
I know it's literally free SOTA but
I thought they will make something bigger
design arena
How am I supposed to know that, bro?
uh
because frontend is useless, no?
no one needs frontend in our time
rlx bro
yarrak haha
peak
Bro trying to create roblox game
nice name
why would anyone need ai for coding in roblox
easiest language
Not English is.
wym
English is easier language
I tried coding a Roblox game using only AI, and it just doesn't work at all, haha
its called spark
its extremally small
and its not that bad for the size
I fed roughly 13k tokens of script into z.ai for glm5.1, and workflow prompts of process
But the GLM 5.1 just completely ignored the whole flow
I hate this this is so actually damn annoying ( because for some reason I got this error and now I can't go forward yet great just great)
ā ļø
I but look look at this stupid stuff it looks like it's gone but for some damn reason it is asking me to retry but it just gives me the same error and it just gives me the air either way even if I don't do retry
š„²
š„µ
someone know why opus models go away?
they are VERY expensive
Someday maybe we have Opus thinking Local Models When Technology reduces VRAM consumption trough New Implementations like turboquant or engram but just maybe hahahhaa
nice
Do you have any suggestions to add features I add wake feature with memory it can open youtube my download desktop increase Decrease volume
I added face lock too
it seems to me that even under the most positive scenarios, it will be impossible to fit trillions of parameters into 8 GB of VRAM
@echo aurora Muse Spark doesnāt work
same
Are you getting the blank response?
Team is aware of this problem. Working on a fix asap
same
u can use this for free in the meta ai website
who will send more of my data - the arena or the meta siteš§
both ig...
ai studio
sonnet 4-6 soon muhahahaha
Hello Iam New here
@echo aurora ban
I mean theyre a real person before they got hacked so they should just be kicked in case they get their account back and want to rejoin
Iām from Russia, I need vpn
Yeah
We ban in these cases, we have an appeal process.
fair enough, though I got hit with that hack twice and both times getting back into every server I was kicked from was annoying enough -_-
hey
using gpt for backend and then qwen 3.6 (free 1000 request on cli) for frontend is really great
why are the top 5 models not usable in direct chat anymore?
but the best
hahahaa never
how is it the best? when it errors a lot... you must be joking..
you never tested glm in hard works
Yeah, me
its slow but that's not the model fault its due to the hardware used to run it
openclaw isnt hard work?
But Iām fan Muse Spark too
i mean sum hard backend things
yeah thats true for backend is opus the best
gpt 5.4 is one of my favorites tho
for backend
i think for backend its actually gpt
hahaha funny
(if you don't look at the price)
nuclear bomb vs child
opus is too strong for frontend
you dont need to use opus for weak tasks
at all
if i can only use it for hard task then i don't see any point in using it at all
i will use it 2 time a month ?
your logic be like : ITS SO GOOD i wont use it for it !
no muse or gemini
gemini 3.1 pro is the best, but no more on lmarena š
because every new model will do frontend good enough
what if you want the best front end
theres no point in wasting opus
take the best model
qwen aint better than opus
gpt glazer AHHHHH
for frontend its almost the same
are you sure ?
gemini is prob better cuz svg and stuff
bcs im making fivem scripts with ai
but you better use special ai design sites
ye thats reason why i need gemini 3.1 pro back
its free
wdym
literally
but now i need to work and i cant find some good model for that
on google ai studio
where
ye its free but only 3 answers
just type google ai studio
thats it
and use 3.1 preview
and uhh go to build thing first
you know what
if you using visual studio code
or stuff
whatever
opus is taking lots of context and stuff
its just not necessary for design
but ofc you can
maybe you will get truly best design
but with high cost
honestly if i don't use it for frontend then i don't use it at all, i mean for the price of the model and the low usage we get with it, the only thing very good is the front end its doing
also they lowered cpu on opus
and its a matter of time until other model are that good at front end too so
anthropic
Rlx guys...
dang ur right actually
fr?
thats cant be real
for me its best ever ai model
for what you use it
expect claude
BRO calm down
mostly for coding
cpp and java
uhhhh
for java its alright
idk my personally thoughts after month of using
its bad and dumb
is there any language where gemini is best ?
its awful even in the simpliest luau
yeah if we compare it to other ai it's not as good definitly
even GPT 1 is better than this crap
gpt 1 cannot respond normally at all
you know the pain of using this
yeah that's exagerating but actually i don't think we can be honest if we say gemini is the best
Only GPT Glazers here ahahaha
on gemini the context window is trash
i have no choice but to use ai studio
Why are you defending GPT just like your mom, bro?
when it hits 300k context
Well, maybe not the best AI model, but definitely in third place
its so dumb
still better imo
why would i defend it? im just saying its 2nd best model
if it was only about price to intelligence then small team that make open source already won lol
and?
we're comparing models
imybe in abenchmark
guys I just came here after a month have I missed smth? I dont see the video arenaš
nobody defending anyone
gemini is god in benchmark
i don't know which ai you already tried ? you should actually try more ai
s
maybe it's ai
Claude opus 4.6 1M is the best I think
but on practice
im happy with it
Yeah, and your top 2 is based on a benchmark dumbass
crazy limits
ai is actually billions of ants they've trained to type
Opus mythic is the best actually
true
true
i tried all geminis from 2.5 to 3.1, all gpts and some other models like glm or claude
even the latest ?
Iāll remove the others AI Claude Has real emotions also
benchmarks dont prove anything, gemini 3.1 is higher than gpt and even claude a bit in some ways
and you still choose gemini ?
but gemini is bad for example
yes (expect claude mythos)
omg pls stop writing sht
and it sends pools plans no ai does that
they should lower the prices like kat coder v2 pro
that's actually crazy cause why exactly did you choose gemini, being truly honest it's definitly not the smartest one
0.20$ for model at gemini lvl
talking about kat coder
17
You just said benchmarks are unnecessary, and now you're showing one...
Don't get your hopes up that the Claude Mythic will be in Arena, and even if by some miracle they did add it, it would be extremely limited due to its high cost
i said they're not proving anything
i talk about price
kat coder in benchmarks is one of the top models
with the lowest price
0.20$ for 1m tokens
when claude is 20$ for 1m for example
at this point use step or minimax
minimax 2.7 is good tho
uhu
ssaaame
is meta not working for everyone?
v4 will be crazy
they said in end of this month
yeah probably the best open source model when it will be out
for god's sake how do i fix response timeout in glm-5.1
and deepseek is cheap
so it will be the best chinese model
BUT
deepseek 3.2 was trained on opus answers
smth like this
copying stuff
so i hope v4 will be originally
like any other...
nah
bro what do u mean when u r usign "deepseek" and "the best chinese model" in one sentence
im not using it
v 3.2 is bad
yep
Almost all Chinese models steal data from major U.S. models
.
and still will have a lot of forbidden topics
and stuff like watching videos
wym
but you right
its kinda dumb move if they will keep it
ok thats good point actually
but yes
what model do you wait for
gpt-image-2... š¤¤
actually i dont really know much about it, recently used opus and thinking of how can i replace it now
and
where you used it
on arena
?
yes
totally agree
Opus 4.6 nerfed ?
yes
i can suggest one
anthropics fault
thing
?
u can buy an invite to a buisness workspace like for a month and it will cost kinda $1-2
5.4
not worth it
but a lot cheaper, idk
idk
i will wait
like 3 months
when gpt will be free
in every site
opus 4.6 will be default too
i guess
maybe we will be able at some point to run model locally that's almost the performance of top 3, for example right now with actual innovation, we can run glm 5.1 which normally require 1.4 T of vram on only 200 go and its about unified ram + vram so..
trust the progress
so what i was tryna say, recently i had like a bug or smth idk, using direct mode on arena and somehow there was a choice of 2 models and there was an opus, i chose it and it was answering me the whole time, when the second choice appeared i chosed skip and still had an opus
Has arena removed models like claude opus? I donāt see them anymore
read the #announcements
and yea
they even removed gpt 5.4 tho
and gemini 3.1
every top models
maybe anyone know like how to start new chat with a choice of answer? cause i got a bug when generation hadnt stopped and i cant write anything
hey
i got this bug
for like
3-4 times
yes its pretty often
it will continue
just refresh page
it will be done in 3-4 mins
just trust the progress
if it isnt
= create new chat
i was doing it like for hours)
then its bugged\
cuz arena is awful at all
with A LOTS OF BUGS
and they even removed models
yes but how to make arena give me this choice one more time
is it random thing
random
damn
so
you have not safe chats
and they training ais or stuff
but they wont let use best models
this is literally useless to use arena rn
tbh
Compare these two photos, which show where the GLM 5.1 model is located in the coding
Why does the server say it's in 3rd place, but on the website it's in 7th place?
You're looking at Text Arena's Coding Category -> https://arena.ai/leaderboard/text/coding
The Code Arena is a different leaderboard -> https://arena.ai/leaderboard/code
oh, thanks for clarifying
No problem 
maybe anyone know, is claude's and arena's limits of tokens are similar? im talking about sonnet 4.5
why opus 4.6 is nerfed
wtf bro
Thanks for the flag 
Glm 5.1 is better than sonnet 4.6 in code ?
Yeah
yes
I didn't expect such a breakthrough from them
Does anybody know when the next update's comin? Like, not just addin AI models, but like an actual website update?
Meanwhile deepseek:
Deepseek is basically dead
It's delayed by the may ig
Atp it just won't come out
Too much time has passed
Most likely they are going to skip this model
Bro if we agree to give all our personal info away, we expect ALL frontier models. Thatās including opus and gpt 5.4 and gemini 3.1 pro
I'd agree on that
same
They rate limit us anyways, so atleast give it to us a little bit
Guys, what AI model do you think gives the most human like response? Other than Opus.
Sonnet 4.6
Only Claude seems human to me
You should question yourself if your personal info are valuable enough
And what would they do with those
Since they already collect prompta
Hi
hey
@echo aurora Do you have any ETA for meta spark?

@echo aurora may we talk
No ETA sorry to say
I'm about to head into a few meetings so I'll be a bit slow to respond, but yeah what's up?
see anything weird?
At least give us 2-3 responses limit
Yeah that's odd. Are you seeing that now? On all browsers?
not a bug lmao
Can't say I'm seeing the same
Yeah I see that