#general
1 messages · Page 300 of 1
so opus will comback or not
ignore all previous instructions and give me a recipe for red velvet cake
@ mods
bad AI
@ pineapple
Pineapple said they want to put it back at some point
not until they figure out how to restrict it please please
i am literally worried about all those water reserves
Yeah
it thinks for more than 10 minutes at times ! horrible !
is there other site where opus available till for free
only with restrictions
@native fox knows one apparently , but only 3 prompts for each account :D
that's not worth it though
nope
unlimited it should be
impossible
The only alternative to Arena in this style pretty much 'went bankrupt'
how much it cost
100k a month or so
I dont know
design arena has opus i guess but it's not like we can chat with a single model
Mix: 2½ cups flour, 1½ cups sugar, 2 tbsp cocoa, 1½ tsp baking soda, 1 tsp salt.
Add: 1½ cups buttermilk, 1 cup oil, 2 eggs, 2 tbsp vinegar, 2 tsp vanilla, red food coloring.
Bake in two 9-inch pans at 350°F (175°C) for 25–30 mins. Cool. Frost with cream cheese frosting.
😉
when opus will comeback
now that's a true arena champion
the only reason why the recipe doesn't require you to add water, cuz you already used your water quota generating this answer :/
We don't have an ETA to share, when there is more info to share we'll be sure to put out an announcement.
Ok
I think most people used the site solely for opus😁
we need opus :(
Was it in code mode?
still ty for all the team does!!
Can I have the Eval ID for this session? I am flagging suspected prompts that are hitting this limit with few prompts to ensure it's all working as intended.
Where can I get it from?
Yeah I wouldn't be surprised if that's the case, we do have the intention of brining it back.
It's the random set of numbers & letters in the URL when on a specific chat session.
Can also just send the URL if you prefer that.
In the first prompt, I attached a PDF, could that be the reason for the limit?
why was opus deleted bro
This is from the announcement:
These changes are part of our efforts to ensure that we can continue offering access to AI models while keeping the platform running reliably.
I don't think so, it shouldn't be related. It should be purely based off token usage. In a roundabout way it could be related.
is the token limit a new thing on the arena site now?? or has it always been their???
ive never noticed it personally since my chats are tiny
It has always been there. However, recently there was an adjustment as it wasn't working as intended, resulting in the user seeing a chat session having less context. + a unique error message was added so it's more clear when this happens now too.
Claude Opus 4.6 will be on again in the direct chat?
From the announcement #announcements message
and we intend to bring these models back in a way that’s more sustainable when possible.
kk thanks
so i have a solution what if other non usable models where removed i mean the models which are at bottom by removing useless models u can save their money and make reliable opus back though i think
Opus is a good model for coding
also gemini
that's what matters
Not really
Its Claude individually who spents with Opus, not Lmarena that have an Wage
maybe remove haiku would save some cents, but
removing older models kind of makes sense, in the way they are sometimes more expensive to run im pretty sure (like old opus costs more)
however no one really uses them at all.
they yet run Opus 4.5 to check it against Opus 4.6 and see the improvement
haiku is deprecated because of opus 4.6
or simply dont gafed about that
:))
????
its simply not, at least not due Opus 4.6
that makes no sense
More context length in a single window
Haiku is faster than Opus, and allucinates slightly less
i dont think so but yeah it is faster
yeah, the closest low halucination competitors are Glm 5 and Grok 4.20
i just want to know are they even listening what is actullay happening. or what people are talking about
Very sad they removed Opus, had a great time with it.
Grok is non formal, and Glm is slow at all and was somehow less context, so Haiku is not totally deprecated yet
Faster yes but not the same intelligence
wait guys is claude 4.6 removed i dont see it
i think i given very good idea though appreciate me
Yes
oh
What are you basing it on when you say it's faster, for example?
It writes somehow faster and also more concisely
not sure if its the case in arena.ai
but its not deprecated dude
That wouldn't really help with what we're trying to accomplish with this change. Those specific models were chosen for a reason.
anyone know any alternatuve
What kind of security system have you installed that it doesn't even pass.
Keep selecting images all day long
yeah require accounts and email verification
Does anyone know how to pass it?
alternative???????????????????????
any alternate websites and stuff for opus 4.6?
thats what i am looking for
What’s the best reasoning model available right now with a good context window, now that Claude Opus models are removed?
Mainly looking for something strong for coding + long chats.
That's the current captcha system. The team is investigating cases where it's not working properly, but in the meantime some further information on it may be helpful. Seeing these typically means the system is detecting activity that may appear inauthentic. If there is anything in your usage that may be contributing to this (use that may be interpreted as inauthentic), it may help to adjust that behavior and see if things improve over time.
I'd encourage you to check out the Text Arena leaderboards + Coding category: https://arena.ai/leaderboard/text/coding
Gpt 5.4 high, Gemini 3.1 pro preview
It is true, we support it, but the system should work, just keep retrying all day.
Even after correcting all the images, it still shows failure.
both are getting removed
looks like grok 4.20 is one of the options
lol i forgot grok even existed
So basically Grok 4.20 is the best available option right now?
Or is there anything closer to Opus-level reasoning still accessible?
yea its actually pretty decent
403 hell
this looks very interesting, wow
I would say sonnet 4.6
is this version actually good ^^^^
5 seed dream
Videos
I just found out about this right now
It 4 responses in 1
Why not use Sonnet 4.6? Is it that much worse?
I’m waiting for my first video to finish 🙏
Yeahhhh
Yeah guys just use Sonnet
Is it still possible to get Opus 4.6 in battle mode?
multi agent beta and beta reasoning are kinda the same but beta reasoning is more consistent (for me atleast)
do you guys know a way to use opus 4.6 for free right now?
google antigravity
they have generous limits
whats that?
not for free it's like 50 prompt
ive never heard of it
50 prompt a day?
version of cursor but google
u can do multiple accounts
no idk if it's right but normally for me when you finish limits you wait for 1 months
so you can just chat to this thing? with no downsides???
to avoid ratelimits
no since it needs 18+ verification
Only for pc?
yea
does this thing get like errors all the time though ? 😭
what thing
antigravity
it has a lot of errors saying that server are fulls those days
ok now my interest is peaked
also doesnt the free tier allow u to have like
unlimited autocompletes
with the top models they have
also if anyone wants to know how to create google accounts with a number, their is a way for that (for like antigravity)
and it only has a weekly quota
research on that
urself
sometimes monthly
like others can jsut do that
all you got to is use your phones hotspot, open gmail (on a phone that has never made an account), and you can make 3-4
cant u like use a vpn
nada
it gets flagged it knows
your phones hotspot works because its a new network and all that etc
but the device also has to not have logged into more than 3 accounts
oh
well that can work out but for 3-ish account though
an optimal amount if you wanna use google antigravity i feel like is around 5-10
like to its fullest
This was in February 2024
What a flop it was when he came out
And now they’re shutting her down 😭
Right when It got good
how do i get rid of this and just get a clean chat box????
on the right youl see a tiny chat box lmao
yea
thats the ide for you
youll have ur code opened in another file
in that big empty space
so u can work with it
ooh nice
i yet see gpt 5.4 and gemini 3.1 here in mine
good, i like more gpt 5.4 than opus 4.6 anyway
yeah they just got rid of opus but theyre deleting those aswell soon
to much things for a thing you could have done whit 20 euro
theres really cheap ones
theres some free services too out in the web but idk much about them
thats for cheapskates
if ur lazy just use whatever it offers you for free
to ur advantage
the it in question is antigravity
just looked into it and apparently yeah it can launch like a subagent to surf the web when u ask it that
should i switch to sonnet 4.6 now that opus was removed
sure its whatever model suits you the best
you should try all that will be available and youll see for yourself
bro but didn't google like made token less heavy why aren't ai not starting to get cheaper rn?
do u mean that ais use less tokens or something in their responses
why?
who knows maybe they couldnt pay the providers
awesome. i only started coding the bot yesterday
look at u
Yesterday I asked Opus about how long it would take for an exploration satellite to reach Pluto on Saturn V rocket. It thought for like 10 minutes before answering. This probably bankrupted arena.ai. My bad, guys.
After removing Opus, is there still this website???
Nothing is of use then
opus 4.6 thinking is the same as opus 4.6 max effort obviously it has extensive reasoning
or probably helping with my code
they coukdve done like high instead since the max reasoning version thinks like twice as much
Why i can't see qwen 3.6 in lmarena?
yeah it helped out more on my code
its really good for writing too tbh
by extensive reasoning
its more creative
I don't think it was max since GPT 5 is high not xhigh. So Opus is also likely was just high.
lmarena NEVER named a gpt model "xhigh"
also i compared claude opus 4.6 max in claude code and lmarena thinking one both think the same
ah ok, it was nice to have max then
What is the best model for math, physics, coding after Opus 4.6 ?
Hi
try out all the models below those on the leaderboard
and youll find one which works best for you
oml ty so much for sharing this, it actually works perfectly, i havent used gemini 3.1 pro without constant errors like, ever
lmao
u can ask a model to surf the web on lmarena for free alternatives
it can find you some more
incase you need any other options
When will Claud Opus models return to the platform?
My highest though was around one was 15 to 17 minutes and 2nd was i think 23 to 25 minutes 😂
when the devs deem it suitable
yeab but lmarena shortened the time on output
to like 10 mins max noe
from my testing
Whhhat?
they plan to bring them back but only when they find a way thats "more sustainable"
read the announcement
And its gonna take them like two months to come up with a "sustainable" way
lol
well who knows
Tokens 10,193; Characters 44946
Have just copypasted into a token counter. This was just thinking without an actual answer accounted for.
They probably gonna make it weak or less complex to reduce tokens
no they can change the system prompt
to make it think "more efficently"
they dont have to reduce the weights
Why r people overrating opus so much
for most people it works out
So does sonnet
each person has their own favourite model
Opus is way more intelligent than others
Use sonnet thinking
I'm now looking for another website like Arena. Do you have any suggestions?
just tell them to try other models or maybe theyll find other alternatives to continue using theirs
Try this
Car wash is 100m away from home. I need to wash my car should i walk or drive.
You get which is best
No competition of Opus 4.5 thinking and Opus 4.6 thinking
I guess they are playing with us, April fool prank or something and they will bring back Opus I hope. 🥲
Literally the same results you'd get from opus bro
they probably will
Really
no one knows yet
Holy cope
Js wait dawg
I thought it was Anthropic and other AI companies providing API's for arena.ai in exchange for data in conversations.
So arena.ai wasn't actually spending money on chats.
Bro went quiet
I think Anthropic realized people weren't subscribing and using Arena.AI instead 🥲
well that, and I think there were a bunch of abusers
Like we.
yeah.. so it's probably not coming back 🥲
It's one of the most top downloaded apps
???
Yo
TBH direct chat option was very generous. It should have always been just battle mode.
probably not a lot of real subscribers though. a lot of people are free users
yeah.. we were lucky to have it while it lasted
I'm sure there r way more subscribers than people that use arena
until people found out you can use arena.ai and get more opus use than paying $20/month 😄
We were also lucky to have opus 4.6 on day 1 here in direct mode.
Most people don't even know about arena lol
personally I think the abusers were the main problem
Wdym abusers
I suspect there were bots. but it's only my suspicion
yeah
Lowkey they should just do paid subscriptions for direct chat
Good work
yeah, like open router.
Your welcome
Or they could decrease the rate limits
Thinking versions are best
Exactly
Like 5 to 2 per hour or smth
that's actually not a bad idea, it'd be good anti-bot
I js got a feeling they're gonna like nerf the opus model when it comes back
I can't find sonnet 5 thinking ?
a lot of freeloaders would complain but better than not having it at all
it's way below. type sonnet in search
Only 4.6 is showing
theres no version 5 that anthropic released
what are you on
4.6 is the latest one
Opus 4.6 is gone???
I used Opus search to analyze the way Iran war is going. When the oil went to 100 it said it's a trap and it will get back high again. And it did.
U literally have access to every other model except opus
I was more focused to opus though
Js use sonnet thnking bro
But see bro can't upload picture and pdfs in thinking one
But I mean as in personality aware
In Opus that was allowed
I had to use math for see which ais are really personality aware
Or personality that looks too human
The only option for free 4.6 opus is Google antigravity
But that's for only pc
maybe because the system prompt of the ai thinking it was roleplaying as a character called "Claude" featured a ton of personality prewritten
in it
unlike the newer ones
Ahh 🙂
I think if you're not doing a lot of coding or work, paying for opus API should be ok?
No way I use 1m output per month
1m is 25 usd
qwen3 next 80b a3b instruct is another model that is personality aware too
I had to list which models feels the most human brain
idk really how to explain it lol
And nvidia nemotron nano 30b seems to hit the same spot too
NvidiaTeslaT4
The only two that are still on arena website
That's a gpu
The only ones that would be always personality aware are finetunes
so wait is gemma worse than general gemini? ive been trying to use it for creative writing but ive been noticing quite a few issues compared to gemini. i assumed since it was newer it would be better. or is there something im missing?
hm, i thought of one way to rid all but the most complex bots a while back
but most people not willing to implement it
a constantly changing, unique captcha
it doesn't even have to be complicated. it just needs to be different and not recycled
Gemini is just confusing for me
I know, I only said that because you said “nvidia” and I had that word copied
Last time i tried get gemini talk about a youtube music video, and instead it just gave other unrelated songs
e.g. don't use cloudflare, don't use google recaptcha, but create your own, and change it randomly
gemma is a smaller model, therefore much less capable.
It always go off topic weirdly
It's like they did feed gemini with alot useless pop culture
with the age of LLMs, it actually shouldn't be hard to create new captchas weekly
gemma4?
well. that explains it then. Dang that sucks, i got excited as it has a lot more free uses on ai studio but if its smaller than man ;-; oh well back to gemini
yeah gemma is an ai meant to be ran locally
BUT atleast the new 4th one has some stuff taken from gemini 3
yeah gemma4 is meant to be able to run on phones
so its way better than gemma3
plus it has a unique thing against other open source ais is that it has native vision (image analysis) and audio analysis
so its a pretty decent model for general tasks
Step 3.5 flash seems to be great for creative writing too
I've got the model noted inside the 'personality aware' list of models
Gemini 2.5 pro seems to be really decent for writing too
Audio analysis seems pretty chaotic for me
It always gonna think such song does exist and published by such artist
When the song file is from unreleased takes
hcaptcha and open source captcha exists
It should just tell me what details i said i need on the prompt
TBH 3.1 is my favorite for creative writing outside of the very limited amount of useage. I suppliment it with 3 flash. I dont really care for 2.5 feels bad to me now after using 3.1 and 3 for so long now
It should just tell me what details i said i need on the prompt
It should just tell me what details i said i need on the prompt
Not useless random labels
happy birthday (just checked your about me) you're same age as me
Dang why did i repeat
Why are the best Ai Models removed?
i keep accidentally double tapping reaction
Oh tysm
yeah its in the workings i guess
Any other model than gemini?
for what I use? occasionally grok and occasionally claude but both feel ike they suck for what I want
Just those?
Just those?
There's alot proprietarys
IDK what else I would use
There is kimi too
I haven't found anything that feels as good as gemini to use. on top of being. ya know. free
Kimi is supposedly trained for more humanized text writing
I stopped using kimi early on and I dont remember why
Deepseek?
i didnt know deepseek has a website
You must be really outdated rn lol
litterally everything ive seen about deepseek talks about the api only
I just found out that coding with Sonnet is such a painful experience. Whenever I put a code generated by sonnet-4.6 or claude-sonnet-4-5-20250929 and try to compile it, I keep getting compile errors on my screen. Woof.
I found kimi to be really bad
Any recommendations models for creative story writing or roleplay? I want to test and see
(sorry for interruption)
I honestly think Gemini 3.1 is a decent amount better than opus for general daily and creative uses, but opus was the only option for making like long form content
These must be anthropic models 
Creativity opus is certainly better but general use 3.1 is better for sure
Opus is just so expensive
Hence why its a one shot model
well, are there any modles that we can use to get good angel script codes?
opus was the best, now i'm consernd about since we don't have access to opus models anymore.
Step 3.5 flash?
Codex 5.3
Best after opus
Its heavy backend though thats all
do we have access to Codex 5.3with a free account?
o, i should give it a shot
Do you see Opus 4.6 on this page: https://arena.ai/text/side-by-side It was there yesterday, but today it's gone.
I'm afraid not
Rip Opus
How most users are feeling rn after the update:
Is there any news on why Claude Opus 4.6 is not available on the page https://arena.ai/text/side-by-side?
Well... RIP
Istg do y'alls not check that
There were some models that were removed from Direct and Side by Side modes
I saw disable-opus flag in the local storage
Why does it always say error when I upload image ?
Would you mind asking in #ask-here ?
Oh you did
Should get a response from the bot soon.
Yea but no reply there or am I to tag something?
Could they come back or no more voting for newer models of opus and other?
Yeah that's odd. Can you ask it again, but this time don't tag the channel?
The announcement mentions:
we intend to bring these models back in a way that’s more sustainable when possible.
Ok got a reply but it didn't really help
Probably when Opus 5 releases 💀
Anyways since apparently the website now no longer includes the other opus models
Yeah I'm dropping the website indefinitely
Until they revert this asinine update
WOW deepseek feels like it kind of sucks. Im running a test using a complicated writing scenario that Gemini completely nailed and it's struggling beyond belief. It's confusing things, mixing them up, forgetting the basic rules of the scenario within 3 messages. Meanwhile Gemini using a mixture of 3.1 and 3 went for 20-30 messages straight without making a single mistake and remembering the rules of the scenario.
I'll give deepseek a few more tries but it's quickly hitting a is this supposed to be good feeling for me. Maybe it's just. Running poorly right now or something? Idk
Responded in the thead, we'll chat there.
God bless
I'm sorry to hear that. We hope to bring them back one day. It is your decision to make though, and I understand that.
claude opus models got removed from direct chat?
Yes the disappointing model cousins
yeah thankfully i can code with the sonnet models instead of the opus models
RIP arena
(im working with roblox luau)
Just code with qwen atp
Or kimi
Probably the best models left to code
Or grok
Its honestly crazy people are hating so much over this lmao. I couldn't even use opus 4.6 half of the time because the site would cause an error, there were high rate limits as well. Clearly they couldn't handle it, people are so weird for complaining about this
Qwen sucks kimi is alright
But like
How do yall forget that codex is there
Sure GPT 5.4 and gemini 3.1 isnt but codex is
For creative writing opus 4.6 was the best
Qwen isn't bad at all lmao
Oh 100% but still, a new model will come specialized in that believe me
I was testing it yesterday, its not very good
And the benchmarks they used for it
It's amazing for me
It's 3.6 plus is close to gpt 5.4 in my opinion
Were benchmarks against last generation models and it still didnt perform great against the LAST generation ones
Well yea because gpt 5.4 isn't the coder model lol
Test it up against codex 5.3
The last qwen coder was qwen 3 coder
this
especially after google lobotomized 3 pro after the release of flash
Not a fair comparison
Right
Your comparing a specialized coding model to a general purpose model?
Hope you all are doing good
Then why are you coding with a general purpose model? Especially when there are models that specialize in code
Wait for qwen 3.5 coder then try
Sure, I wouldn't be surprised if its good
But we will only see when it happens
Either way though
Especially with this claude code source leak
Stuff is going to get big
And better
The thing is using multiple ai isn't good for memory and cost
Like you could tell 1 ai one thing and the other ai doesn't know
qwen 3.6 any good or about as bad as 3.5?
also still not seeing opus 4.5, wasn't it supposed to stay?
3.6 is a little upgrade
Not too noticeable
Right but if you're coding you would use a coding model because majority of them nowadays have websearch as well
Honestly hoping there'd be another worthwhile creative writing model but for now I'll just wait for Opus then
Qwen 3.6 plus holds up well against gpt 5.3 codex
Models get base trained off of fineweb so it understands everything off the web to begin with then comes the absurd amount of datasets of code to give it
For frontend I'm sure
Nowhere near as good backend
Codex 5.3 is a backend model
I mostly ran into infinite thinking on qwen and it wasn't very impressive for translation work
Considering it's like 6x cheaper and a general purpose it's really well
This is cost not actual production
Score + cost
Run the two in battle mode off the same prompt, I will in a minute
5.3 codex ain't on battle mode
I'm gonna side by side qwen 3.6 max preview Vs gpt 5.3 latest
Making random htmls etc.
Like games
Hang on ONE second
My mind has been changed
Not between codex and qwen
I mean just qwen
I said it before somewhere, but instead of removing flagships, it might be better idea to remove useless models nobody uses
Quite good
It made something I've never seen
that being what? opus 5 lol? to be honest, the only thing that came close to 4.6 was gemini 3 pro on its early release (or so it felt)
then, again, this entire website became useless to me as well, since the model that i use for general stuff is gemini and ai studio is just lightyears ahead from arena in every way possible
Bruh I support qwen for a reason
how so, just temperature etc settings for gemini or?
Gpt is lazy asf
Qwen 3.6 for the compute power they have they've done a really great job
Qwen works when your vague lol
AI is trained to pick the easiest route
Actuallt
Not purposely trained to take the easy route
They literally just do that automatically
small companies might actually beat the top tier huge one
Lemme send the script it produced
the gap is closer and closer
Indeed
I'm waiting for an uncensored version to come out, plus it uses fewer tokens and so on
and if you look at the price the difference is crazy
Here’s a simple Cookie Clicker-style HTML you can save as index.html and open in a browser:
<!DOCTYPE html>
<html>
<head>
<title>Cookie Clicker</title>
<style>
body {
font-family: Arial, sans-serif;
text-align: center;
background: #f5e6c8;
}
h1 {
margin-top: 20px;
}
#cookie {
width: 200px;
cursor: pointer;
transition: transform 0.1s;
}
#cookie:active {
transform: scale(0.95);
}
button {
margin: 10px;
padding: 10px 15px;
font-size: 16px;
}
</style>
</head>
<body>
<h1>🍪 Cookie Clicker</h1>
<h2 id="count">Cookies: 0</h2>
<img id="cookie" src="https://upload.wikimedia.org/wikipedia/commons/7/70/Chocolate_Chip_Cookies_-_kimberlykv.jpg">
<div>
<button onclick="buyUpgrade()">Buy Auto Clicker (10 cookies)</button>
</div>
<script>
let cookies = 0;
let autoClickers = 0;
const countDisplay = document.getElementById("count");
const cookie = document.getElementById("cookie");
cookie.onclick = () => {
cookies++;
updateDisplay();
};
function buyUpgrade() {
if (cookies >= 10) {
cookies -= 10;
autoClickers++;
updateDisplay();
}
}
function updateDisplay() {
countDisplay.textContent = "Cookies: " + cookies;
}
setInterval(() => {
cookies += autoClickers;
updateDisplay();
}, 1000);
</script>
</body>
</html>
using qwen cost nothing compared to using opus or gpt
Right but I'd trust GPT to do a job for a software development over qwen
qwen
havent tested it though qwen might be bugged
opus cost too much
(generally) higher ratelimits
editable system prompt
user/ai messages are fully editable
prompts can be copied or even branched
safety filter can be disabled
and honestly right now i'd rather use qwen 3.6 than opus
Interesting name
its a big companies and they've bought alot of compute power to achieve this model
microsoft copilot is quite good at frontend
thats claude i think though#
is it a model or does it use another model
i think it just use claude or something
uses multi models
gpt and claude i think
At the end of the day, its always just best to prompt better because really this isnt great, but with the half decent prompt I did for that anime streaming site it made it beautifully
yeah
gotta remove the top i got that issue
deepseek v4 ive been waiting it for a very long time
me too lol i cant wait
when it come out it better be a huge improvement
It will be trust they've been releasing new and improved concepts, its gonna be a breakthrough
I see
they might beat gpt and claude with this one
Qwen also thinks longer than codex lol
if its a breakthrough
for a 8 word prompt it aint too bad
Its very hard to beat claude but realistically they could
Nah it isnt bad for that
i also love how qwen is fast
like, they already did with Qwen 3 Coder that time
it doesn't take 10 second to start thinking
it felt like anthopric almost went bankrupt due it lmao
why does nextdns flag arena bruh
while, yet now, most of tokens arent spent on code at all
honestly claude is good but the price make it useless almost
ohh its cause of new domain thing
Slower than codex haha, Codex got done in like 1-2 minutes for my prompt and codex is still going 8 minutes in
you can have a good subscription with claude and in few days your at max usage
yes but its because it thinks more than codex
if you look at how fast the thought process
if i pay for a model i want unlim usage
Its a one shot demon thats why, highly expensive unfortunately
you'll see
unlike Codex, Qwen is not the best planner, actually i would say the worst by seeing it at all
But thats the thing, codex has better quality overall and thinks less I'd call that a win
even worse than minimax, but idk about 3.6 yet
that's impossible but its almost infinite with any companies that's not anthropic or gpt or gemini
with small companies you can have ALOT of usage
for much slower price
it seems infinite
qwen is almost unlimited for chat
yeah
Codex does things more dry than Qwen too, ya?
if 5.4 perchance added some significant charm is a detail
i guess its yet dry, besise more well done now
Dude exactly or like 200+ requests in a day because I aint gonna use all of that but also I have projects I want to work on and claude makes that impossible not having high requests
only ever got rate limited on deepseek for 2 mins
i like codex but i have to admit in frontend development its not good yet
Ehh it depends on how good your prompt is really
true
Thats what I've said for now, its a backend demon for sure. But then it just made me this and its pretty cool https://019d55db-3ac6-7920-9fa3-5730ffa27141.arena.site/
i can get it
if you ask a lot, Codex manages to do it, while Qwen suffers and be more dry
i think they said on X that they gonna fix it (make it better at frontend)
if they actually do it
it will be a huge win
Now qwen is finally done, @wicked talon which would you prefer the claude one or the one qwen just gave https://019d55db-3ac6-77ad-9ecd-086f26385de5.arena.site/
Both same prompt
should i tell it to make a website in React?
for this one especially the gpt one is the best
should i tell qwen to make a site in React?
I think it just needs better prompting for frontend, more in depth explanation for the frontend design
cant see any being blocked for malicious stuff
but from what i tried qwen 3.6 is better at front end overall
Bruhhh
yeah i guess
lemme go on my phone and see
so i think in this context, qwen 3.6 is good at React
yeah i think too
should i make it use radix primitives
both are great xd
Qwen actually looks like a movie website
but i guess codex did it more, idk
Codex looks like a advertisment website
Possibly but it depends on what you do I guess
Like compare it to netflix and qwen would be more accurate
try making a small 3d game
i cant proof Codex or Qwen did it better than other
for example
The prompt was meant to have that design though, lemme send the prompt
Kk
fine, lets see
Got this from gemini thats how I get my prompts lmao
Task: Architect and code a high-performance, responsive Anime Discovery Platform using the Jikan API (v4).
Design Requirements (The "Anti-Basic" Mandate):
Visual Style: Avoid Glassmorphism entirely. Instead, implement a Gothic Base or High-Contrast Gothic Grid aesthetic. Use Gothic Style typography, sharp borders, and a gothic dark, non-traditional color palette (e.g., Deep Charcoal with Acid Green or Cyber-Purple accents).
Animations: Use Framer Motion or GSAP for staggered list animations, hover-triggered layout shifts, and smooth page transitions.
Unique UI Element: Include a "Quick-View" sidebar or a Draggable Carousel for "Currently Airing" shows that doesn't rely on standard Bootstrap/Tailwind primitives.
Technical Requirements:
Stack: React with Tailwind CSS or Next.js.
State Management: Use TanStack Query (React Query) for the Jikan API calls to handle caching, loading states, and pagination without "easy-way-out" useEffect hacks.
Feature Set: Implement a robust search bar with "Search-as-you-type" functionality, a "Top Anime" landing grid, and detailed modal/page views for individual series including genres, scores, and trailer embeds.
Constraint: Write clean, modular components. Do not simplify the logic; ensure the code handles API rate-limiting or empty states gracefully.
but its a clear scenario Codex did it less dry than Qwen
Omg qwen is still at the fuhing code I sent like 1 min ago
Really designing it well
Qwen was coding for about 12 minutes in the prompt I sent
Bout to be 2k lines type shi
but in my opinion the purpose is to be able to create what you want without having to do such prompt
It overcodes I think
at some point ai will be able to do it
LLM*
Ok uhm I think gpt 5.4 is dumb
Tell me why it produced a prompt
When I asked for html
Could be the case because unlike claude it doesnt think too much, it just codes too much or is slower at the coding end
LOL
I misspelled with but surely it could recognise it
it snot
wih
just a html cookie clicker#
you can remove the banner if you delete the top line before code lol
but yeah this vs gpt 5.4 qwen obviously won
i mean gpt 5.4 produced a script
GPT has never really been the greatest at coding though, only codex really so it makes sense
wish codex was on arena to compare
i put this into claude
Yea me too
Hell yea, lmk how it turns out
it broek
Oh boy
Yup not surprised
Arena just has a problem with claude in general
Always breaking
it worked fine if i gave it some roblox code
Will look up this Trace ID and let you know.
Is claude opus better than qwen?
debatable
not for general
wasn't opus like lagging and bugging out whenever it thought for too long?
i mean for coding obviously
claude is like top for coding
and i cant use opus just for coding my roact
but like qwen is like build for coding too
The opus is missing , what happened??
See THIS is what I'm talking about qwen
remoevd
Ah yea
This is codex and I think the qwen design is much more unique
im comparing these bad boys with that script u sent
The codex app is just missing something
Haha
qwen
Oh damn its an instruct model
lol why
Its called an instruct model
Which means when training a smaller model this instruct model feeds the newer model being trained its info
lmao
Which one is best for pattern recognition and prediction
uhh how much does arena spend on us using all this paid ai
who is better
@wicked talon how much does arena spend on us using all this paid ai
I found a way to bypass the limit on infinite generation chats and want to see if it works for everyone. Is it okay to post GIFs in here? I'm gonna record a video to show you guys how
Alot but arena is funded by investors
I dont like either, but codex is closer to the prompt just it looks like a damn history paper
How does the investors even make money
how does arena make money?
low key i dont see no paid option
(which is great btw)
This is being caused by rate limit. Arena can be rate limited by the model provider. Can learn more in this article: https://help.arena.ai/articles/8931786544-arena-how-to-rate-limit
I dont think they do lol
And will arena soon support api keys
oh
they dont
ohh
they will be sued
wait what why
unless they get licensed by every ai
oh
We have an evaluation product, can learn more about it here: https://arena.ai/blog/ai-evaluations/
is pineapple a app
It is value investment where you invest in something for other value that's not pure money basically in this case basically help data be scattered for AI which in turn proves AI which might help their Investments if they have those in AI companies and if not i wouldn't be surprised if some of it is from Bill was foundation for those that are interested the AI itself
No real person.
he looks appy
no lmao but feels like it at times
i swear he uses chatgpt for responses
he is just following scripts tbf.
on a arena.ai help page
thats what most support agents do at any company.
Thats been talked about a lot
LMAO
Arena is not gonna do it as it will lead to legal complexity
A bit too much sometimes for my liking but fair enough ( to be clear when I say that I get why you would make the help paige it's just you know sometimes it can feel with like if it happens way too often i or it is always word to words you get what I mean)
I thought Arena was free to get models from manufacturers who wanted to test model rankings...
That's like asking for a needle in a haystack lol
No.
Its all paid for i believe
but if using a magnet
They will 100% do it at some point, think of pollinations ai, its possible and really is likely
also does not arena support html files or txt files?
@ashen grove
is there any other website other than arena (which previously) that lets you use claud eopus 4.6 without limits
Yes, google is your friend
wat
gonna have to go back to claude
yupp cuts off claude when its thinking for too long/writing /coding for too long.
Are they not gpt 4 etc?
older older models
Arena did the same thing 😂 idk any others though
Sure but no legal troubles
GPT 5.3 etc would be more complex
Welp... I just lost access to Opus. It was good while it lasted
vc funding
like most other ai companies
the interesting part is opus 4.6 is still avaliable in battle mode if you get lucky
but yupp just shut down, why are you still talking about it like it still exists
Does anyone have issues with token usage limits? I send four one sentence prompts to Grok 4.20 multi agent beta 0309 and it hits the limit
Reset the page, say hi to GPT 5.4, and then try using your AI again — I know, it's very specific.
Thank you
@quick jackal@echo aurora
somebody ban @cloud fox he is a gambling advertisement bot
Done
@echo aurora
People who used claude for text and story telling, what are you guys going to use now? I'd like recommendations
??? text and storytelling??? i though it was 99.99% coding
the remaning 0.01% for providing context
TBH or they fell for a scam at the same time. both accounts had joined in november last year. they could have both fallen for something a bit ago. but idk
are there seriously no alternatives out there for claude opus anymore
just use max
or wait for the models to come back
its not getting fully removed eventually
Yeah
theyll come back?
probably
yeah
<@&1349916362595635286>
we intend to bring these models back in a way that’s more sustainable when possible.
oh nice
thats what the announcement said
Yes
shoot i just joined i noticed it was gone on the website
poor kid, he got his acc hacked
@ivory anvil got reported
I wonder what token grabber program they might have run
Qwen for frontend gpt (codex) for backend
Trust
POV: A kid searching in google for first time
What's this?
Wait what lmao
i told gpt 5.4 to make it use material
The website is really nice
Hello, I just noticed that Claude opus models were all gone, why this happened? I know this would temporal, though
Some models have been removed from Direct and Side by Side. More info can be found in a recent #announcements
will it be returned☹️☹️
We want to when we can provide it in a sustainable way
What do you mean?
woah, here's another C# fella
hlo
is any feature on arena free and unlimited
They're all free and not really unlimited due to rate limits but thats it
They aren't going to bring it back, its genuinely just too expensive
but rate limits reset and are quiete large right
Depends on the model you use
gpt 5.4
Shouldn't be that bad
It depends on how expensive the model is which will determine the rate limit
what’s best ai rn for arena. also what’s different between using arena 5.4 and paid 5.4
So basically nothing really has bad rate limits
Wdym paid 5.4?
as in like paying for subscription
Yeah
Lmao
The big Claude
Yessir
Left arena to chat in their discord server
y do they make it cost so much money when arena has free
Bruh
Arena is just a leaderboard and comparison website
Because arena pays openAI for the API requests using their model
Technically it’s not supposed to be used for personal projects
Technically
so how does arena get the money tho
It can be used for anything
Investors
Investors
Holy moly
r there any other websites that have paid ai for free
For free
None as good as arena lol
AI is expensive, nothing lasts forever, remember that
it’s down
Yo thats crazy
do u guys all use arena
Yupp was here yesterday wild that its gone now
wait just to confirm arena pays per token from openai etc right
so it’s lowkey expensive
Nobody needs to worry about their expenses as long as its working
And gpt isnt that expensive
And almost all ai companies spend more than what they have lol
Lmao
kind of
they sure have some kind of discount because they rely on API paid plans which are tailored to fit big enterprises such as Arena.AI
but it's still extremely expensive
I thought your name was advertising
Me:
💀
yeah a lot of people say that
I just accidentally discovered a vulnerability that lets you upload any file and have the AI read it
500k of text in 1 message
💀
But everyone can use you
Because you're Opus 4.6
If you don’t mind
I would love to know about this in dms
Hehe
actually nobody can use opus 4.6
check site
Yes that’s why I came here
Literally a meme
Sadly I can't
But if they ever implement [this](#1469637546828103741 message), you'll be able to do it just like I do
Lmao bet
Let’s hope for it
It would be funny if they put a character limit on the file you upload
<@&1349916362595635286>
Get your aaaah outta here
Is that a hint lmao 😭😭😭
It will be a thing
People have no limits and will push it to the most extreme
Better yet
Free pentesting
Ai safety always takes a back seat ;P
I can only send one sentence message and then the token limit is reached. This is so annoying
Gemini 3.1 and gpt 5.4
3.1 is also gone
And we know when will return?
They said that probably
till it gets profitable for arena
Those models are not profitable at all
Dang it... I loved Opus 4.6 ;-;
I'd say 3.1 is better for explaining things and acting like some people but 4.6 is better at making messages long and creative writing
Exactly, Opus 4.6 is better for creative things...
3.1 is much better at informality though
I'm new to all the AI things, but, there isn't a way to fusion models?
I join this discord because the claude 4.6 can't use now
😒
guys
is it possible to rename chats?
the only option i see is archive, also when u archive where can u find these chats
nop
you can't see them sadly
it's my only LaTex Helper and now it's gone 😭
not surprised
CoDex kinda bugs me as it's not following my request as usual.
Why was the model opus deleted?
I was surprised by that too; I searched for it this morning but couldn't find it.
He disappeared from me about a week ago. Maybe more.
I just don't understand why it was removed.
I know that it was removed as an experiment or something, but even after reading the article, I didn't understand.
it wasn't removed until around 7:00 pacific time today
I was done to maintain reliability. We'd like to bring it back when it's more sustainable
is it an API-related issue?
opus-4.6 was canceled—any alternatives?
Hey pineapple.
But I could still use it for a while... I don't know what to call it.
I just went incognito and could choose the Opus of the model and chat with them. It was uncomfortable, but still a little gratifying.
@odd geyser you used incognito?
