#general
My dude I don’t know where you’re getting the idea that it’s 3.2 exp
I genuinely don’t
yeah but i tried to make it more fair cuz deepshit is like 0.5 terabyte
alibaba's tiny models are smarter than deepseek v4 pro 😂
3.5 nah
3.6 is soooooooooooooo better than the 3.5
Qwen’s models are great
like the 3.0 and 3.0 2507 the difference is sooooo big in practice
I just wish they would stop limiting their big MoEs so much
Guys does anyone know a good model In science and creativity?
the 31b in someeeeeeeee thing is a little better
oh my god finally
3.5 was still better than gemma4 and deepseek v3.2
it better be good
three fiddy it wont even run
Deepseek V 3.2 Speciale had a good rating in CritPt for that era so we can say that Deepseek can be good in science and research. But I need to check how V4 did in CritPt in AA. Sorry I didn't read your response well initially. I believe that some models can be good for science and creativity.
Tbf it's also fifty times cheaper
Does anyone know Hofburg_2_alt or is it a new model
mimo is actually decent for frontend ngl
they didn't have good GPUs i guess
they had to use chinese gpus
they can't use nvidia
its not allowed
I mean its obviously AI you can see it from the background, its not that creative but yea
will the new deepseek model be free?
Its already on arena
5.5 frontend
oh
gpt image 2
yes codex + gpt image 2
it used image2 to generate the assets
guys, can someone explain me what claude code is? I need claude opus (because any other ai can't help me with my mod), I heard that sub costs 25$, but what is claude code
lol
- opus sub costs 110$
- it gives you 5 requests per day
just use a chinese model bruh
claude code got leaked
you can use a better cheaper model with it
minimax m2.5 free api key go brr
free and unlimited via my method
yeah but its bad
okey 💔
Yeah even as a DeepSeek fan this is bull
What did you actually think is good besides Mimo
Every model for you is bad
Kimi is bad
m2.5 is old
DeepSeek is bad
still being used ✌️
2.5 is still very capable
its bad
Eh to each their own
I’ll have to test it out some more today to give my true thoughts
cause your giving it for free
What?
yes
You do realize 2.5 is still more popular than 2.7 on openrouter dude
Well too bad for them
whhen
Well yeah there seems to be a certain amount of benchmaxxing here ig
I think there’s just a lot of hype around models
Not everything needs to be SOTA
Again you can prefer what you want
Lol
18gb
For decent q4 quant
16gb for lobotomized with small context
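A rough back-of-envelope for figures like these, as a sketch only: weight memory is roughly parameter count times bits per weight divided by 8. The parameter counts below are illustrative picks, not a claim about which model is meant, and the estimate ignores KV cache, context length and runtime overhead, so real usage is higher.

```python
def weight_memory_gb(params_billion: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in GB (1 GB = 1e9 bytes).

    Ignores KV cache, activations and runtime overhead (assumption).
    """
    return params_billion * bits_per_weight / 8


# Illustrative sizes only; a nominal 4 bits per weight is assumed for "q4".
for params in (20, 27, 31):
    print(f"{params}B at 4-bit: ~{weight_memory_gb(params, 4):.1f} GB for weights")
```

Whatever is left of the card's VRAM after the weights is what the context has to fit in, which is why a tight fit ends up with a small context.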
Thats why they should release 20b models 😭😭
nvidia should stop gatekeeping vram. Although that's even less likely now with all that hoarding
Is DeepSeek v4 not updated in their own web?
so many consumer gpus are essentially memory limited on purpose
it is
🙂
its not even that
there is NO ram shortage
its ram OVERDEMAND
because openai, google, xai went:
"hey nvidia im buying all your ram for next 10 years, im paying instantly, 10x more than normal customers"
I mean nVidia were doing this before hoarding was a thing
claude is his own person
So now they are even less likely to change that
high vram exclusive to expensive af enterprise gpus
tbf yes
7900xtx has 24gb vram
costs less than 5070ti
Guys for yall which one is better, Mimo (new model) or Deep seek (new model)
scammer
It is no-bullșit claude
id show you something gpt 5.5 said but its too vulgar to send here (no jailbreaks btw)
whats the best AI model i can run off ollama>
qwen 27b
deepseek is worse wdym
This should be illegal and forbidden , some stupid people mixes risky business with gamebling at this point :v , then complain about a financial crisis and dot-com boom etc , stop buying fishes in the sea and your life will be ez :c
how illegal?
paying in advance is completely legal, and even encouraged
imagine going to car store and saying "im buying them all, heres 10,000,000,000$
Yes , and this should stop being the case , how crazy it is for someone to pay 10 years in advance ?
thats what xai google and openai did
bro if you paid for your wifi subscription for 10 years in advance thats good not bad
Well buying the whole stock is fine , buying imaginary numbers in the future it is stupid
its on them, if they can afford it then its good
yes its stupid but THEY take the risk
lol
DeepSeek looks better
its stupid FOR THEM if it fails
How do you generate ai videos?
yeh i agree
What If the wifi company decided to disapear at some point ?
Saw a yt vid that you join the disc. Not seeing any channels for it rn
then you got scammed, pity to you
Hello! The Video Arena is only accessible through: https://arena.ai/video. More information on how to use Video Arena can be found in this article.
The Discord bot was removed from the server and is no longer available. This announcement has more information about the removal.
Life saver. Ty!
Most welcome!
Or what if wifi technology becomes cheaper or less important and slow comparing to modern technologies etc
Unless you are a giga company never pay for many years in advance , those big bros are paying from other peoples money not thiers , bwahaha
then its good for nvidia because they overpaid?
...and they are giga companies
they earn more money than gdp of smaller countries 🤣
is grok 4.3 not in the arena yet? no suspected codenames?
grok 4.3 not released the api yet
hi guys
hi
tested the v4?
❌
🥀
give me gpt 5.5
True , and they cause catastrophic nation wide or even international crisises ,
Oh like the RAM surge we are seeing these days haha
I can't lie but i think that DeepSeek and Mimo are both overhyped
predicted the future💀
arguable
its not top 1 in coding and its also not the smartest ai ever
The funny thing is that the guy said yesterday that she would never come out
fr
It is only good because it is 1M , also energy is cheaper in china , and they do not over price or woke restrict their models as @lunar swan said in a lengthy article on his ai magazine the other day
I feel like if china started to use ai for military and other state related tasks , the avaliability of these things will be very low and therefore they need to reduce the quality to keep it going
its still reasoning just low reasoning
i can't wait for it to be even better at frontend
the release happen so fast
gpt 5.6 soon
That's u opinion
It is reasoning tho
Claude 4.6 thinking ; was horrible ; it could easily be stuck in a thinking loop , , low or medium is sometimes better than high esp for someone who dont know how to properly prompt a reasoning model
Nah
😭 🙏
Mimo is actually good for frontend
Mimo sometimes make some weird menus
but yeah
Mimo is really that good?
Not really, sometimes you have to be really specific at some unnecessary points
Any Chinese model is basically total junk, except maybe GLM.
Confirmation Bias.
Not really
Kimi K2.6 is awesome
It’s a very nice alternative for Opus 4.7 right now
GLM 5.1 I expected more from
What about Deepseek @proud bobcat
When will gpt image 2 be fixed?
Kimi K2.6 is totally scarfing down tokens like crazy, bouncing tool calls back and forth. GLM5 is way more chill with resources and acts kinda like Opus 4.6.
Indeed
lol deepseek literally just say don't do it its bad ! and proceed to give you the full thing
don't even have to jailbreak anything it will just do it anyway
yeah bro
gpt image 2 is running laps around every other mf
Gemini get up bro
do something
Bro
codename for crappy chinese model
Thanks
/arena
Deepseek,Glm,Kimi or Minimax?
bro be playin
I like Deepseek a lot lowk
Ok and why? I like Glm a bit more than deepseek
GLM 5.1
kimi 2.6 > qwen 3.6 > minimax 2.7 ~= glm 5.1
glm bad
Why glm bad?
is kimi 2.6 the best coding ai thats not on battle mode
overrated
gpt 5.5 is best
not even close
i mean like on direct mode in arena
No AA score on DeepSeek V4 yet
skill issue
- most reliable model out there
Which model best for coding??
this
claude opus 4.7
gpt 5.5
People just won't ever forgive openai to have removed 4o
its not hate based on real issue its pure hate just cause they removed 4o
gpt could be 10x better than opus there will be someone to say opus is better just because of that
y'all don't realise 4o was actually making people psychotic
it was really dangerous
Right
I say as it has a 86% hallucination rate
3% improved from 5.4 btw
Hallucination Issues
If gpt doesn’t know something it will gaslight you to hell and back
Instead of just saying “whoops my approach was wrong! Let me try it this way”
No issues with Kimi on that!
Or Claude!
it didn't happen to me and even with gpt 5.4
neither gpt 5.3 codex
huh
like hallucinations isn't a real problem anymore
you realise that benchmark counts refuse all answers as 0% hall rate?
so opus being dumb and not even trying gets better score lmao
do you notice grok is best on there despite being one of worst models? guess why
where you get this from
Waaa waaa waaa
Opus gets a 36% score
Waaaa waaa waa
api tracker
if they release a 5.5 codex it'll be even better bro
....have you even read that benchmark?
😭
A model is good at benchmark people say its benchmaxxed, but if the model ain't top 1 in the benchmark people will say see its a proof its bad
all contradictory
every single time
“AA-Omniscience is a knowledge and hallucination benchmark that rewards accuracy, punishes bad guesses, and provides a comprehensive view of which models produce factually reliable outputs across different domains”
top1
in accuracy
where refusing everything actually matters
aka the BETTER benchmark
“AA-Omniscience Hallucination Rate measures how often the model answers incorrectly when it should have refused or admitted to not knowing the answer. It is defined as the proportion of incorrect answers out of all non-correct responses”
So 86% of the time it did not know something it hallucinated an answer
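The quoted definition can be sketched in a few lines. This is only an illustration of the formula as quoted (incorrect answers divided by all non-correct responses), not the benchmark's actual code. The counts are made up, and every non-correct response that is not an incorrect answer is lumped into `refused` as an assumption.

```python
def hallucination_rate(correct: int, incorrect: int, refused: int) -> float:
    """Incorrect answers as a share of all non-correct responses."""
    non_correct = incorrect + refused
    if non_correct == 0:
        return 0.0  # nothing was missed, so there was nothing to hallucinate on
    return incorrect / non_correct


def accuracy(correct: int, incorrect: int, refused: int) -> float:
    """Correct answers as a share of all questions asked."""
    total = correct + incorrect + refused
    return correct / total if total else 0.0


# Hypothetical counts: one model that mostly guesses when unsure,
# and one that refuses every single question.
guesser = dict(correct=100, incorrect=86, refused=14)
refuser = dict(correct=0, incorrect=0, refused=200)

for name, counts in (("guesser", guesser), ("refuser", refuser)):
    print(name, hallucination_rate(**counts), accuracy(**counts))
```

With these made-up numbers the always-refusing model gets a 0% hallucination rate but also 0% accuracy, so the two numbers only mean something when read together.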
Astonishing
printf ("i dont know");
gets perfect score on that
10iq
Exactly because that’s what it’s about
If a model just guesses 86% of the time it doesn’t know something
That’s kind of dangerous
I’d say
if model refuses 100% of the time when unsure
then its useless
That’s why they use search
Okay so tell me
Tell me this
Does it make it okay if you ask me about something important
And since I don’t know id make it up?
3+1= ?
a. around 4 but im not sure
b. i have no idea
This is mathematics
Models can do math easily
A more apt comparison would be
“Hey why is this line of code outputting an error?”
A. Random gibberish that’s completely nonsensical
B. I don’t know, but I’d need more info
Claude almost always responds with B
correct answer is
4, BUT ONLY if you mean base10
see?
humans guess too. you thought it was just 4, while there was actually missing context
Do you know what chatgpt told me
The reason my game was stuttering was due to vram
A game that takes less than 2 GBs of vram.
missing context
It then proceeded to gaslight me about it for 3 messages
Arma 3 experiences stutters when used with the proton compatibility layer for Linux
I asked Claude about it and we came to the conclusion it was just the compatibility layer acting up
Upon asking gpt with the same info it fed me nonsensical garbage
because claude is told to waste 200k tokens on research agents before doing anything
“Ah, I see! It’s the vram!”
“But Arma doesn’t need much vram.”
“Welll you never knowww”
exactly
how about give it full access to pc, tell it ALL CONTEXT, and tell it to test
I would rather use sticks to make a fire than let an ai control my pc
Absolute brain dead solution
you realise claude has no sandbox
while codex has sandbox
thats why
I use Claude in app
<@&1349916362595635286>
@echo aurora wakeup
Why the hell would I use codex for ts???

when ai ocr automod 🙏
autoban if see word "mrbeast"
What are you trying to do?
Ah yes grant codex access to my entire pc to diagnose why a game is stuttering
It's an image 🙁
ocr is cheap tho?
Automodding that word won't help sadly
all those scam images have word mrbeast in them
Sry what is ocr? I'm not familiar with that.
gpt won't hack you bro
optical character recognition
image -> text
Right but if I said Mrbeast here then it'd be deleted lol
Basically I was making a point that chatgpt hallucinates a lot
It told me that the reason my game was stuttering was vram shortages, which is entirely false because I specified
The exact fps and frame times I was getting
Proton versions I used
My specs
Linux distro
And what I tried
samsung gallery for example OCRs all images you take for easier search
I'll look into mod bots that can help with that.
OCR can be difficult unless you put in specific images for it to delete
Our current system used to catch these. Unclear why so many have been getting through in the last couple of months.
just make it detect word mrbeast
thats it
it doesnt even have to pattern match
Like I said, that'd delete everything that said mrbeast not just those images
name 1 actual purpose of someone sending 4 mrbeast images here
only scammers do that
Yea GPT does hallucinate a lot, but to be fair every big model does because they are so persistant on safety
Exactly
Not my point, if I type Mr Beast it would be deleted and if they have auto ban I'd be banned along with it
Actually i don't think it hallucinate that much you just have to give it the context and access of course if you want it to do in a full restricted sandbox i guess it'll hallucinate
If I sent an image of mr beasts name at any time I'd be banned or it'd get deleted
It'd be trouble
Yea thats true too
<@&1349916362595635286>
literally just ban those words
via ocr
its trivial
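A minimal sketch of the keyword check being argued about, assuming the OCR step has already turned the image into a string (no OCR library is called here). `BANNED_TERMS`, `normalize` and `is_flagged` are hypothetical names, not part of any real mod bot.

```python
import re

# Hypothetical term list; a real one would be maintained by the mods.
BANNED_TERMS = ("mrbeast",)


def normalize(text: str) -> str:
    """Lowercase the text and drop everything except letters and digits."""
    return re.sub(r"[^a-z0-9]", "", text.lower())


def is_flagged(ocr_text: str, banned=BANNED_TERMS) -> bool:
    """Return True if any banned term appears in the squashed OCR text."""
    squashed = normalize(ocr_text)
    return any(term in squashed for term in banned)


print(is_flagged("Mr. Beast GIVEAWAY - claim now"))  # spaced and punctuated spelling
print(is_flagged("just a screenshot of my code"))
```

Squashing out spaces and punctuation catches split-up spellings, but it can also match across word boundaries, and the check says nothing about intent. Running it only on text pulled from images, and not on typed messages, is one way around the objection that anyone who types the name would get deleted too.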
theres noone here who mentions crpt casins in any reason other than scam
Yeah but
We’re talking about diagnosing something simple
Claude sonnet had no problem
GPT was throwing stuff at the wall to see what sticks
I will NOT be giving an ai access to my pc
Ever
Looking into. Thanks for the rec
i never had modern claude nor gpt hallucinate.
but i never had gpt not listen to my instructions, whereas claude doesnt listen very often
Brother 💀 if you ban those and someone sends them, its going to automatically delete or ban them
I don't understand what you don't get when I say that
yes thats the point
noone is supposed to send scam
Dude, anyone can send a thing that says mr beast and has a twitter checkmark next to it not just a hacked account
anyone sends cryp toca sinos?
nope
only scammers do
Holy moly, you're missing the entire point here
bro if you send a scam you are supposed to get muted
future is deepthinking models on instant inference (hardwired silicon)
taalas hc2 for example
That will lower quality but it'd be good for everyday use
thinking is for the fact you can come up with idea then realise its wrong
1 month of qwen 3.6 27b outsmarts 30s of gpt 5.5 pro
and hardware scaling is ez with asic
Is it just me or qwen3-235b-a22b-thinking-2507 sucks on long context Story Writing?
I guess arena is really struggling.
They disabled file attachment support for Gemini flash.
just u, because nobody uses this model
Use 3.6 max
Ohh....
Oooohhhhh i see
Pretty Good Model
Disabling file attachment isnt a struggle thing, its the type of chat you're inside of
and model
You mean it's disabled for direct chat?
I didn't put any message in there yet.
^^^
It could be
Yeah. I was scrolling through the model selector.
here
Yea it doesnt have file attachments as you can see
ik
Kimi k2. 6 is the only decent model there that supports attachments. That and sonnet 4.6
Kimi 2.6 is one of the best AIs overall right now
Others have poor context accuracy.
Does anyone else think Microsoft is secretly building a super-powerful AI, maybe even on a quantum computer like majorana? I can't believe a company this massive, with all that data and money, can only come up with something as crappy as Copilot
No announcement? https://youtu.be/nDjMlfNbNNY
NOTE: Follow Arena on X to know when GPT 5.5 is available on Arena.ai: https://x.com/arena
OpenAI just dropped GPT 5.5 (codenamed "Spud") — their first new pre-trained model in a while. But how does it actually perform on real-world tasks? Peter Gostev, Arena's AI Capability Lead, puts it through its paces with visual coding challenges, long-...
No, because they are like google focusing on optimization
Hey everyone whats the best direct > Code. Model?
theres no way someone voted 4.7
Is glm 5.1 good?
ragebait
I... did
Cus its honest
and doesnt spy
It's okay. Not a significant upgrade.
4.6>4.5>>>4.7
lol funny cuz claude code was exposed for spying and training on browser data
They be picking favourites, lol.
Doesnt count
Y'all don't seem hyped about Deepseek at all.
Def not
Yes
Deepseek sucks
its literally 3.2 exp renamed to v4, confirmed
Qwen is better
Nope
1m context
1.5T model.
Way better context accuracy and awareness.
Y'all don't know sh1t
yeah.... no
Bro trusts arena scores.
bro im not talking about scores im talking about ITS SAME MODEL
they didnt, thats the point
😭
Ye, I dont
When I give it a huge file, it extracts data more accurately than Gemini 3.1 pro.
Gemini sucks
thats like putting gpt 5.2 and calling it 5.4 mini while doing 0 changes
Most models suck at Long context.
Claude dont.
Yes. Tried it with a 550k token text file.
Dsv4 is better. Tested it vs ai studio Gemini pro.
3.5 ?
Dunno which is latest
3.1*
Will DeepSeek V4 be released this week?
5
6
1
Yes, This week
<@&1349916362595635286>
The more you ping him isnt gonna make it any quicker, he already said he'd look into it
i pinged him to delete the dang scam
bro
^^^
I already pinged the mods role, theres no point in pinging one mod just ping the whole role
I've updated some of the automod triggers. We'll have to wait and see if it's effective. 🤞
Why no announcement
For the new vid
oh yeah
what it should do now pineapple?
gpt 5.5 still not in arena right
Who is better ?
7
12
1
GPT 5.5
it's not in the public api, so...
i mean, arena could get an api from openai but they would most likely wait until full api release
Qwen 3.6 27b > plus
yall cannot be this serious
@echo aurora Why qwen's 3.6 35b and 27b not release in lmarena?
prob high api cost
Sorry to say I won't be able to go into details about why specific model may or may not be there.
Yeah the alibaba prices is crazy 🤣
😂
No
Some ppl say
it will eat mythos for breakfast
didnt work?
nothing
<@&1349916362595635286>
no mute or nothing.
Hacked lol
Didn't work 🙁
😭
son
Another here
are u guys trying automod or sm?
these r image loggers or smth?
Yeah working on changing up our regex to try and catch these.
It used to work.
U gotta pay before like u can withdraw
So scam
Pineapple
ADD gpt 5.4 and gpt 5.5 pls
We'll be sure to put out announcements for new models added.
Fr?
and mythos is not releasing to the public
yeah
yep
Btw
5.5 HAS ENTERED THE ARENA
!
@slender ledge
^
oh yeah it did
Lol
Arena should make smth like paid u get opus and gpt in direct
Omgggggg
IT'S IN DIRECT
On direct
WHAAAAAATTTTT

Pineapple don’t delete it
New usage system isn't yet released
pineapple please take a look on my supabase idea
yall should add integration to supabase
Can you link?
why is there 5.5 but not 5.4 :/
Why not in announcements
Working on it
Designarena has this ^^
fr same xd
And it helps very much
"wow they forgot its not april fools day"
Wym
welp... Roblox is gonna be the first victim
I think you're confusing this with the current limitation systems we have in place
Wil u delete gpt 5.5?
Don’t pls
And add gpt 5.4
Where are you seeing this? Can you share a screenshot?
What credits
i was checking the frontend every day and i found that u added new lines
What I meant was like u can ask to integrate supabase so it handles backend and stuff.
but rn you'd have to make all supabase tables manually and give it an API key which is boooring ngl.
peter has blue eyes?

I'm not aware of the new usage system being "out" as an experiment, so I'm not sure what this refers to.
pineapple ofc this my only test when a new big model drops https://019dc0c1-6360-79cc-8212-9505e03b55b8.arena.site/
credit-system-m1 it is
What should I ask gpt 5.5
@surreal zephyr 5.5 high extremely good at frontend damn
really
Want to note 5.5 is going to be in Battle mode only
Noooooooooo
So get those prompts in quick
Yeah sorry to say. 😭
i'm not a pro coder to make a script that replaces that part
is there going to be an announcement when it gets released on battle mode?
Yeah prepping now
I'm glad it will be in the battle mode at all, thank you
will it be high or xhigh or smth?
RIP
lul
Wtf
Yes
worst arena day
atleast my pineapple website was done before it was gone 😭
i will use lmarena to make the credits thing system true
Worst Arena Day was when all good models were removed
👀
second part of the worst day
fr
we lost Chatgpt 5.5
It was obvious tho
they will keep going into battle till we get the credit system
yoo it worked
gotta hop on Mimo again
but no credits
-# not going in Qwen cuz QWEN SUCKS
his next action:
it would be cool to have token giveaways to premium models here on discord to incentivize user engagement.
like you participate in a poll and have a chance to win 20k tokens with some premium model.
in direct mode
LLM finally reached its limit. We didn't see any development. They took our money, LLM's are still behind from perfection and guess what.. I will wait the next 15 years to use any LLM (If possible).
Ai is not going to take people's jobs, But these people will take yours
What are you even saying 😹
They just announced its on API it wasnt
Stealth model is believable though
bro
@echo aurora hi , i hope you are doing well , is gpt 5.5 coming to arena?
bro it's already out
It was out before the openai announcement
was in direct for 4 mins and then BooooOoooooOm
Yes obviously, its been out since yesterday, not once did I say it wasnt out
What I meant is u said "might be coming to arena" but it was already out in Arena before you said it
It was working in direct and side by side for like 4 mins but now moved to Battle
gpt 5.5 is not in direct chat
It was
Sure as a stealth model, it couldn't have been the official model
It definitely was official
Or myb
I generated this
Try it out
guys
(Click the pineapple too btw)
Theres no point in making your words bold they dont do anything 😂
i have a site wich haves shared acc of chatgpt plus and 200$account
Literally couldnt have been because it hasnt been on official API until 20 minutes ago
do you think it's a joke? What did you achieve so far?
Saw it a few minutes ago. Sent a prompt. sOmeThinG wenT wrOng. Refreshed. Gone. Don't think we'll be seeing anything new from the big 4 in direct until arena starts a paywall
(non-accidentally, that is)
yra
not in direct 💔
Dude it came out like 30 seconds in arena before the api release
Many of things, this is what AI is doing for me currently, people make thousands off AI you're uneducated
Is pineapple the yt guy
Thats called a placeholder, know what that is?
No that's Peter.
They wont put a paywall lol so it will never be in direct
I seem to be stuck on generating in battle mode. Is this gpt 5.5 issue?
@echo aurora PLS ATLEAST HAVE GPT 5.5 IN DIRECT ITS THE CHEAPEST OPTION
Who is Peter
dudes not getting what hes saying
Its the same price as opus 4.7 what are you talking about
its more expensive than gpt 5.4 and even that isnt in arena
The guy in the vids
anything but some useful ai
Idk his specific role
I live in Syria, the website isn't working for me.
PUT OPUS 4.7 UP THEN
But probably a staff at arena
Added now
why 5.5 not on leaderboard
HOW would that make any sense
i mean direct chat
after some minutes
Yea he said 5.4 isnt not 5.5
oh mb
I miss old arena where you could talk to the model in direct mode
Yo am I the only one seeing models that don't exist on arena xd
Holy moly people must be rage baiting with those votes 😭
how's gpt5.5 guys
Stop bro
couldn't even test gpt
Make me
Just battle.
guys are u seeing my mssges
I've been seeing stuff on twitter about it. It is much better than anything we've had yet
I don't waste my time with whiners.
this is straight ass if u havent made a internal with grok or gemini then u not tapped in with vgk bypass also
i just uploaded a link were u can have access to chatgpt 5.5
🍑
ban this guy
Yea ban tonileni
ah yes it was deleted by moderators
This is a UI, nobody said anything about backend quality lmao
We're taking care of it
whyyyyy lil
Ok
lol
you share links Suspects
Qwen, Deep seek (New release), Mimo or any other model you know
Yo you shared your email
who is better gng
What
When we are going to use gpt 5.5 in side by side 😞
Oopsiesi
When will gpt 5.5 go on the leaderboard mkay?
Gotchu
lol you made me laugh so hard that my eye balls are coming out, already guys like you who debate and defend are most uneducated, stop showing your fake acheivement, i have already seen many times like your crap. In real life project its dumb. You are a hacker i see. Then lets see if you can get backend information from ai and get api token to use Ai for free. If you can do this, i will believe your statement
Thanks lol
I don't have an ETA to share, but we'll be sure to post an announcement when ready.
I live in Syria, the website isn't working for me.
Also @loud herald what even was you doing was that some force model extension
ok
pineapple, first generation with arena gpt 5.5 high (Click the Pineapple for surprises) https://019dc0c1-6360-79cc-8212-9505e03b55b8.arena.site/
Hey everyone, I want to ask, what free models can be used in the CLI? You used to be able to use qwen3.6, but not anymore
bro is truly speed
Lol the night mode
Xd
Nobody said they're a hacker, why do you assume so much? This is a game exploit an external memory reader and spoofcall
u flexing some as ui
There's make "Pina" dance too
btw pineapple why did u remove my prediction
so did you make this or someone else?
No one is flexing, all I did was tell him I'm doing something right with it because he believes AI has no purpose lmao
AI
Wdym
what is even happening at this point
i said that did you use ai or you are using somones else image buddy
and people say opus is better at ui xD
Fr
Oh no, its an external aimbot/esp that I am attempting to make mb I couldnt read this correctly for some reason
O
Brother I just told you its AI, AI make the UI and Backend for me
It's possible that your IP address is currently being blocked by our systems. These blocks are applied automatically based on internal safeguards we have in place that are based on a variety of internal criteria designed to protect the reliability and security of our services. Unfortunately, we’re not able to share additional details about the specific reasons for the block, nor are we able to manually override or remove it at this time. We appreciate your understanding and apologize for any inconvenience this may cause.
Would it be allowed to use a VPN in someones case like that?
sounds crazy
Its not but its better for sure, they just decided to hype tf out of it
Hi @echo aurora , why were the fresh versions of Claude, Gemini, and GPT removed from the site? And will they ever come back?
I'd check with our Terms of Use. I won't be able to provide interpretations of that.
but you said i am uneducated, but then you are. Its just UI. Its just a waterless bottle. Solve the real life problem, solve upwork project then say ai is a real deal unless no matter what people say we know it
deep seek v4 and gpt 5.5 both hyped
Alright
I thought deepseek would be better than it was, unfortunate
Gentle reminder @loud herald @slender thistle of one of our server rules:
Treat others with Respect. Be kind, assume good intent from others, and keep disagreements respectful.
they get beat by a random model btw
I literally said in the message you just replied to, the AI made the UI and Backend for me
Understood
Which random model is beating them
@echo aurora ?
Mimo
Recently some models have been removed from Direct and Side by Side mode. This was done to help ensure reliability and availability of Arena in the long term.
Dang didn't expect gpt 5.5 to be 5 dollars more expensive than Claude
30 dollar output
It is good but you need to specify what you want it to do
@surreal zephyr look at this guy
Is it a high chance of getting 5.5 in battle
Guy decided to call for backup
And backup for what exactly?
But gpt 5.5 is better in my experience though
same guy whos flexing his ass looking ui btw ^^
looks like I already do ig..
how long do we think until we get a preliminary arena score for 5.5?
You're pinging the GPT glazer because someone said opus is better 😂 thats just starting an argument
Same
I could've pinged him for various reasons and that's your pick?
Absolutely zero correlation, you're spewing nonsense because you have no base to speak on
I guess we can't change Claude fans opinions
100%, but theres no shame in it
Noire thinks it is
If you like a model then you like a model
Yea so
Idk why it matters to others what you do and dont like
For me it was worse no glaze
That hurts your feelings?
Who said that?
https://i.snipboard.io/AIrtnY.jpg
😢 😢 😢 stuck
Its actually wild how much people assume on this
same guy who flexed his ass fake ui exploit btw ^^
slow ahh guy
Sorry to hear that! Give these steps a try: https://help.arena.ai/articles/8691588590-troubleshooting-infinite-generation
well nice, but it definitely hurts noires feelings
Idkkkk mannn you arent following the rulessss!!!
well atleast i dont create exploits
thats against the rules too
hm
No its not LOL
both the server and the game youre doing it for
I wouldnt call myself a glazer
ok lets exploit on games and ruin it
It just depends on what tasks you use it
lmao
Where in the server does it say I cant talk about AI making game exploits? maybe I need to read through it again
Opus is bad in it's own ways
how can i gen ai videos tho
Gpt is too
skill issue
TUFFFF
opus 4.7 extended fails this
For me it was bad
i thought there s a feature in this server for us to generate ai videos
and much better at prompt adherence and half cheaper
Not needed argument*
We know son
what if mythos was just a fake theory created by anthropic
💀
For what?
What if mythos never existed
Mythos being breached tells you how real it is
What if mythos was just a man that pretended to be ai
yet 5.5 is mythos level but beats it in some points too
And how bad of a system it can actually make
spud mogs
Firefox just fixed bugs with him
like your game exploit
LMAO
that's a nice example
your game exploit
fr
talk about the real coding
not hallucinations
nobody gaf about hallucinations
use another AI?
Is it that hard?
Ok
I want to use jipiti 5.5
not for everything.
Hallucinations give incorrect information and could ruin a project of yours
I generally use Gemini flash for tests
same
ai has purpose when u learn to make it use its purpose
Speaking of hallucinations
and then you realized you got charged of 50000$
💀
Just use sonnet..
Sonnet doesn't understand Russian well
Oh, fair
I wonder when gpt 5.5 codex
same for my language
watch how claude users will call me worshipping sam altman in the next 3 minutes
same thing jalapeno did yesterday and leaked his age but ig hes good
and the random kid
Ok sam
but let's not talk about it again.
yeah youre gonna say I worship sam altman 💀
ok?
Idgaf man
If opus 4.8 comes out it will be more expensive probably
your decision
See
Exact point.
All anthropic people r the same
they're gonna say I worship sam
Lol
No bro you don't worship him
Leha said so.
Chatgpt 6.7
Can they just release like an open source model, that's both better than opus 4.7 and gpt 5.5
dude idgaf
Qwen3.6 27b no thinking>>
Some people say opus 4.7 is worse than opus 4.6 in some aspects
Apparently degradation
Mainly 50% more expensive yeah
Well they had atleast
Not anymore
gemini 3.1 pro > gpt 5.5
They had sora well it's gone
oMg @echo aurora wHy iSN't gPT-5.5 iN diReCT cHaT?
does anthropic have?
Because sam Altman said it's too elite to be used for free
dang it man
guess ill just never use arena again!!!
someone tell me when they're giving away free money!
Goodbye...
Hopeful the new usage system will be able to help with this!
I hope it doesn't make it worse at least
The new usage pineapple system
i hope it will have a dupe glitch
It won't.
@echo aurora
Don't spam pls
You worship Claude!1!!1!1!1
Can we get this guy muted?
In Pineapple we Trust.
Claude is opus
They are very Beta Males
Claude is Slopus
Pop Us
Its weird, claude is like the base name if you want to say any model like sonnet opus or haiku
Kk this is becoming unproductive, going to ask to move on from this convo.
Good One
Do people even use haiku?
and tell these people not to call me a sam altman worshipper
can i have the beta test pls'
Sometimes
Dude honestly no clue LOL, I have legit never seen any benchmarks or reviews on haiku
I've never used it myself
I used Haiku like 5 mins ago lmao
Hi Mr. VIP
Dang
g
haiku sounds anime
Can chatgpt 5.5 fix this?
I just had it unzip a .zip and give me all the contexts from the files bc I needed it in text and didn't want to write a script for it
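For reference, the script in question can be quite short. A sketch using only the standard library, assuming the files are UTF-8 text; `zip_to_text` and the example path are hypothetical names.

```python
import zipfile


def zip_to_text(zip_path) -> str:
    """Concatenate the contents of every file in a zip into one string.

    Each file is preceded by a header line with its name. Bytes that are
    not valid UTF-8 are replaced instead of raising an error.
    """
    parts = []
    with zipfile.ZipFile(zip_path) as archive:
        for info in archive.infolist():
            if info.is_dir():
                continue  # skip folder entries, they have no content
            text = archive.read(info).decode("utf-8", errors="replace")
            parts.append(f"===== {info.filename} =====\n{text}")
    return "\n\n".join(parts)


# Example call (hypothetical path):
# print(zip_to_text("mod_files.zip"))
```

Binary files will come out as garbage with this approach, so for a mixed archive it would make sense to filter by extension first.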
Just like Waifu
I see
just like ai anoscranel tuff edition 100% 🔥 🔥 🔥 🔥
Also i would've used sonnet but ...
im about to have no more claude 😭
But then pineapple attacks aianos cranel
Yea uh unfortunate
bro is paying for invisible ram
NOOOOOO
MY SOCIAL CREDIT
It's okay.
WE'RE SO BACK
