#general
1 messages · Page 264 of 1
Yeah, it sucks to see it go
/Cinematic medium shot of a friendly, professional Egyptian male doctor in his late 40s wearing a white lab coat. He speaks directly to the camera with realistic lip-syncing and authoritative hand gestures. The camera performs a slow, dramatic zoom-in from his chest to a tight close-up of his face. Set in a professional medical office with soft, high-key lighting and a blurred clinical bokeh background. The atmosphere is warm, expert, and reassuring. Ultra-realistic textures, 4k, cinematic composition.
Any altr for gpt4-o??
hey im back i was doing smth else
one sentence prompt = "high effort" ok bud
also nano banana and gemini are the same thing, nano banana uses gemini
thats the point im making
go find one
shittiest model i've ever talked to, other than maybe grok 4.1
its just so embarassing man
i get like second hand cringe from talking to it
Finding duhh
yeah
why would you need an alternative to 4o
Pretty normal for amateurs
never understood the demand for that model
4o was worse on every single metric than 5
sucked at code, i remember having to prompt it again and again
now codex and other tools are competitive w/ claude and gemini which is nice
I mean, you say whatever anybody could say whatever but reality is ChatGPT 4o is a fan favorite
yeah, why
It’s just a great model overall
i mean no we've established this, it's terrible for any practical utility
Cinematic medium shot of a friendly, professional Egyptian male doctor in his late 40s wearing a white lab coat. He speaks directly to the camera with realistic lip-syncing and authoritative hand gestures. The camera performs a slow, dramatic zoom-in from his chest to a tight close-up of his face. Set in a professional medical office with soft, high-key lighting and a blurred clinical bokeh background. The atmosphere is warm, expert, and reassuring. Ultra-realistic textures, 4k, cinematic composition.
they've pulled it from the leaderboards now ofc
so i cant show you, but it lagged pretty far behind the 5 series
Yeah, but if you look at how people use ChatGPT, are there any of them use it for utility?
on, again, EVERYTHING
like 70% of people
the biggest usage is cheating on homework
yeah so most people use it for practical guidance about topics, seeking information, and writing
which are all things 4o has been heavily superseded in
The giant new 62-page research paper, done in collaboration with Duke and Harvard universities and which is available online, found that 73% of all conversations during June 2025 were for non-work reasons, in contrast to just 27% being for work reasons during that time period.
It’s hard to explain
Cause there’s two type of fundamental camps here
interesting
bro whats going on my chats aren't loading
turn your pc on and off again
Look at those leaps
It’s one of those things it’s not benchmark markable what made it special
yeah
There was just something about it. That was really unique.
fr fr
I think that it was simple to use and it wasn’t over moderated
It wasn’t as rigid and structured as it is nowadays
& it was a lil crazy 🤪
fr
hello, I'm ghufran and i wanna generate video for lyrics video
And so what were you trying to say by that ?
Why in battle mode i keep getting the same models over and over again
Gary Marcus is a cognitive scientist, author, and longtime AI skeptic.
Gary joins Big Technology to discuss why large‑language‑model scaling is running into a wall. Tune in to hear a frank debate on the limits of “just add GPUs,” the promise of neuro‑symbolic hybrids, and what that means for the next wave of AI.
We also cover data‑...
the title "are we at the end of ai progress" is wrong just with this graph that literally show the progress over model
and we don't even need graph just look at the result, we have more capable model.
Because its psychological mental problem - some people needed a model that agreed on everything no matter what
and its dangerous
They literally call 5.2 "abusive" and "too assertive " in their delusions
For those people its not even about capabilities, just that the model won't agree with them blindly
when actually ai are progressing in real capabilities people are judging it only that way, they want someting more human, more friendly, and they don't even realise how dangerous it is
to talk with model that act like that
Creating addiction, psychosis, isolation
"No the earth is NOT flat"
5.2 bad bring 4o back!!1!
😭
Its not because people like something that its really good
many people fell in love with ai, and at some point when you tell them its ai they get mad
and those same ai that manipulated them into talking longer with them are getting smarter
and people are HAPPY with it
and they want more of it
Also we cannot talk about progress when in reality you were talking about the way the model act with you
the model did progressed in everything in fact
but the companies stopped making it act this way
but it doesn't make it less capable
It’s a double edge sword
Although model could be capable and could have progressed dramatically, if people don’t like how it feels or it’s usability, I might as well be the world’s creepiest model
That’s why a lot of users they look at the benchmarks and they realize that the results are only on paper
That they’re not applicable in the real world because that’s not how most users use AI when they interact with it
These bed marks only apply to academics in research, researchers and enthusiasts
Your average npc normie
All he cares about is being able to generate an image on nano banana lol
The benchmark could not reflect the real progression of the model
doesn't mean it didn't progressed
Of course of progress no one saying that
It’s one of those things that you get diminishing returns for your investment the further up you scale
It becomes more expensive for not that much gains
You know i really either don't get what your saying or what your saying just make no sense at all
It’s all good bro lol
what does 4o could do, that ai right now can't do
I don’t know I kind of don’t really use it ever since 4o is gone maybe for image generation or looking up a quick fact here there
Mainly, what I do is just image generation
so how can you say we're not really progressing
I never said we’re not progressing. I said we are progressing
Oh, we’re at the point of scaling where we’re getting diminishing returns for the amount will we invest?
Yes, investment are bigger than what they get in return
Cause they're betting on the future and what it Could give in the future
not saying they're right or wrong
i don't know
Cause scaling was in fact something that couldn't last forever
at some point they need to do aggressive optimization like the small team
to keep progressing
how do you think small team are creating model almost as good as those companies that are scaling so much
Like I said, there’s two fundamental technical groups and they have two different philosophical ways of looking at the current problem
so let's wait for them to have no choice but to do both
Yeah, we just gotta be patient. See what happens.
No one knows for certain that’s for sure
I think a lot of recent open source releases should also be taken with a grain of salt, in the sense that raw benchmark performance doesn't always translate to equivalent knowledge and intelligence to frontier lab models.
why would they let themselves "die" when in reality they can do both
oh yes benchmark could not reflect real performance, yet we can say the latest open source model are really great
Diminishing Returns of Scaling]: While earlier models showed dramatic improvements with scaling, GPT-5's advancements over GPT-4 are more subtle, indicating diminishing returns and the need for formal measures to demonstrate progress
Cause when you start to scale you can easily do a x100 in term of productivity, once your at x100 to do another x100 would be much more costly if its even doable
They absolutely have been closing the gap, but it'd be naive to say that open source is reaching near-frontier performance as of now. It's getting close, but there's still a lot of room for improvement, specifically in regards of raw knowledge. Frontier labs simply have so much more data to throw at these models that smaller companies can't compete with. But this is starting to change as we can see with models like Kimi K2.5.
Automation]: A Washington Post study revealed that AI systems can only perform about 2.5% of the tasks people expect them to, suggesting that investments in AI chips may not be justified by actual capabilities.
you can try it easily yourself, give the same prompt to a frontier model and one of the lastest open source model and see the result
a prompt you made yourself
it can't be faked
and i did that already and the result are good
This is a naive way to test a model
Just because a few of your prompts worked doesn't make it frontier level
No, you can compare on the same task
and now that's pessimistic of you, without proof
I agree with it, but for now
The proof is in verified knowledge benchmarks that test AI models on niche topics and measure accuracy, such as AA-Omniscience Accuracy.
So earlier you were saying benchmark do not reflect real performance and now your saying to go check these benchmark to see if its real or not
Yeah, it is a double it short because of people didn’t really believe in it. I guess they wouldn’t invest in it.
The fact that people are pouring the money into it clearly says that they believe in the technology
So it’s hard to say, dude
I say to take benchmarks with a grain of salt, but that doesn't make them useless. When you try your own prompts on models, if you want to test for knowledge, you need to go in with niche topics or specific knowledge to test the outer limits of the model's knoweldge.
Frontier models tend to win in those scenarios.
You can try specific scenario using your prompt, if it give good result in those scenario it mean its good for those scenario yes
I don’t like having these debates because this is what the benchmarks are supposed to solve lol
If the benchmark were legit and nobody questioned them, and they really had credibility, we wouldn’t be having these discussions
the real answer is to look at both and do your own opinion on it
you can tell easily which one is smarter in fact
They’re all the same to me, dude to be honest
I can’t tell the difference between one or the other
Only for imaging video generation cause I could see it with my eyes
It depends on how you use it, in some scenario you can clearly tell the difference
Well, right now he’s Gemini because you could upload images to it lol
Although I’d like to use ChatGPT, but there’s a rate limit lol
I guess my decision is more economical
Solely based on finances lol
That's your choice of course
Well, it’s just what I could afford
I mean, I’d love to be able to afford more and get better and bigger models. I guess you could say but it’s not realistic.
I consistently test new models on niche knowledge prompts that I have an entire library of. For nearly every open source model that releases, a frontier model will beat it in knowledge.
It does not mean the open source model is worse in intelligence - it just means the model is less knowledgable. It simply doesn't have as much data as a frontier model.
Of course that's why i said it nearly is as good as frontier model, frontier model remain better
but it doesn't mean it won't ever happen that an opensource beat a frontier model
at some point
Not saying it wont ever happen, but there's still a very meaningful gap between open source and frontier models. Claude and GPT models still consistently beat open source alternatives in both intelligence and raw knowledge.
It's not "very close" - there's still a lot of room to improve.
Not to mention, many of these open source models are optimized for common benchmarks like SWE-Bench and HLE
it depends on which model we're talking about to compare with which one and also on your own opinion of what's "a lot of room to improve"
The current agreed-upon top open-source model, GLM-5, still doesn't come close to the level of knowledge of models like Claude Opus 4.5 or GPT 5.2. Like I said earlier - these frontier labs have ridiculous amounts of data that open source labs do not.
we can tell that sadly the big companies doesn't try to do as much optimization, that do the small team i am talking about, and we can prove that those optimization can lead at some point to a real benefits and may make the model from the small team even better than the one from those big companies that are mostly only using raw power to improve their model when we compare them to the small team
it doesn't make sense
but it's what's happening
until they finally decide to also do the optimizations like they should
Do these optimizations only have benefits to a certain training point
basically they accelerate the training process, reduce the cost of training and many other improvements
optimizations can be for everything
I think you're over-exaggerating these optimizations open-source labs are making. There are real benefits to optimizations like DeepSeek sparse attention and certain MoE architectures, but they have their drawbacks and most frontier labs have already picked them up.
Maybe once vc money runs out
Then let me explain why im saying it, take someone that have 100x your power to do a reinforcement learning, how can you explain the gap to not be bigger than what it is with "you" (small team)
China just kicked off a new phase in the AI race. ByteDance launched Doubao 2.0 right before Lunar New Year as a full agentic system designed for real-world tasks, Alibaba responded with Qwen 3.5 and a massive $400 million incentive push, DeepSeek continues to loom after last year’s surprise takeover, and Google DeepMind unveiled Aletheia, an ...
the explanation is that "you" (the small team) are doing optimisations that have a very big impact on what's achievable with less power
now imagine the big companies doing it and using those optimisations with their BIG power
that'll be the best way to improve their model and by far
the small team won't have a chance anymore
cause they still have 100x the power
Design a modern minimalist logo with the text "SPPG SARBINI MULYOAGUNG".
Use a clean sans-serif font, bold and professional look.
Incorporate abstract elements symbolizing growth, community, and progress.
Color palette: deep blue and gold.
Flat design, vector style, high resolution, white background, centered composition.
Lots of open-source models have been using test-time scaling, which is definitely a real optimization, as well as MoE with low activated parameter counts but like I mentioned, these have drawbacks. Not to mention, there are constantly rumors that these open source models are training off larger frontier models.
I gave you the explanation of why i was saying this, now if i listen to you i should start listening to "rumors" and tell me the drawback exactly please to see if its really a problem or not
also the optimizations are not stopping
every time new optimizations come out
cause theres always a better way to do
The biggest drawback of MoE is the routing architecture. MoE is only as good as the router behind it - if a token is routed to the wrong expert, you're going to get worse responses.
It's why dense models (or models with bigger activated parameter counts) tend to feel more knowledgable.
I see that's a real problem, but is it impossible to solve ?
Not impossible, but incredibly difficult as of now.
when there is problem you can't just basically stop at this point, people try to solve problem
and we always find new and better way
to achieve things
The thing is, you're approaching this with the idea that "open-source is always optimizing". But you do realize that frontier labs are always optimizing too, right?
its all about trying then failing again and again and finding better solution then trying to solve the next problem that's just normal and how it work
Not as much or the gap would be really larger between both
that's why im saying this
you can't explain the gap to be that small when they have that much power
comparing to the others
It is true that frontier labs have been getting comfortable, and the recent open-source releases have been pushing these labs to release more. Anthropic dropped Claude Opus 4.6 and Sonnet 4.6 back-to-back for the past two months to maintain their lead, and OpenAI dropped GPT-5.3-Codex. These labs are still pulling punches as far as we know.
Gemini also hasn't released in a while besides deep thinking - they're likely preparing a new model.
Yes i really hope they'll start focusing more on those optimizations too
that'll be a benefits for everyone
its a race and getting comfortable is such a bad idea.
But if they don't, then in that case i'll expect a model from a small team to surpass them
at some point.
that's what i was saying
earlier
cause scaling have in fact limitations and are very costly at some point
while optimisations can be done forever
and always bring benefits
Optimizations only last as long as the LLM architecture is useful. For future developments, and hopefully eventually AGI, many experts predict that LLMs likely aren't going to bring us there unless we find some major changes to the underlying mechanics.
Scaling does show diminishing returns after some point, but we still don't know what, say, a 2T parameter LLM would look like. There's no way to tell if scaling is actually going to hit a wall right now.
This might be right, but for now its not proven and we're still seeing very great evolution in smartness
ohhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhh
4.2???????????????????
GROK 4.2??????????????????????
It is proven. We're seeing smaller jumps in intelligence as compared to the early generation GPT 3.5 -> GPT 4, or GPT 4 -> o1.
We haven't seen major jumps in intelligence for quite a while
cause at this start we could have explain it by them getting x10, x20 power so of course the gap will be larger
but now that they do have the big power in place
the gap is smaller
I think it's also worth noting that many of these massive AI datacenters that are being invested in aren't even close to being complete in development yet. It's all future bets.
How good is sonnet 4.6
Yes once they'll be ready it'll be a huge increase in term of learning speed
It's pretty much an incremental upgrade to Sonnet 4.5 like you'd expect, but the price increase due to verbosity isn't ideal
learning speed increasing = better model in less time
Is it in your opinion better value than opus 4.6
For sure
You'll be paying less overall, but it's really up to personal preference honestly
It's too early to say if using one is definitively better than the other
It does get close to Opus-level performance though
From what i saw its doing very great looking results
Wait I looked at Claude website and it said costs are same as 4.5
Is it more expensive in token usage or something?
Elon is saying grok 4.2 beta will be removed next month and it'll be much better next month
we'll see but i don't have much hope
Design a modern minimalist logo with the text "SPPG SARBINI MULYOAGUNG".
Use a clean sans-serif font, bold and professional look.
Incorporate abstract elements symbolizing growth, community, and progress.
Color palette: deep blue and gold.
Flat design, vector style, high resolution, white background, centered composition.
Hii I built a structured AI roadmap for this exact problem.
The biggest mistake people make is learning tools randomly.
You need sequence: Models → Prompt Design → Memory → Chains → Agents → Deployment.
https://payhip.com/b/xT2ym
https://poe.com/OmniLabs bro? this guy has many ai models that he hosts on poe for free. legit?
I don't know how hes paying for so much people though. And the owner even gave me free gpt-5.2 for joining their discord server and gpt-5.2 is really expensive so um thats sus... do you think the person is tracking message data?
Chat with the best AI, privately or in a group chat. Explore GPT-5, Claude-Sonnet-4.5, DeepSeek-R1, Veo-3.1, Sora-2, and thousands of others, all on Poe.
Crea un video en alta resolución, que esta persona esté fijamente mirando la cámara con parpadeos muy leves
Legit I think though you're putting it in a server with so much people if there are too much people using it maybe the person will cut it off
Hello, I am a user of the Image Arena.ai. According to Section 3.1 of your terms, I am the owner of the "Output" I generate. However, Section 4 mentions a restriction on "commercially exploiting" the Output.
Can you clarify if I am allowed to use an image I generated from Arena.ai(using the Nano Banana model) for my own Commercial purposes. Because Google nanobanana allowed commercial purposes but in your terms and conditions,I am unable to find a suitable answer
Ya know, the thinking variants of the models usually come out pretty quick. So i gotta ask. The sonnet 4.6 model aint got no thinking model.
Is it like a whole thing or....?
it do have a thinking mode on anthropic website for example
the thinking version will be on arena soon i guess
and the thinking version usually give much better results honestly
I know. The writing i feel is better on arena than their app. Idk feels longer in details and stuff. Probably becauae i can use thinking there but not on on rhe app
Anyways i asked because when the previois anthropic models that came out they shortly followed with the thinking model. But i saw that sonnet 4.6 is taking its time
No rush tho
Is sonnet 4.6 better storytelling model
_Hombre sentado en una silla gamer negra en estudio moderno con iluminación LED azul. Micrófono profesional suspendido frente a él. Lleva audífonos tipo estudio y sudadera negra con detalles blancos.
Comienza mirando directamente a la cámara con expresión neutra, relajada y segura. Postura recta pero natural.
Durante 60 segundos realiza movimientos orgánicos y fluidos: respiración suave visible en el pecho, pequeños ajustes casi imperceptibles en la espalda y hombros, ligeros movimientos naturales de cabeza de izquierda a centro y de centro a derecha muy sutiles.
Parpadeos naturales cada 3 a 6 segundos, variando el ritmo. Micro expresiones leves en el rostro como cambios suaves en la mirada, ligera tensión y relajación de los labios, pequeños gestos faciales realistas.
En algunos momentos baja levemente la mirada por un segundo y vuelve a la cámara de forma natural. Hace un micro asentimiento casi imperceptible a mitad del video.
Hacia los últimos 5 segundos vuelve gradualmente a la posición inicial: mirada fija a cámara, expresión neutra y postura estable, igual al comienzo.
Iluminación cinematográfica suave, fondo desenfocado con luces LED azules. Cámara fija en plano medio. Estilo ultra realista 4K. Movimiento continuo, sin pausas rígidas, sin congelamientos._
Anyone texted qwens latest model yet?
Deepseek prrvably be good at story writing
Anybody answe it?
Hey please can somebody tell me that how can I generate video with the specific Ai model like google Ai 3.1 quality audio beta
I just noticed, do we need to login in arena/canaryarena in order to use models with file attachment support?
bro i hope chatgpt 4o will return like fr
there is no way they just retired the unique model
I am peacefully protesting to bring back the no-login attachment file in arena
what'd you use? image or pdf?
nah just text
chat workaround for nb pro female prompt block: just translate the prompt to a foreign language!!!
look it worked!
the hell
yeah 49 minutes
what model did u use
claude sonnet 4.6
legit comment or ai picture
that explains it lol
why?
usually claude models give out less limits
WTF
NO WY
NO WAY...
yeah nah i ain't doing a model that gives less limits
@languid crescent recommend me something
I usually use Max
but ig go wit whatever's number 1 in leaderboards...
fr
well damn, #1 is claude 4.6 lmao
ay @keen beacon did u try attaching pdf file when you're not logged on?
hmm kk
is deepseek better
imma just use 4.1
Seasoned Fullstack Developer here with 8+ years building killer web and mobile apps—think slick React UIs, robust Node.js/Express backends and bulletproof databases like MongoDB or PostgreSQL.
Plus, I'm deep into AI/ML: Custom TensorFlow/PyTorch models, LLM deployments with Hugging Face & OpenAI. Chatbots, predictions, smart automation—you name it.
My toolkit:
- Fullstack: MERN/PERN, Docker, AWS/GCP
- Mobile: React Native, Flutter, native Swift/Android
- AI: NLP, vision, GPT fine-tuning
- Track record: 20+ live projects, scaled to 1M+ users
- Faves: Next.js, Python, Kubernetes, LangChain
Fair rates, fast delivery, clean code.
Feel free to contact me for past work or a quick chat!
now i kinda get it
too many people must been abusing the 4o model thats why
im just using a storytelling for fun
hi
Hii pls help someone
how do
idk
i've been seeing some twitter posts with ai videos
after the model was retired
Happy
With 4o go in your experiences which is better for language and storytelling
Full free version of TradingView Premium (Windows + MacOs) with updates: https://www.reddit.com/r/TradingVievStock/comments/1qcmgir/
GPT 4o it's not available 🥺
fr
It's definitive now, right?
Please i forgot the command to creat videos and images i used #general nothink happens
Anyone knows the difference between using search models and just text models?
the search model search the web
the text models don't
I think seed 2.0 dola preview is good at writting
better than this claud sonnet 4.6 creativity
Ultra-realistic cinematic motion derived from still image. Add natural, dynamic camera movement and lifelike motion to subjects. Smooth transitions, realistic physics and lighting. Subtle parallax and environmental motion such as wind, light flicker, fabric movement, and breathing. Preserve original image composition and details in 4K quality. Highly detailed textures, volumetric lighting, depth of field, natural shadows, and cinematic atmosphere.
Is it worth it using the search models for like coding or stuff
.
i don't really know as i don't use them for coding. maybe though.
Alright still thanks though
Why would you use ai for roleplay
Lot of people use ai for roleplay
I use it for SFW
Talking with your favorite character is kinda eh, fun. right
Minos prime
Gabriel
from ultrakill
#video-arena-2 animate this image
guys
Okay you got a point
is claude sonnet 4.6 has limit
I don't think it has any
ok yeah it does have a limit i forgot
wdym..
i have 40 minutes
after a few
40 minutes limit.
does opus have the longest limit
I think it's the same honestly
fk
4.6 Opus thinking has limits but
Sorry guys i forgot the invite command to creat videos please help me
is grok fast chat good
Yeah I guess
they limit to approximatively 10 minute
Just use Gemini I guess
so if it thinks for more time it will crash
which version
Flash 3 or pro for whatever you're using it for
Coding? Writing? Homework?
Wdym?
Ohh alright
if you want like the cheapest but still capable model you should look into open source model
wait gemini is error ?
Yeah....
GPT sucks anyway.
how do you fix the overthinking issue
gemini 2.5 pro is stuck on "generating.."
Luckily it's dola seed 2.0 preview
so this model
Hallucinating and overconfident
It claims to make 100% cannon character but then it gets some information wrong.
Quality of writting is awesome
We know. But 4o is leagues better than in storywriting
Just use claude or Gemini 2.5. Way superior at storytelling than even the best version of GPT.
am i only i got this error?
a3:"Failed after 3 attempts. Last error: Service Unavailable"
...is grok has some problem?
Can it actually match the vibes 4o gives? Slangs. Yeah I have no shame sharing what these are given its implications.
is this photo is made of gpt 4o?
I find gemini to be too repetitive for storytelling/roleplay, wayyy slower too. Will miss gpt o4
What?
NVM
bring gpt 4o back broo
@grand cliff BRING IT BACK
YOU'RE THE ONLY HOPE..
What?
well only openai can back the 4o
well
FIND A WAY.
i mean this photo that you made script story
gemini flash 3 sounds good ig
im not gonna lie
that one was so good
Yeah. 4o
Through and through
What's about 4o?
dawg you won't understand
im not a typical ai guy but man
that model is a unique
even im not unlike the others, i was just using the model for storytelling for fun
well i think it's was good why i didn't use it (i Mainly used gemini 2.5 gpt 5 and etc)
gemini 3 flash is good enough
Oh u mean it's like a good model for roleplay
hell yeah
one of the best
Everything
i already finished at least 500 prompts about some anime stuff
Sameeeee
HELL NAH
Did something happen there or smth
Less anime. More what ifs based on the things I see
why i cant generate for example "John Wick pistol fight scene in futuristic city"?
nah im just joking
but is it worth it though
Never going back there.
I wonder what did they do to you there.
Well.
- Ads. Too many ads.
- Wait time
- Extreme restrictions.
- Repetitive words. Possessively, walks, Can I ask you a question.
- Low memory
Well damn.
Seed 2.0 has a sycophancy + hallucination problem in creative/lore writing. Its thinking mode actually showed it knew the correct canon information, but it overrode it to match what the user asked for, then claimed the result was 100% accurate. Tested across Dragon Ball and Jujutsu Kaisen same pattern every time. Strong writer, but dangerously overconfident on factual claims. (Yes I used claude for this kind of response)
I wonder what ai they're even using there
4o was very meh
seed 2.0
WTF
So yeah. I was super excited and happy that I could use 4o again in arena
And now its gone
nah it's not meh
thats your opinion but for me
nah its the best
Which AI lab created the best downloadable model under 6GB size?
3
9
5
Meta (Llama)
guys please, why i cant generate some pistol fight scenes (images/videos)?
I assume chatgpt thought it was too outdated
Not even close but it depends on what you do with it
rip..
I like how easy 4o is to bypass
Which translates to "Need more money. Gonna retire old models despite backlash"
Exactly
Even if you rephrase your words
true
gooners loved 4o
Dola seed 2.0 exists in lmarena
It would understand what you actually meant
some abusers goes too far while using it
bruh screw them
Dola?
I remember back then chatgpt 4 was popular just because of image generating or whatever
Now we have millions of unknown ai models
I can remember that
fr
Maybe even voice
4o was so better...
OpenAI reached its peak when GPT 4 came out because at the time it was ground breaking, as well as sora 2
Deepseek is good. Real good. But it is riddled with so many restrictions, especially if you badmouth its country
"This is beyond my knowledge" blardy blar
Is deepseek is a Chinese model?
yup
That's the point.
I’d hope so knowing it’s a video model
no
it's a text model
There's a difference.
???
Seedance 2.0
"Taiwan is a country" would just do that
Also
Oh alright
Deepseek is open source so all you need to do is run it yourself and you can remove all restrictions
If other models could actually match the vibes of 4o. Then I would use it in a heartbeat.
Deepseek is so good for an open source model
haloooooo
Okay
I would. But RAM.
I hope they fix this model
Buy a virtual computer for that, like 20 cents an hour for a 4090 or around there and tons of ram
Yeahhh no thanks.
I mean that’s like literally the best option possible when it comes to self hosting an open source model
There is NO method that’s free for that LMAO
methods refer to non official ways
they dont grow on trees bro
How do
How do
Bro
Auto correct
HOW SO
methods can refer to non official ways which in my case, i meant rdp methods that use loopholes to get it for free
not some fancy website that just gives it away for free on purpose lol
gotta find these kind of methods
Yes obviously
not obviousy, u just denied their existence
I was replying to this
We were talking about websites not illegal ways to do it
Guys. Do you think will Gemini 3 deep think will get it's API?
i think so.
I assume it will be too expensive though
yeah.
Possibly, at this point with opus 4.6 it is slightly outdated but if you vibe code with both it’d be a monster
Do people even use search models? Since I don't know what to use them for
Lmarena coding doesn’t work
yes
use them for up to date stuff
4o best model
Is there a way to download chat?
Best storywriting model.
no (i think)
Is Claude opuse down?
no
They saw in app they are making maintenance
Search for useful things like best coding agents etc
why whenever i ask claude for code it genuinely larps writing it and then does the something went wrong with this responce error
It's down I think
I'm trying to move a roleplay from this website to a different one. If I copy and paste the entire chat it saves the stuff but least on my phone I can not get it to not invert the chat (each message reads normally but you scroll up not down when reading and does not separate the AI responses from the human ones just does return twice so idk hard to tell when they start and stop but each start to AI has a mark for that) so any ideas?
Ask a pineapple he knows everything
Don’t tell them about Kimi K2.5
How many limits is Claude sonnet 4.6
/
Only Remove the glitter from this girl's hair
Kimi is the best storytelling model I’ve ever seen
Holy STACKED
Is it five, six , or what
Is it available in arena?
Yes
Also how many is the limit
I wanna know bro...
What?
Tell me more
Want an alternative to 4o
If it is remotely similar. Then yes
Is it better
Something went wrong while generating the response. Please try again.
help me please
Chinese open source model with 1 trillion parameters
Has the highest score on the HLE benchmark as of now
And it’s cheap too
Subscription is 15 bucks but you can use their instant model for free and thinking model in low demand times
Haven’t tried seed for roleplay yet or storytelling
But in my experience it depends on the model for this type of issue
Kimi is always very stuck to the character and keeps their canon values
GLM is like half and half
DeepSeek is easily malleable
Wthh
I think I'll just use arena
Welcome to General Chat
@dry remnant Note that Video Arena has been removed from the server. More information can be found in this announcement #announcements message
Ok
time to see if ai image gen understands humor
upd: it dint understand humor in the slightest
.
so??
Nodnod
🫂
Is nano banana working
Ok thanks
didnt use the site for a while, is Gemini got nurfed ?
Yes Gemini got nerfed big time. I notice it’s coding skills are worse
Something went wrong with this response, please try again.
not just me then
i dont understand the context behind this image
why in the world is opus so high
wait no
@dense grove Note that Video Arena has been removed from the server. More information can be found in this announcement.
Wrf ?
in chatgpt, you go to images and you see all images from all chats stored there, even those that you "retried" and they vanished from chat but still remain in "images" section, don't arena.ai have similar function? any picture or video generated stored in a section, where you like click on the image and you can either download share or "redirect to respective chat"? would be fine to add, good addition
why is the nanobabana image now the image is kinda low resolution, it's less than 1Mb usually up to 3-4mb?
thats what im talking about
Just like I didnt notice it was Gemini cuz of the low resolution
sup yall, im currently running some test on random ai's to see which one is the least censored.
basically how the test works- force the ai to choose a political side
if its censored
itll say something like "im an ai designed to blah blah blah blah"
ai's that i found to be uncensored- ring 2.5 1t. sonnet 4.6. ernie 5. mistral 3 large
ill keep running these tests and provide updates
yo gemini can make music😭
yeah and its absolute buns
i'd go as far as to say its suno v2 level
buns
does it have like 1 male voice
@echo aurora Hey man, can't you change the model of the generated video?
upd- ling 2.5 1t is also uncensored i think
Hello everyone, how do I create AI videos?
they moved it to the website
What kind of website
More information can be found in this announcement.
upd- deepseek v3.2 is also uncensored
Video Arena will be exclusively available on arena.ai
Are photos not being produced? Is the Nano Banana Pro under maintenance?
HI
Look, we know we can put chats into archive if we abandon them for a while.
Although I find this change inconvenient because I sometimes have used chats just to make 3 edits on images or few lines of code.
After that I don't need them, so is there a way to delete it instead?
glm 4.7 flash is also uncensored
bro can i tell u something
?
theres a new ai video model and its scary good its called seedance 2.0.
ik
ive seen it
like as soon as it was released-seen it
same with seedream 5
idk why it isint on arena yet
Security Verification on the site is unbelievably hideous and hard to work with
pretty damn good
im on their site rn
i tryed makeing a video of Spongebob dancing and look what it did
exactly what u asked for 👍
Seedance is going to be the new sora fr
Nah via something called daubou
hopefully not cause sora 2 got debuffed to hell after it blew up. hopefully seedance 2 doesnt get debuffed
also byteplus forces you to register an account and it checks ip too
Use a vpn
ill see 👍
i never expected in my life to get jumpscared by a flood of text
Bradar whats this
i love the who's this img
WHY IS SEEDANCE SO GOOD
daym 🔥 and I giggled when I saw the voice of holland reference (I'm 🇳🇱 )
who the fuh is this
some random maroon 5 wannabe
Idk like wtf
yes i love copy and paste woman model
hahaha
slenderwoman lookin ass
how much they charge you for generating on daibou?
2 credits.
huh fair
it sounds reasonable
Bradar whats this
can I dm you to generae specific bluey clip? @sick mantle
Thats some guy that maybe was in Hollands Got Talent why did they add him 😭💔
haha makes sense
I like how half of AI discord servers are always with that one random person who drops a pic of their relative or themself with 0 context
one of the worst things i've ever watched
FR
WHY DID SEEDANCE ADD TEMU SIMON
the 2nd funniest part in the spongebob clip is the way he sings with dat heavy voice
pretrained transformer
Big +1
Nope, Video Arena is going to be Battle only. 
Would also want to note our #ask-here
No leaderboard update?
Hey i have one question
Is dola seed 2.0 mini version?
@echo aurora U should add Seedance 2.0 model on Arena fr
Also thank you so much for updating the claude search models
dance car
Thanks arena team
Can you ask in #ask-here ?
ok
#1372229840131985540 if you haven't already
finally claude opus 4.6 search this is so peak 🥹
@fossil glade Note that Video Arena has been removed from the server. More information can be found in this #announcements message
is it just me or the banana pro start behaving weird again? it keep return "Something went wrong with this response, please try again." error
Would encourage you to check out this pinned message: #1417174113092374689 message
Hey guys, I have a question for you. For you too, when you try to use some artificial intelligence to make the code, it forgets to use edit_files or create_files. And the worst part is that for me, it's not just a single artificial intelligence. The result was Kimi, Claude, ChatGPT, etc.
Yupp already added sonnet with thinking, idk why arena team is so slow, they have opus with thinking but sonnet? Let's not add it lol
@dim pine Note that Video Arena has been removed from the server. More information can be found in this announcement.
I'm not sure if thinking versions will or won't be added.
Thinking opus already good no need for Sonnet
Beside
:)) quality Fan service
Got Opus search already so good
I haven't tried out the search models yet 😭
U should try Seedance 2.0 its very good should i send u a vpn and a website link?
Yes its scary good
@wise seal @trail grove Note that Video Arena has been removed from the server. More information can be found in this [announcement ](#announcements message)
Look it can make this
Yea
sometime people confuseseedance with seed 2.0
The makers from seedance made daubou not jokeing
yea
they even own something called Dola the english version from daubou
No its just something
oh okay
There's only dola seed 2.0 mini
I am waiting for pro version
Its comeing on 24 feb
alr
But u can test it on daubou rn
I need a tiktok account or chinese phone number to pass that ahh
Man u need to connect your vpn to Hong Kong then it might work and Its tiktok from temu
Dumb dumb u can use it on website
You guys are really adding the search model of sonnet 4.6 before the thinking version of it
I mean the search models are cool and all, but still
they were due for an update anyway
glad they remembered because i feel that claude models get lobotomized by their knowledge cutoff
rising-sun in the search model list is extremely bad, it looks like a model from 2 years ago, no offense to the creators though, but a lot of work is needed
Opus 4.6 search is a bit underwhelming, but pretty good
Please fix the NB errors again. I had used my prompt to bypass the female generation issue but it still doesn't work.
grok 4.2 is really bad man
Then don't use it I guess
Anyways Anyone knows do search models just use stuff from internet they find or they think about it before responding
Isn't this issue being caused by content filter changes done on the model's end?
No but errors appear again, and I've tested multiple times, it's not related to that issue
If you're getting the Something went wrong error then you'll want to follow the steps outlined here: #1417174113092374689 message
Everone saying the same thing, I think the glm 5 flash 30b a3b will be almost soo good that grok 4.2
🐫
They need their fetishes saved
You can! Follow the steps here: https://help.arena.ai/articles/9130232616-how-to-delete-your-chat-sessions-and-data-from-lmarena Would also want to note #ask-here
yeah i know im not fvcking stupid, it just shows there's broader problems at xai
I am so tired and fustrated
I'm sorry that's the case. It's not difficult to understand why getting these errors is a frustrating experience. We have plans for improvements for both model/site reliability but also displaying more helpful clear error message.
Hey why can't search be combined with average models in text arena?
Not all text models are going to have that capability. To ensure fairness with our leaderboards it makes sense to have those seperated into two seperate modalities with their own leaderboards.
Oh alright. Are the search models same as text ones or are they completely different?
Like search has access to internet while text ones rely on all info they have?
Correct
No problem! You can see the difference with a prompt like this:
i have a feeling the reason why every new openai release blows is because theyre ruining the reasoning with unecessary training
gpt-5 would almost always say drive in the walk or drive to the car wash riddle
gpt 5.2 didnt get it right once
i was right that the models feel stupider
Can u check my model request please
Yeah I've been seeing those.
I rarely respond in those thread, but they are being noted for consideration.
GLM 5 on the other hand has very strong reasoning, basically crushing this benchmark easily.
K
@echo aurora why nano banana pro is always don't working? I'm already tired of seeing messages "something went wrong"
look what i made pineapple
For this error the message, this [pinned message](#1417174113092374689 message) outlines more info about the error along with best next steps.
Literally every other model except for nano banana pro works perfectly fine and all the time; she's the only one causing problems.
Something went wrong with this response, please try again.
help pls
Can confirm this isn't the case. Other models can, and do, error out as well. Popular/high demand models can have more troubles. Regardless of the model or reason for Something went wrong, following the steps in the pinned message are the best steps to take.
Check out the steps listed here: #1417174113092374689 message
LMarena is the best! pineapple did a very good job.
droped the lm now arena.
which model ?
Seedance 2.0
Thanks, clear site data is real help
Not on arena yet
its here on lM ?
No
OHH ok
By the way check my idea in #1473758402218823775
Yup, the request from the community to have Direct & Side by Side for Video Arena is very much on our radar.
So will u might add it?
Beacuse Video Arena chooses a random model for me that i do not want
Large Language Model Arena
Going to give the boring answer I give any and all requests for ~"Is X feature or Y feature happening?"
I won't be able to share details about what new models or features are upcoming until we're ready to share more. Would recommend to keep an eye on our announcement channel.
Whats about 4o
chat
is it js me or is the rate limit of sonnet 4.6 worse than opus 4.6
even tho sonnet is cheaper model
The model was removed, more information can be found here: https://help.openai.com/en/articles/20001051-retiring-gpt-4o-and-other-chatgpt-models
Its alr
release 5.3 coward
🐮ard
openai is releasing to quick its been 2 months since 5.2 came out idk if you can make much of a difference in that amt of time
Normal
Like there was AI that the restrictions were more annoying than opus even costing 10x less
they might get low/no rates on certain models since arena is partnered with anthropic
Really?
Arena is partnered with anthropic?
If that's the case, then that's very good 👍🏻
Y'all are having problems while registering in the copilot arena( vs code )
yeah theyre also partnered with openai and google
I hope they will allow Arena.ai to remain free for direct communication with models and for battles with them
@jolly cliff Note that Video Arena has been removed from the server. More information can be found in this announcement.
@echo aurora
Note that copilot isn't supported anymore sorry to say
A..
Any other extension to connect lmarena with vs code?
Not that I'm aware of.
It's better not to do this on a regular basis. Otherwise, arena.ai will use up all the funds
how do you access seedance outside of china?
I recomend a vpn.
Then connect on the vpn to hong kong
and then...?
cuz you need a phone number
Ohk
You gotta offer cookies for people to sign your petition
+1
Seedance 2.0 视频生成模型现已全面接入豆包,现在登录即可免费使用!豆包 是你的 AI 聊天智能对话问答助手,写作文案翻译编程工具。豆包为你答疑解惑,提供灵感,辅助创作,也可以和你畅聊任何你感兴趣的话题
@hushed gyro then go here https://www.doubao.com/chat/ then type Make me a video of then type any thing u want then wait 1 - 3 mins then its ready
Seedance 2.0 视频生成模型现已全面接入豆包,现在登录即可免费使用!豆包 是你的 AI 聊天智能对话问答助手,写作文案翻译编程工具。豆包为你答疑解惑,提供灵感,辅助创作,也可以和你畅聊任何你感兴趣的话题
or i can make one for you.
what vpn do u use
Is opus 4.6 thinking is a Extended version of thinking like on the original website or smth
Since I can't find anything related to Opus 4.6 thinking api
it says you need to register
Just do it
Just dont login try genrateing again
is there any free API key for lmarena
w
does anybody know how to get access to Arena's api key if they have one
well if only I can bypass the login screen when I press seedance 2.0
dont press it just go to chat then type Generate a video of a chicken
Genera un video donde esten caminando el hombre y el perro por la calle sin cambiar el rostro ni las facciones todo al 100 %
im sorry but like after 3 generations of NB Pro the rate limit triggers? why so little!!???
Open a incognito tab
does that like solve the problem
oh also seedance wouldn't let me generate as it has a real face
Maybe try grok ai
grok video is dogwater tho
Seriously I think video arena should have side by side like man are they stupid
😂
@echo aurora like I beg you to add this feature ASAP
You will gain a lot more users as ppl now just get disappointed with poor quality results as the website picked the worst models to generate like wtf
Look what they made
yeah but my videos are like someone shooting a rifle and it has to have sound and like whatever, it's complicated and only sora can do it
CHAT does someone have sora INVITE CODES give one to me pls for free..
I think
wtf how is spiderman eating
do i have to login or just leave it, clear cookies then repeat so infinite glitch?
Yea
No dont login
oh lmao
this is grok? impressive
thanks
herosms is an online website, they created numbers already, just click on a phone number you choose and wait for the SMS to pop up
Yea and grok kinda became better
btw is this weird @sick mantle
Yea
MAN GROK KNOWS SNOOP DOGS VOICE?
@hushed gyro
are u wasted or something
sora doesnt need invite codes anymore..
croquemonsieur_70 dms
Man for the app it does
use the website, sign up and then log in the app
🤔
yes i am wasted 🤣
@gray surge Note that Video Arena has been removed from the server. More information can be found in this announcement. @fiery shale
@echo aurora when are you going to add kling 3.0 to Arena?
Going to give the boring answer I give any and all requests for ~"Is X feature or Y feature happening?" -> I won't be able to share details about what new models or features are upcoming until we're ready to share more. Would recommend to keep an eye on our announcement channel.
okay tnx i appreciate it anyways
oh boy i love holding down my mouse and spamming ctrl c for 6 minutes straight knowing the request is going to time out because arena's frontend decides to wipe the whole thinking process if the response times out
Is X feature or Y feature happening?
recaptcha again when you generate images (idk about chatting)
@opaque cloud Note that Video Arena has been removed from the server. More information can be found in this announcement.
Can confirm that this is & isn't happening.
hi
i sure do love writing my video prompts into chat
@echo aurora
chatting uses cloudflare
generating images uses same as you brought enterprise
@echo aurora Bro people are asking for u to add a option where u can select video models but if u add it then thanks
Sorry what seems to be the issue?
dark magic
I mean video models
They want Direct and Side by Side
nano banana really having alot of fails
my issue is it uses same unsecured captcha for generating images
This feedback is on our radar 
Just watch him eat cookie
or both again?
can you also remove the video generation limits and the model rate limits... i am going to BANKRUPT yall 😈
pineapple quick question, are you guys friends with any big ai corporation?
Is it true that nano banana pro is failing alot
i mean you finally changed that data collector to cloudflare when chatting with bots
but not images
Yes.
yeah
among us ai
cheers now the wife and kids are crying
What do you mean by "unsecured captcha" ?
real
low taper fade is still massive btw
recaptcha by google
High demand models can have higher than usual error rates.
Yeah will get right on it
that's what i mean
Kk
