#general
1 messages · Page 221 of 1
Some ppl just not cool with it period. It’s their prerogative. They have every right just like we have every right.
i'd agree
Ai is just the accelerate to the whole controversy surrounding technology, computers, and the digital space, and even further than that, you could even say industrialization in general
Look it the uni bomber
Industrial Society and Its Future, also known as the Unabomber Manifesto, is a 1995 anti-technology essay by Ted Kaczynski. The manifesto contends that the Industrial Revolution began a harmful process of natural destruction brought about by technology, while forcing humans to adapt to machinery, creating a sociopolitical order that suppresses h...
Neo-Luddism or new Luddism is a philosophy opposing many forms of modern technology. The term Luddite is generally used as a pejorative applied to people showing technophobic leanings. The name is based on the historical legacy of the English Luddites, who were active between 1811 and 1817. While the original Luddites were mostly concerned with ...
Technophobia (from Greek τέχνη technē, "art, skill, craft" and φόβος phobos, "fear"), also known as technofear, is the fear or dislike of, or discomfort with, advanced technology or complex devices, especially personal computers, smartphones, and tablet computers. A 2018 study proposed a new conceptual and empirical definition of tech...
Its really a war between authenticity and conformity
Jean-Jacques Rousseau's concept of "the noble savage." Rousseau argues that in a more natural state, untouched by societal expectations, humans are pure, authentic, and true to their inner selves. It is only when society imposes constraints and norms upon individuals that they deviate from this original state, resulting in the forfeit of their genuine selves
who's complaining ?
Here is video break down
Go to https://ground.news/moon to see through media bias and become a smarter news consumer. Subscribe through my link for 40% off unlimited access this month.
So this is a real ongoing debate with real implications that’s why people are so resistant to AI which can be viewed from this lens as an accelerate to the ongoing issues of society
Believe it or not this dude was actually a victim of MK ultra
Took fbi 17 years to catch him, he only got caught because his brother recognize his writing.
I heard about that I believe but wdym MK Ultra?
???
i didn't dawg
He’s actually a test subject for it
Viewer Discretion is Advised
You’ve signed up to participate in a psychological study on your university campus. It’s run by an esteemed professor, you’ll earn some pocket change, and as a side bonus perhaps contribute to the greater scientific knowledge of humanity. But the next thing you know, you have electrodes strapping you to machi...
And I'd say majority of people are resistant to it because of "eco system"
but he uses ai which is the irony
Whats wrong with using AI? Whats the problem with that dude idk him
I will watch
Yeh it’s crazy
he posted "how to avoid ai"
.
Ohhh
good news
NBP back on yupp
i am still hoping 4k will come back 💔
yo omg they keep taking it off lmao
☠️
i hate gpt 5.2 from my own subscription, but im on the lmarena voting board and the anonymous prompts i keep saying is better than the alternative end up being 5.2 somehow
Well I legitimately i can't even make a new chat right now on the website 🥲 ( i mean I can but it won't load at all after I put in the prompt)
perhaps i mightve been to hard on it because it keeps winning atm
5.2 is z best

how is gpt 5.2 for creative writing anyway
wow
must go to meet pineapple in person for him to verify you manually
What is that
hmm, catching up is not as difficult as closing the gap and excelling though
wonder if they can take the lead and sustain it 🤔
As a Chinese, I think DeepSeek and other models are not as good as Gemini 3 Pro, but this evaluation is only temporary
Are all the mods busy ?
Tagged one guy 2 times in last 5 days and yet no response….. 💀
Lame af as compared to opus 4.5 and gemini 3 pro
yup expected
yeah it's only slightly better than deepseek v3.2
gemini 3 actually good at writing huh?
yeah asked it to write some novel scenes for me
what is it best at prose wise?
it gave quite good responses
opus 4.5
i've noticed opus loves writing the name marcus in its stories
tho it gets cringe asf sometimes
yeah 😭
like genuinely sometimes i gotta step in cus it just gets so unbearably hard to read
and also that 67 search on google
with claude
lmfaooo
😭
Indeed- sometimes It will just add dialogues from the barbie movie to random character writing
gemini 3 i've heard has a nice flow with writing and it made a SICK ass short story about ww1, even above claude, so i think its creativity is also better but prose goes to claude
is it js me or do you also have a problem with claude forcing characters to flirt by saying "you're impossible" which nobody on my fkn life says irl??
Gemini is my goat 🐐 bro gave me a story which made me actually cry 🤌🏼
I mean 2020 they used to say that
yeah it's hella good overall but claude mogs ppl in coding
well just completely bashes gemini
but money issues 
yeaaa but like i never hear anyone call anybody that
ever..,
YEAAA
i mean i just use yupp but even then it costs so many credits
gemini ain't good for agentic coding. For one shot idk maybe it's better than Claude
indeed
yeah but that's a toss up
so i usually just use gemini over claude unless i want something SPECIFIC
"shut you're impossible" that sounds as if an oldie tryna sound cool
yup yup but the thing is gemini 3 pro is still a preview version and the knowledge cutoff is January 2025..?
well, it has search
don't forget that
search is always the fix to knowledge cut offs
yup ngl it feels as if it's written by actual ai unlike gpt and gemini
nah it sometimes forget that there's something called nano-banana
Gpt 4o was the peak of OpenAI
Everything after that feels like downgrade
genuinely
such a fun model
was very smart and was so good at writing
it's not even the same gpt 4o model anymore
I won't be surprised if tomorrow chatgpt gets renamed as Copilot pro Max ngl
that's the thing 😭
the preview version sucks
but there's a hope guys
https://x.com/AiBattle_/status/1999858840045764777?s=19
What's up with 5.2 not being on the leaderboard? Kinda sus
How is it on webdev but not the main text one
GPT 5.2 proved that Stargate Project is the most shameful page in American history. Gemini 3 Pro and Claude 4.5 Opus got NONE of governments money, NONE!
it will be nerfed again
im kinda liking the stealth gemin i3 flash model on lmarena
ghost something
ghostfalcon
its really good
what a shameless ranking
xhigh ultra pro max vip
Agreed. One user here spreads this crap every single day here.
it will be nerfed again
Sorry to put my request here but I'm new on LMarena and I can't create account
I'm blocked to password creation.
I didn't find support page or support contact
This is Amazing and Beautiful @whole sundial did you see this?
And did you test more?
i should test more soon
Still just 6 points more then Kimi
Yeah make more of the pro sub lol
But what's your verdict so far?
Is it promising?
Im just kinda a general user so is hard for me to judge
it's good in terms of post-training for popular tasks by coding, hopefully they can move to k2 base or a different base in the future
the end of this month and early next year should have some new 1t+ base models be released, deepseek v4 and grok 3 definitely
the massive influx of h200s into china soon should help with them training better and bigger base models, the chinese government accepted defeat and are allowing their major ai companies to buy them as they are much more powerful than china's most powerful homegrown chips
kinda unrelated to momentum labs but still important as they may soon find the limits of their current base and will have to move to a new one, hopefully by then there will be some more really good base models for them to use, china (and in turn the whole open source ai community) is lacking rn, glm 4.5 and kimi k2 are great, ling 1t and deepseek v3 are fine, everything else is honestly mid
🤔
Is MomentumLabs Chinese company?
no, i tried to clarify that and i should some more, they are from the UK and the Netherlands
#1397655624103493813
<@&1349916362595635286>
Noted Noted
You are very knowledgeable of Ai man
I have a question so actually i have kinda made a 8 SOTA Models Orchestration for Engineering Like System-Persona Prompts do you think this is Would Give Better Quality? @whole sundial
The Models are
Team West
Gemini
Opus
GPT
Grok
Team East
Kimi
Deepseek
Qwen
GLM
maybe, but i think you should be using something like gpt5.2-pro or gemini 3 pro deepthink instead, combining different models could have some effect, but you're probably better off using one of the two models i mentioned previously since they are based on either one of the world's most powerful models
I can't pay for those man
💀
true, in that case just combine gemini or gpt 5.2 together 4 times
Why leave out opus 4.5?
Also would GPT 5.2 Extra High Work because is Available on Yupp Ai?
yeah, almost as good as the pro models in its own rights
yeah you can put opus in there, maybe grok 4.1 too
the chinese models aren't really worth it as they distill off of the american models anyways
yeah, in fact it may be pushing down the performance
Hey guys,
Is there a usage limit on grok models? Like opus?
💀 i thought they have alot of potential
they do, they just need start reliably beating us models and not have tell-tale signs of distilling (deepseek and glm have signs of gemini distillation, kimi k2 thinking may be distilled from gpt-oss, idk about qwen but they may be distilling as well)
Ahhh i see
So i drop them for now i guess
yeah
GPT 5.2 Pro is a piece of CRAP! The same like GPT 5.2 Thinking, just with higher latency, 40 minutes or something.
just use gpt 5.2 xhigh then, not much of a performance degradation there
I can that's easiest, simplest and best
GPT 5.2 or all four if i want to go all in
I will just tell GPT XH to run multiple Iteration-Refinment Cycles-Loops for Better Prompts
Is there a MASSIVE difference between Gemini 3 pro and Kimi K2 Thinking?
Are you kidding me? It's even worse than high, because it has COMPLETLY the same performance and significantly higher latency.
chat how to jailbreak Gemini 3 Pro?
LLMs do that for us now
it's basically when a company tests something to some people but not others, or when a company tests two different things to two different groups
in this case, the video arena is only being tested to some people, most don't have it
Which site ?
chat what is the message limit for 3 Pro on Gemini?
@zealous sparrow i thought you could connect to Google drive???
You have reached your rate limit for claude-opus-4-5-20251101. Please try again in 50 minutes.
please help me
Yo where tf is reve-v1 when generating an image?
What is unclear?
they took it away and replaced it with a stealth model called epsilon that is also by reve, idk why but the only way to access it is by using battle mode
my guess it that it's a new checkpoint and they didn't want two of the same model on lmarena at the same time
What is the ‘expert’ column about in the rankings? Is there any information about this?
xhigh 💀
Gemini, GPT, Opus if you had to choose one or two? @whole sundial
gemini, if two then gpt as well, I don't code much so I didn't include opus, it has inferior world knowledge compared to the other two
What??? Come on man it was one of my favorite🤦🏾♂️🤦🏾♂️🤦🏾♂️🤦🏾♂️
Out of all the damn models, that one???
True but opus is not only for coding is it?
Is great for logic and reasoning isn't it?
Like for Prompt Engineering because it has great structuring
true, but most people think of opus as a coding model, it's good at other things as well
Mannnnnnn this is some bs
yeah, idk what to say. honestly, it's stupid to replace it like that. they should've tested the new version while keeping the old version like with seedream 4 and autumn (seedream 4.5)
or with any other model but reve, actually
openai, google, bfl, and bytedance all didn't do this, they all kept their older models
idk what is up with reve
even on the text side i've never seen this before, they stealth test new checkpoints alongside older ones, they don't replace a released model with a stealth one
but epsilon is reve, you'll just need to run your prompt a few times to get it, honestly all stealth models should be selectable like yupp imo
Yeah but recently yupp removed nbp 4k and sd 4.5 4k
💀
I see so what are the best strengths of opus?
OMG Video Generation working on LMArena (Webpage) thxxxxxxxxxxxxxxxxxxxxxxxxx
Why not News? ❤️
Or i only feel attracted towards it because is expensive?
So the world doesn't abominate the compute expenses
idk because i don't use it, but i've seen it used mainly for coding
also yes it is still expensive, less expensive than before but still expensive compared to others
not fully out yet, you're lucky because only some people are getting it
I see thanks for the update
nbp 4k is expensive, sd 4.5 4k costs the same as standard. idk why they call it max when it costs the same, just a higher resolution
not really compared to the other three
True but now both gone for selection
honestly dumb to charge more credits for a model that costs the same as the other
I see do you know the grok 4.20 won the alpha arena trading?
Idk why they have this kind of system yeah
based on some random tweet from an elon glazer
i don't trust it, model isn't even released
Like the alpha arena website is there and legit you still don't trust it?
Yeah
ah so epsilon is reve
i wanted to know what it is
سلام
Well maybe it will be on par with other 3
hopefully
🤞🏻
how did you find out epsilon is reve
@whole sundial tysm for finding me you have great impact on my life
I was so confused and time consumed by my own systems
Now i can work with simplicity
they have the same output plus the reve v1 model was removed at the same time as epsilon was added
you're welcome
elon says it's grok 4.20 experimental
makes sense
yeah i trust it now
💙🌌❤️
it's probably on lmarena as some model
theres some grok codenamed models on lmarena
you would just need to pinpoint them
if I want to use already executed prompt and add some detail and make another prompt based on given prompt then how I can do this?
actually i want to link one prompt to another prompt in single prompt like character details etc
Just a reminder. GPT-5.2 is BAD!
Good morning
Under 1000 lines of code btw
https://019b1cbd-304f-75e9-a01b-b9dceb231d0e.arena.site [ghostfalcon]
We need to start to believe in our AIs man, instead of pushing them to limits.
You have been testing these falcon models a lot. What is your opinion so far on these models?
Good, meh, bad?
Scammed by Scam altman again
solid
ghostfalcon seems to have improved overtime tbh
doesn't write a lot of code, but still fulfills the task given
That could be the case. They might be fixing bugs etc
Do you think it's better than 2.5 pro?
I would be a bit disappointed if it's worse.
I guess
I am going to guess google will put out another checkpoint tho.
A thing ive noticed tho, Is if you ask for too much modules, It wont program them.
Well, it will work but just a click solves them
luuuuuuuuul is deleted.....:(
no i still have it
URL Link? ❤️
wont work for you if it doesnt show up on your website
It's tied to accounts
Scam Altman stole $500B from American taxpayers! And even that didn't help him to compete with competitors.
im log in with my Google Acc, not working 🙁
did that specific account have it
Website down?
I mean it's still up for me ( even if I absolutely can't do anything)
Which model should get prize "Worst model of 2025"?
7
15
1
GPT 5.2
whats the claude system prompt
halo
what are the neww studio a/b test for
I only got Flux models on Battle, is this normal ?
hi everyone Can I copy the link to my favorite chat bot on the lmarena.ai website somewhere? Is there an lmarena.ai app?
Gemini 3 Flash/Pro checkpoint
hey guys I need you all to give me some insight
I am using gemini 3 pro right now, but it chatgpt 5.2 much better?
I moved away from chatgpt because I did not like its way of speaking to me
does 5.2 worsen or fix that?
it also sucks for coding
i mean it gives you full projects in one prompt
thats i what i like
the responses are more complete than gemini
claude is very expensive
Commie Claude does, yes.
dam
ok I guess I am sticking with gemmy now
I really did like openai though
while I was using it
it's just really annoying that they neutered the poo out of chatgpt and now it is impossible to talk to
Ok, I keep hearing this censorship stuff for 5.2. I use AI differently than most in here as a sports and prediction market modeler, so it hasn't affected me, yet. What is everyone experiencing
Did Opus-4.5 suffered intelligence regression?
it gave an inferior output, compared to its performance yesterday :/
it refuses a lot of political/medical questions because of guardrails
Can anybody tell me about this?
First, five videos are allowed in a day, and now how many videos can we make per day? Please let me know?
bruh
Can anybody tell me about this?
When gemini 3.5 pro?
next year
It feels like major models are getting worse across the board
gemini 3 pro isn't as good as gemini 2.5 preview 0325 lol
I think this is because we're basically trying to brute-force improvement
They trying to cut costs..
They don't care lol investors will give them infinite money either way (until they don't)
Nah bruh that's not really how it works
A business first goal is to cut expenses, as they always did
That's not how AI companies and departments operate though
Investors arent a magic entity, they are people who expect their money back
They're basically running on the promise that they'll make profit someday
Which is what keeps investors investing in them
Exactly
Thing is they haven't really been stingy with the money they're supposed to start making a profit with so far
Yeah it may seem like that on the outside, since this was a big bet, but inside they are for sure thinking how to cut cost as much as possible maintaining a good performance
google is the only one making money
Apologies if this comes across as fallacious or rude but how do you know how they operate within?
Profits are going to be big, but not yet. Ai is def the future, but we need to rebuild everything to fully integrate it. once done the demand will be so high that profits will start to roll out
I study history of business and economics
I do not know at 100%, but I assume based on what I've learned
Google 2001
Ahhhh, yea I get it. I use Grok for those things more
Agreed.
I anyways find it interesting outside everyone likes to dunk on lmarena as a benchmark. The top models though are Gemini 3 for text and Opus for coding. It’s seems like it’s captured best models in these two domains better than most benchmarks
y
Let’s avoid continuing this conversation here. If you encounter any issues again, please DM me instead. @weary galleon @vivid coral
Interesting observations. I noticed that Gemini 3 pro grounding behaves as if it had cut off access to data. Many times I asked to find information on a topic that is relatively new, and the model returned information that such a thing does not exist.
erm what the sigma
It's kind of absurd that LMArena lets us generate images with Nano Banana Pro for free at all, but especially in 2k. At $0.15, per image they must be bleeding cash. I just wanted to say I'm truly grateful. 🙏
Well calculated business strategy
nice
more proof 5.2 flopped [SOTA my ahh]
how ya using him?
@patent aspen
where is brian
so many username handles
just wanted to ask if gemini 3 flash turned out better than they expected?
my initial tests tells me that its quite a capable model
leo seems to think so as well. I haven't used it and don't know the evals
Is 5.2 sota at anything?
no
no
Arc agi
means nothing brah
benchmaxxed
No
Im glad Gpt 5.2 didn't benchmax CritPt
Openai never benchmaxxes
bro
it is
Its not real then the physics bench is faked
have you got any proofs
Yes
Openai is the largest company they wouldn't lie
go ask it a difficult physics question
see if it gives you a good answer
Ok
gemini is still the best overall
It got it wrong
there is your proof that the bench didnt lie
Is this sota?
kind of
They didn't use to be the worst offenders, but honestly... 5.2 was all about benchmaxxing
Like xhigh doesn't even output more tokens overall than 5.1 high to run AA
so even that is kinda pointless lol
correction: gpt 5.2 is ranked the most censored model of all time
Then what should I use
grok for roleplay
gemini pro for multimodal tasks and writing
claude for coding
no way that's real lmao
gpt is so censored
and gemini not so much
gemini is still censored but no way is it third right?
OH
IM A COMPLETE DMBASS
MB
?
would be nice to see that with the open source models
that's weird shouldn't grok be at the or near the top?
that graph actually makes sense in that case
cus why would gpt of all models be at the bottom
yeah the graph is completely wrong but i just also read it wrong
who did it better
Rate based on physics
https://019b1dd1-7aec-7ba4-82c2-0651703e70b7.arena.site - gpt 5.2 high
https://019b1dd1-7aec-70fa-bc1a-8e632abe4b6e.arena.site - ghostfalcon [gemini 3 flash checkpoint]
grok is not top 3 most censored it's dumb rating
yeah, so?
what im tryna say is that graph is completely wrong near the bottom
it's accurate up until like
i am not the one who made it, so idk
then it'd be good
are you sure 4.1 is not too censored
🥶🥶
Yeah this is actually very interesting, look at 5.0
yeah lmao
what is that anyway?
difficult physics questions benchmark
kinda unfair comparison
but gpt cooked
hi guys
gemini 3 pro is still on the top lol
these cheap shills are nothing in real performance
they just hype openai up with no real evidence
0.0%?????
Hello
which ai is the best for generating images?
Nano-Banana-Pro-2K
Take a visual journey from a single byte to the colossal scale of a quettabyte the largest data unit we can name. From early arcade memory to today’s AI-driven data explosion, this real-scale 3D visualization reveals how massive our digital world has become. If you love mind-bending comparisons, data science, or tech history, this one’s for ...
we should add grok 4
and mistral
deepseek v3.2 is also so bad
- mistral
- deepseek
- grok 4
- gpt 5.2
thanks
both are good but which one was faster
DeepSeek is a great company, they don't have so much money, employees, and GPUs that OpenAI have. They work hard and do great job. OpenAI is an extremely rich looser.
it was leaked that deepseek smuggled many h200
and they are already using them
🙂
ghostfalcon
then ghostfalcon is better
It's worse than the AIStudio checkpoint for sure
Yes, but not so much like OpenAI have.
but how do u know the one in aistudio is flash ?
could be pro checkpoint
People that are more indepth with google say so
They can legally say that
Unless we are wrong
and its a 3pro checkpoint
doubt it
nah i doubt that
they tested like 4 checkpoints on lmarena if im not wrong
or like 3
this one seems the best
ive tried the other ones, they were so bad at fixing bugs
I need cosmetic genetic engineering to be developed so I can become a catgirl. My appreciation for catgirls has gone from ironic to genuine over the past several years.
yes
which checkpoint is this
ghostfalcon if im not wrong
i wonder, when we will have the first [realistic] cat-robot..
yeah no i think its the AIStudio one
yeah defo the AIStudio checkpoint
for long output
maybe
ghostfalcon cant make good svg
gemini 3 flash is capped at 300 linecodes, u need a prompt to unlock it
i object
its capped at like 800
ghostfalcon
yeah its limited, cuz google gooners decided to lock it at some point of output
lol
GPT-5.1 = 0%
GPT-5.2 xHigh = 0%
ppdqwpdpqwd
what the fk?
bruh
lol
i need to lock in and become a femboy already
joking..,. maybe.,.
hi, imma the only 1 having issue with the video generating model ?
how can I make generated video 11 second long rather than 8-9?
I think some AI models are missing like grok 4.1
can someone answer me where its gone
this is under 1k lines of code but its cool
https://019b1e5c-f810-773f-845e-eaace88c8bf1.arena.site
[ghostfalcon]
it is still there i can see it
now it back for me, i need to re-login
Wen will VideoArena will roll out for us poor peasants ?
When they test it enough for release
I do warn you currently it has a ratelimit of 2 videos per 14h
How long will
Those vids be ?!
hi everyone! i am new to LMarena, could someone pls help me? why can't i find seedream and nano banana in the model list? yesterday these models were available. do you have the same problem?
i cannot even make videos it just gives me error something went wrong and that is it
8 seconds for sora im pretty sure
unsure for rest prob the same
enable image mode to see them
damn! thanks bro! really helped
I really like SeeDream and flux as model
is there any video button or can you only make videos on discord?
it's only on discord
coming soon to the website
k
It's because Google is arguably still not the best at fine-tuning. And they haven't trained their model to be able to do extremely long outputs
It's not a software, you can't just cap the model at x lines without cutting response mid generation lol
Which model has the longest thinking?
that would go to deepseek speciale or gpt 5.2 xhigh
I mean on Arena.
wdym? speciale is on arena
was
oh
issue not fixed still ig
oh wait i think uh
tbh im confused
abt lmarenas decisions
like why do we have 32k but not 64k which isnt that much more expensive
cz i looked at gemini's internals and
What's 32k?
claude think model thinking limit
Ohhh
like opus 32k
Reasoning Effort
ye ye
Yeah 64k would be better
Agreed
But with few prompts how much can be done really 💀
The rate limits are tight
theyre fine for me
You probably just give very good long prompts in one go
Thought for 37 minutes.
Absolutely not, go ***k yourself.
wish a LLM said that
Lol almost
Nobody will read so many letters
Read books instead of this
is this your experience with your own codebase?
or are you just a bot?
I sense em-dashes
and your writing format is somewhat reminescent of a model, especially Gemini with a lot of finetuning
Of course, you can use em-dashes in a writing and that's fine. :' )
o yea lol
its not x its y!!!!
You're absolutely correct!
Absolutely not, go f*** urself. lol
seriously that respond is borderline AGI if its true
Very true
I’d like to make an ai video with an image I have, it’s to say the written prompts I give it, how do I go about thst here, im new here btw
Hi
hiii
hi
Hi
Hello
Okay it looks like the 30 sec one i mean you used reference images tho
But still nice
No prompts
The composition leads the eye upwards from the cloud-shrouded base to the sharp pinnacle, emphasizing the height and dominance of the mountain. Every element, from the stylized rock textures to the soft cloud forms, contributes to a cohesive and beautiful anime aesthetic, capturing a moment of quiet majesty in a frigid, isolated world.
Hi this is exciting
But is this the original??
Didn't sora signed with disney or something? Making it free for us to create disney sora generations?
Dbz not Disney
Yeah ik ab this one
But i meant
Ya
Did you try to do some disney characters
I did not
And how it looks is it good or wha
They should make an announcement when it’s ready
As part of this three-year licensing agreement, Sora will be able to generate short, user-prompted social videos that can be viewed and shared by fans, drawing on more than 200 Disney, Marvel, Pixar and Star Wars characters.
That's exciting
But i hope it's not just characters tho
I hope we can create a series and stuff
#stockmarketnews #businessnews #financialnews
Bob Iger confirms that Disney has sent a cease-and-desist letter to Google after months of unproductive talks about how Google’s AI systems allegedly use Disney’s copyrighted works. He stresses that Disney has been “aggressive” in protecting its IP and has already forced other AI firms to r...
It is
No actors
Disney just signed a massive partnership with OpenAI’s Sora platform, allowing AI-generated videos featuring Disney, Marvel, Pixar, and Star Wars characters. For a company that spent decades fighting to extend copyright laws and control its IP, this new deal reveals something huge: Disney’s grip on copyright might finally be slipping.
In th...
As part of the agreement, Disney will make a $1 billion equity investment in OpenAI, and receive warrants to purchase additional equity
Under the license, fans will be able to watch curated selections of Sora-generated videos on Disney+, and OpenAI and Disney will collaborate to utilize OpenAI’s models to power new experiences for Disney + subscribers, furthering innovative and creative ways to connect with Disney’s stories and characters. Sora and ChatGPT Images are expected to start generating fan-inspired videos with Disney’s multi-brand licensed characters in early 2026.
Ayy we'll even get the Yoda!!!
I can't imagine the memes
Yeah
And abuse
Disney with hitl3r
lol
Gunna get abused 100%
‘Hythey will beef up guardrails
hello
Soon we'll be able to create videos on lmaeren. I captured a screenshot when it appeared, but when I refreshed the webpage, it disappeared.
:/
I think it will be ready once LMArena figures out the limit for these. It’s way too strict right now, and it shouldn’t have to be this strict for all models in my opinion
How so
why are people saying yupp has more models?
lmarena has like 110 models
well idk if they are all working tho
Well, for models like Sora 2 and Sora 2 Pro, the limit is undoubtedly understandable, but for less cost effective models, there doesn’t need to be that strict of a limit. Also waiting 13 hours is diabolical since the other rate limits are 50 minutes
need to check how many models yupp has
on the leaderboard
also more live ones
nah
i dont think so
its just the way its presented it looks like that
since it has all models in the same menu and you need to filter ( image/live/reasoning...)
lmarena has 14
It’s not a video gen platform though
but some of them are useless ngl
You get 10 videos daily lol
OpenAI released circuit-sparsity, a research drop that exposes how a language model makes decisions internally. Instead of scaling up, OpenAI trained a transformer while cutting over 99.9% of its internal connections during training, forcing its logic into small, readable circuits. The release includes a real model and tooling that let researche...
Haha same this happened to me i thought i was dreaming
This is hilarious, they sent me a thesis to correct and I'm using GPT 5.2 xhigh in several stages to analyze it, it's simply catching ALL the errors, it's going to seem like the woman's work was bad, but I don't care, I'm going to be rigorousest (I'm going to analyze each error to see if they really exist)
Regardless you have to pay for a bunch of them there. And they limit your tokens. Apples and oranges
Full GPT-5.2 breakdown - did OpenAI reclaim the crown? A story of tokens, time and cost, plus 9 details you wouldn’t get just from reading the headlines.
https://www.youtube.com/@eightythousandhours
AI Insiders ($9!): https://www.patreon.com/AIExplained
https://lmcouncil.ai
Chapters:
00:00 - Introduction
00:55 - Better than Human @ Profes...
Was that a resume / cover letter?
ok just a small update.
stop response button in : added.
model usages : unfortunately its impossible since the data is hardcoded on server-side.
Something went wrong false positives : ive fixed that. well for now it bypasses everything but i can improve it to decrease false positives instead of bypassing them
i can add some trust indicators for false positives
because ive seen this guy report, and we can like add something like -> se*ual + health = bypass
it still needs context awareness tbh
Wat is that
im making a script that re-designs lmarena and also fix its bugs
thats one of the bugs
'Something went wrong'
What is the context lol
its not a bug really
U can get that to pass
wym
how
no sometimes it triggers false positive flag
and you cant even send ur message
How r u promoting it
Here we speak English 🇺🇸
i cant tell you the secret 😖
I dint need a secret 🤫
you do 😡
its not a prompt
im not talking about a prompt to bypass their system filter
the issue is that even before sending that prompt it will get blocked
I just think everything is possible to generate the right context and the proper prompt structure and wording
A lot like lock picking
exact scene lmao
Yeah I told u
Now u need to figure out if u already didn’t how to control it
N see if u can use it to spin off other characters
infinity war now
ok
The cam effect I use to stabilize the video
did u post something, i was afk
“The scene opens with He runs into a professor let him pick one out of three monster in his pocket and he gets to pick one, but he picked the yellow one instead of the fire, water or grass”
Did it work
Damn
i got one
why original source? what u mean
last is original?
ya
exact position
Just minor changes enough so it’s not 100% clone
Mask it or tweak it just a lil but essentially it’s just mimicking the train data ip lol
And getting 0$ for it
Same with most text
Lack of creativity and originality
thats ai currently
is all the Gemini models error to you guys ?
yeah
high is already here
which is better for coding GPT-5.1 Codex Max High or GPT-5.2 Codex Max High ?
Is there any plan to add a “research” (or similar) category in the future?
mai 1 preview disappeared from lmarena. Why?
Hello
opus
Guys support this, https://discord.com/channels/1340554757349179412/1449482775823515688
So we can get image support in search modality and code modality (code/web arena)
I have found a bug in which i am able to attach images to both modality and they work well.
So that means the foundation is there just proper implementation for release is pending.
guys
is lmarena retry button glitching again for yall, all models retry button is glitching again
oh nm
The difference between the two models (image to video) is too great; there's no need to compare them at all. It's obvious the right model wins.
#video-arena-2 message
<@&1349916362595635286>
is any error result count on quota ?
its like, I still didnt get any result but it says retry in 50 min or something
Second one doesn't exist at all.
This question will never be resolved
If people want the answer it’s in the benchmark, yet people don’t trust the benchmark
So how could this question possibly ever get answered?
Ask 10 different people you’ll get 11 different answers
for text and images it doesn’t count but I’m not sure if it’s the case for videos
As an AI-expert I wanna make my official statement: GPT-5.2 is BAD.
or maybe its count only in Direct ? I got this msg
it always error so I keep on retry... and doesnt get any pic yet
It works for me PERFECTLY. Keep trying.
At what?
It's not objectively bad. It's actually a really good AI model. But it does not live up to the hype OpenAI like to build around their releases, and it's also not a frontier model. So in the end, it's simply a disappointment.
Its trendy to hate on open ai these days
Everything.
It's a piece of crap. Because I don't see any improvement between 5.1 and 5.2.
yes, in next 50 min
Could be that where all the improvements are is the parts you’re not using?
5.2 focus more on business enterprise stuff imo. more for professional settings then for fiddling around.
Thats why 4o was the perfect model
Cuz that’s what people are comparing the ChatGPT experience to without realizing it.
All more technical stuff is benchmarked so there shouldn’t be an issue
Unless we don’t trust the benchmarks…
What is it bad compared to
Zero improvement from 5.1
So it's not bad but not as good as you expected?
Unless gpt 5.1 is also bad
Lots of hype created by Scam Altman
I don't see an issue with this
Wdym it was the best or very close until Gemini 3
No at all
Gemini 3 Pro, Opus, Sonnet are much better, not even close to 5.2 extra mega high
You confused. Opus 4 and Opus 4.5 are two different models.
So benchmarks lie?
I meant 4.5 mb
They are fairly close like 40 pts
No
On coding 5.2 is better than Gemini 3(barely)
On lmarena they are
No
Wdym no
You can check for yourself if you can.
GPT 5.2 never was in Text leaderboard.
What about coding
What about medicine, biology, physics, mathematics, chemistry, laws, ANY coding except JS, etc.?
They don't have those benchmarks
Its cuz its ChatGPT is being heavy handed on content moderation
Arena Code tests only JavaScript, that's all.
That doesn't mean gpt is worse at everything else
Where
Fr
I tested it yet for myself.
On official website of LMArena.
I can only see webdev
Lol
That doesn't necessarily mean it's true
You have personal biases on what is good like everyone else in the world
It 100% means it’s not true
You're is biased much more. Because you love GPT in any case, always.
I agree I have biases and thus objective information should be used rather than 'this is what I think'
Gn
It's not Photoshop, it's real.
Guys, this bro 👉 @radiant heron said:
- There is no other categories on LMArena, except WebDev.
- GPT 5.2 is a very good model.
- There is no difference between Opus 4 and Opus 4.5.
- There was a time when Grok 4.1 was THE BEST model according to LMArena leaderboard.
- I'm biased when I say GPT 5.2 is bad.
This conversation is closed. Short summary is a line above 👆It's impossible to continue.
Aight moving one
GPT is a very good model in some lenses whereas not good in others such as your perspective. 1. Is untrue, I didn't say this. 3. I didn't say this 4. I might've been wrong in this however grok4.1 was close at one point. 5. You are biased because you are human, we all are
#4 is true, on Nov 17th xAI released Grok 4.1 and grok-4.1-thinking and grok-4.1 where #1 and 2 on the text leaderboard with style control. This is what we report as #1 on arena. https://x.ai/news/grok-4-1
On Nov 18th Google released gemini-3-pro which reclaimed the #1 spot.
See I wasn't that wrong
Tysm
@keen beacon you're probably going to like this if you haven't seen it already https://www.youtube.com/watch?v=B9M4F_U1eEw
In today's body camera video, we're covering the arrest of Jason Killinger.
We are a news agency dedicated to delivering factual reporting on criminal investigations, public safety, and law enforcement procedures.
This video is a documentary intended to inform and educate viewers about real events of public concern.
It was produced for journa...
Guys how many seconds is the video generation pls
Has there been any visible improvement to 5.2 coding since release? Both xh and h are impossible rn
No it's benchmaxxed
This is simplebench but yeah everything is also benchmaxxed
How can I make videos 30 s free?
Hi! Please check #1397655624103493813
But only 11 seconds
Hi guys i just make an AI image and post to the server and they warn me T_T
I know im wrong
nano banana pro not working??
@jovial path please check #1397655624103493813
always error since yesterday
can you help me how to write proper prompts to make vertical 9:16 video from text prompts
It works very well for me.
@twin sonnet Hello, this topic is unrelated to this community. This server is for AI topics. Thank you
@daring rock hey , who are you?
He's moderator
hi
You're unable to extend the amount of time for the generations.
Hi everyone I'm new here ☺️
Gpt 5.2 would've gotten so much more love if they just called it something else, like gpt-exp or gpt-codex2
Worst model of late 2025?
11
25
2
GPT 5.2
Or gpt-benchmaxed 😅
Calling it 5.2 was a huge mistake because it's a downgrade in many ways
It’s O3 but with the GPT 5 database
Hmm. Different prompt? O3 loved to do that -> -> thing
I’m assuming that 5.2 was an experimental model that they were toying around with (diff architecture with new database) and then gemini 3 pro came out and they panicked, fine tuned it for like a week, and pushed it out
I’m sure they have a lot of experimental models like that
Probably so
O3 architecture + new database and 5.1 system prompt
It would explain the high hallucination rates and the xhigh modes
defo
You need to use the battle mode with a (google) account, for much more generous usage of models.
Iirc, battlemode+account has no ratelimits, but am not 100% sure of that.
gpt-5.2 is trivial to discover in battle mode (just ask the model for its exact name and version)
grok4 also is easy to identify
the others you can get by testing them in direct chat first and see how they answer
The responses are much more shorter is what I’ve seen
Still mid
benjimon franklin lol
why did "GPT-5.1 Codex Max" get removed from the leaderboard?
wasn't it the top for coding before it got removed
@echo aurora Why the Reve models were removed from LMArena?
I'm not sure, will look into and if it's something I can share I'll keep you updated.
Welcome welcome 
There isn't a way to get past rate limits 🙁
ok
Wth are these fakes benchmark?
??
where is gemini 3 in this benchmark
its from web.archive
but gpt-5.1 codex max was removed
wondering if it will be added back
its a fake benchmark in my opinion
these are fake in my opinion
its real
gemini 3 is absolutely better btw 5.1
still beat it
no its #1
nah
see the benchmark
these are fake believe me
fake benchmark
IIRC we had some latency issues on our end with that model and decided to take down. Would also note the moderators should only be pinged for moderator related issues (things that break our server rules). For questions/feedback you can directly come to me instead. 
bro test gemini 3 and this 5.1 max high
Okay, sorry, thank you, is their plans to add it back in the future?
i did, its better in my test
then u will say me if is better or no
what was test?
coding test
I believe so, but I'm not 100% sure sorry tosay.
prompt?
did u also test it?
hi
i dont see it on incognito
Dunno I somehow got it
Though on my normal tab I don't have it
Yo I got the same
But I closed the tab with it
I guess it's gamble based on a guest?