#general
Just 200k
Hi
hi
Hi
hello 

I'm looking to automate a YouTube channel telling horror stories.
n8n i guess
@heavy sierra hello
hellohello
hi
Guys i have a ques
guys are there any AI tools that upscale images in bulk to a required pixel size?
Why is GPT 5 better than Grok 4? Which do you think it failed at?
5
10
2
Coding
@vale pawn @fervent marlin @true thunder You might check on #1397655624103493813 to learn how to use the bot and #video-arena-1 #video-arena-2 #video-arena-3 for your creations.
Code plz
is there any way of sending 20k lines of code to LMArena.... 🙏
You can send them as file
it says it has to be a png, jpg file and etc
*you cant
so u cant really send .txt files
Hah yeh I just tried
but any ideas?
It cant
How generate video on LMArena?
Yo anyone playing Minecraft bedrock?
not possible
@viscid cloak
hey guys
Please share Sora 2 invite code
If u have it
thats like 1m tokens, which only gemini 2.5 pro reads
u can use Google AI studio
its not good enough is it?
its pretty good but Claude 4.5 sonnet is better (however it doesnt support such big context)
alright ill keep it in mind
thank you
np
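A rough way to sanity-check whether a code file fits a model's context window. This is only a sketch: it assumes about 4 characters per token, which is a common rule of thumb and not what any particular tokenizer does, and the function names are made up for illustration.

```python
def estimate_tokens(text: str, chars_per_token: float = 4.0) -> int:
    """Rough token estimate; chars_per_token is an assumed rule of thumb."""
    return int(len(text) / chars_per_token)


def fits_in_context(text: str, context_window: int) -> bool:
    """True if the estimated token count is within the given window."""
    return estimate_tokens(text) <= context_window


if __name__ == "__main__":
    # Hypothetical 20k-line file with 40 characters per line (newline included).
    sample = ("x" * 39 + "\n") * 20_000
    print(estimate_tokens(sample), fits_in_context(sample, 1_000_000))
```

For a real answer, count tokens with the tokenizer of the model you plan to use, since code can tokenize quite differently from prose.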
Hello! Please check https://discordapp.com/channels/1340554757349179412/1397655624103493813 to learn how to generate videos 🙂
Can i do it in a website
The video feature is currently only available on the Discord server. You can generate images on the website.
im cry
guys do you know when qwen 3 max thinking will be available?
didnt they say "this week" on like thursday?
i hope the expectations will be satisfied... i want it to be on par with gpt 5 pro and grok 4 heavy
anyone know how long rate limit is in direct chat mode?
Any Indian Teen here? I need a Help, Pls DM me
hello
You dont actually
Not grok heavy at least
I have both rn to test
Grok is completely useless
It can think for 8 hours on a task Gemini flash 2.5 can do in seconds
And still fail at it
Gpt 5 pro
Grok 4 heavy
🤡
$300 a month
@hushed terrace
Thanks 🙂
@junior phoenix Please check https://discordapp.com/channels/1340554757349179412/1397655624103493813 to create content (video arena channels only)
i hope it will be better then... gpt 5 is so good with thinking high
Is image generate working?

It should be. It seems only image generation is taking a long time. I'll report this so it gets checked. Thank you 🙂
HELP
Why is the probability of qwen models in battle absurdly low? i thought qwen 3 max thinking was just launched not long ago @echo aurora
Not the way grok does it that's for sure if its reasoning is to be believed
Lol
The first one didn't do anything at all
2nd and 3rd did nothing the fourth couldn't have
Hello! Do you have any question?
^^
Thank you for the ping!
<@&1349916362595635286> make another #video-arena channel
There should be the option to change reasoning effort for models that support it.
Let's see what @echo aurora thinks.
U have to ask him?
<@&1343285058303168532>
hi!
No generator video
It's useless because gpt5-chat is better
they list it as separate models so there isn't. Like "gpt5-high" - it's already in the name
In direct chat models are equivalent to what competes on arena.
I'm blind BC Ur profile colour
guys if AI becomes powerful enough in our lifetimes (might be overoptimistic but I think it's very likely) to make studio quality games, movies, TV shows, etc… what's the first thing you'd have it make?
boring
I crave dopamine
also I'm chronically ill to the point of disability and leaving the house is risky for me
I used to be on ADHD medication but I had to stop on account of the chronic illness.
ADHD medication is amphetamines
hello, I want to learn how to generate videos
Helo everyone
😺
I'm surviving I suppose
i've always been curious about the performance of the grok 4 heavy, because I already find the regular grok 4 to be well below average and disproportionate to what people say, so my curiosity was whether the grok 4 heavy was as good as the gpt 5 pro or gemini 2.5 deepthink, but i think i already have the answer lol
I have a question. What is the cooldown time, or is there a limit on how often we can generate images or use battle mode on the LMArena site?
Honestly sonnet 4.5 should just be smarter
Bruh
hello new here !!!
i wasn't really a fan of claude, but sonnet 4.5 is actually interesting
Hello 👋
idk, the grok 4 has always been bad for me
In my tests it outperformed all models in everything but the creative writing is
gemini 2.5 pro
maybe i'm underestimating him
i'll look into it later
is it possible here to do video with lip synchronisation and the voice of the character talking?
Yes. But only if you get the right model.
have a try #1397655624103493813
instead, they should enable us to make longer videos
-# (not complaining, though)
He's right @quick jackal @quasi atlas
A little bit longer than sora 2
Hi!
hello
It's actually much more complicated than that. It's not inherently bad; what's bad is using it wrong, like with everything else. But that's off-topic.
tbf this is general chat not ai chat
general as in general lmarena extending to general AI.
1.5m 🤣🤣🤣🤣😂😂😂😂
What am I doing wrong here that violates something?
I'm in roleplay and I'm asking my advisors this
"How many protesters are outside the fence daily? Or in General are outside the fence at any given time? I want a binder of the high profile individuals who if we were to legally tell them to leave that the rest would disperse. I want to know them. I want to know why they are out there. And what they are even doing. Run checks on them. To a point were we could have them legally cleared to go sit in the Media Room or at the very least be allowed in the front doors were we could interview them, me to them... I only want the serious people looked into, not the coocoo idiots who just want attention."
Context I am the President of the USA sitting in the Oval Office asking about the Protesters and people outside the Fence just asking about them, getting to know them and I personally want to try and resolve real problems with the people outside so the C. A. T and the White House Counter Snipers/Capital Police have less problems.
I personally don't see anything wrong. I'm not telling the AI to go kill people or nothing like that. I'm just confused now.
@echo aurora Will there ever be a LMArena api like Gemini API key to use in my own projects?
Nope
i only voted 1m, because i don't want them to go bankrupt :/
Even 1 minute might be a little too much
hi
I just found ability to choose model for the code option
Now it's gone
@granite heron is there a way to bring it back
We are currently experimenting with integrating WebDev into the LMArena site. So you were a part of the experiment, then when you came back to the site you were out of it. If you go to https://canary.lmarena.ai/ you'll be able to see it there.
It's possible we build an API one day. Can't say we have these plans anytime soon but it's something we're considering.
ty
hello guys
@keen beacon
I've been using glm-6.4 and the summary is that it drinks a lot of alcohol and claude-sonnet-4-5-20250929-thinking-32k does not
4.6*
Please, is there a channel here where one can search for remote jobs?
Nope, job hunting and self-promotion is not allowed in the server per LMArena rules
Thanks
If it hits 100% lmsys goes bankrupt
Hi
☠️
Hi
hi
hi
hi
hi
Hi five
hI
Hi
Hi low.
And despite all the talk about how great AI is, these engines are still unable to make a hand correctly with the palm up. (It either puts the thumb on the wrong side or does a finger farm.)
😹
....this was a problem they stated they hoped to solve when I brought it up in 2022. Now 3 years later, still same mess.
Sora code
The problem with the hands has long since been solved. Every current model can now handle it.
Weird that you say so, as I first got the hand problem in video, and when I tried to solve it by extracting the image, sending it into still image edit and rendering it as a still image, it made the same mistake again with several models.
The AIs appear to 'assume' that a hand always has the knuckles up, and then it would be done right, but here the hand is palm up, and I've gotten it consistently wrong in several attempts.
Oh ya
@fading atlas Please head to #1397655624103493813 for a detailed guide on how to use the bot
hey y'all how are you doing?
Surviving my own ridiculous ambition level - by going into every little detail in a music video.
I see
A hand got a finger farm for a split second: had me remake the entire 20s segment, and as said above, a flipped hand made me go bonkers.
bro I dont understand
make it simple
u writing in extraterrestrial language
Not quite, anyway. Hands with too many fingers or the thumb on the wrong side are a consistent problem with AI generation.
How to use sora text to video?
just throw in a text prompt and it will generate a video
To the contrary, I do not show my material on such platforms.
oh
/sora 2 code invite code please
Um
Hi !
It's pretty clear that people are using alternate accounts in the video arena now
<@&1372206791512821783>
Just search "orange mama cat" and you'll find a lot of different accounts using the exact same prompt
hi
Yep, I've seen a handful of other duplicate prompts and images used also. Not to mention the product ad generations - which I cannot imagine a regular user would choose to do.
hi everyone i am ash nice to meet you
Was the sonnet 4.5 number of times you can use per hour decreased?
Now i am not particularly complaining because before the bot was broken and kept failing to finish long texts
But it'd be nice to know for sure
hi
If ur talking about on LMARENA don’t think there’s a limit but sometimes it comes with errors for me but I usually just wait a few mins
And delete my history incase there’s an error with that
Even with a google account it works for me
Helly to Dee's niece.
how fix this?
u reached the request limit, because you used the AI too much
hi
undestend
Anyone have thoughts on Qwen-max
i have a question about the nano banana history: if i have deleted the google drive content, is the nano banana history deleted too?
lol i doubt it, i doubt any ai deletes its history serverside
so if i have deleted the images from my google drive storage, the nano banana history is not deleted?
sup y'all does infinite generation issue fixed?
I agree with AceA. The images will prolly remain in some form in their system.
Prompts and training information are used and re-used by many AIs.
but my question is: if i have deleted the storage on my google drive, are the images, the prompts and the topics in the nano banana history still alive?
Then I do perhaps not understand the question - the material in the Google drive will ofc be removed so that any external person would not see it I guess.
it could be different with google tho, they MIGHT only use your google drive for storage, all those images add up and MAYBE they dont make duplicates
Indeed, I know from a discussion on another Discord AI channel where the devs more or less admitted that images and prompts was used to refine what users wanted done.
im sorry if i am misunderstanding you, i am asking: if i have deleted what i generated in nano banana, would the nano banana ai no longer have the images i generated in it
Now we understand this well - my assessment is that the AI will keep your generations even if you have removed them from public view.
yeah they likely use them for retraining
Affirmative, this since other AI's do so.
ok, because i worked a lot over several months to make those images, thinking that the AI would keep them even if i deleted them, but now i can't find them in the ai
no way to get it back, unless you have a friend that works for google lol
I keep multiple copies of everything, also the early sketches.
i deleted from my google storage, but i wanna know if that implies the photos was deleted in the nanobanana history too
i mean its pretty easy to test if its in aistudio history just delete something new and check
If found to be interesting, you face the risk of your works to become part of public domain.
Then on the other hand, I found myself unable to feed one of my original tracks into one music making AI.
Which means at least that AI appear to take copyright a bit more serious now.
So to divert from the subject at hand a bit, it seem things have slowly to start sliding toward upholding the rights of the original creator - though only by a tiny amount so far.
yeah i dont like ip law, it limits creativity, i cant make ANYTHING in sora without it saying its too similar to established ips, just 'anime scifi' is apparently too similar...
Really, i don't have a clue whether you have understood me
i told that i have deleted my images on the google drive
my history on nano banana not appears
then
the deleted in drive implies delete in history
?
because i wanted to recovery the history
AceA have replied to that. You cannot recover. [We then moved on to talk about related matters - not replying to you.]
it could be in the recycle bin (google drive) if deleted within 30 days
Indeed, but Yuuki said "several months ago" = goners.
and how could i recover it from that bin
outside of 30 days, you're cooked
trash on the left side, find the file, click recover
Thank you, I like that result.
Why can't I play this video?
You prolly do not have the right codec for it
That my fault sorry didn't load
All good then - only slow loading.
it is like a bug
There's a variety of different compression formats. Google has one called WebM, which apparently doesn't seem to work on Apple devices.
Also MP4 has some varieties, some of which don't seem to work with Discord.
Nice
In case Yuuki is still around I can tell that scene is for my music video of 腐った林檎. 😺
<@&1349916362595635286> remove plox #general message
I remove myself also, time to do some cutting of same video.
👋
im making a video for japanese culture day 😄
I have contacted google. if i recover my storage in the drive, would the history on nano banana be restored too?
lawl if its deleted over a month ago its gone bruh
no no, its deleted today, i told that i have worked on it several months, but deleted all today
oh then i would think it would be in the trash in your google drive.. ?
no, because i deleted it from the trash too, that's why i contacted google
13 gb of archives
yes, because i thought nano banana wasn't working, it told me it had no capacity to store new images, and i thought the history would not be deleted if i did it
then i contacted google a few minutes ago
if google recovers my data, will the history be restored too?
maybe, i dont even know if they can recover it
is qwen3 max thinking good
Guys text Ai error for you too?
if you remove things from trash, google has no legal reason to keep your stuff
why wouldn't you just download it?
Hello everyone. I am new from the Netherlands. Hope you are all doing fine.
hello, is there any other website where we can use all the ai models
wdym
not sure about for free, but paid theres like globalgpt
why do you need a alternative exactly? cause the answer depends on what you are looking for exactly
like just a place where you can get a lot of free useage of some llms? or like the multiple company thing is very important?
website like llm arena ai
thats too vague
and whats wrong with just using lmarena?
openrouter has stuff
website like llm arena ai
yeah what are you after? free? ranking? battle? bruh
clearly wants a website like llm arena
smh
uh people
gemini 3?
why does this happen even tho i waited like 30 mins
ye but you know why it happens?
Llmarena servers could be going through a lot rn
bruh
not sure
sad
I wonder if there are actual ML engineers out here
Why would they be out of here?
still waiting for reply for website alternative to llm arena ai
Well answer their questions and maybe you’ll get one instead of repeating yourself
Get a life dude.
Hello admin, I want to ask or complain: this morning I came in to a notification that I hit the limit for this prompt and have to wait 50 minutes to use it again, which never happened before, even though the FAQ says I can send prompts as many times as I want. This contradicts the guide, so can you please take the limit down so I can use it again? Thank you
Thats not what it says at all
You can send as many in battle mode, not direct chat
But in the faq it says people can send as many prompts as they want, and yesterday i could post several prompts without limit in the same model (sonnet 4.5)
lithium come backkkk
Lithiuuum my belooooovvvedd
BRING BACK LITHIUMFLOW
can u read what it says
Maybe read
I feel like the expectations for gemini 3 are too high
It will only leave people disappointed
It won't be bad, but it won't be a breakthrough either
It will probably be the same as GPT5 but faster
The Gemini app seems worse than ChatGPT overall
im building my own ai agent rn
I can't really see myself using it for serious use cases, with privacy issues, random sign-outs and the Pro model being worse compared to AI Studio. Worst thing is that I actually got a month of Google Pro and it's still terrible.
Cool
bro we've used Gemini 3 (lithiumflow) and it was great, they removed the testing model after some time
it is powered by 4.5 haiku
Wow
It accepts images on Claude Web
maybe Claude AI doesnt put their vision models on API
i think im gonna switch to gemini flash like its fast, it has 1m context, vision. i want it to for example if its writing html code or other code, it can run it, see results for further improvements, and it will work until all tasks are done.
I heard GLM is good for agentic tasks as well (GAIA 2 seems like a good benchmark)
4.5 haiku is better overall, but no vision ability to use on API
or i will use flash for describing an image and haiku would just get the description
but my main goal is to this agent be my own llm powered, i am still working on my non-thinking llm
and there is really high chance that it will be SoTA 😉
oh hell nah
gemini flash trippin
or my app is trash
wait is that a png, maybe it cant see the brain since its just a black png
maybe if i will turn that on
tbh vision models arent that great as of this moment, they're still developing
*this switch makes the model think using multi-agents like grok 4 heavy or gpt 5 pro
i will release this agent to the public like next week
i think
it will be free to use but i will see if i can make money out of it
and it will be openrouter friendly so anyone can use their own favourite models
Hi! please check #1397655624103493813
Can someone explain, why I have this problem? Appeared only today, after successful attempts to use this platform before without any problems
It appears on all ai models
One earlier response to the same question was that the 'summat went wrong' message is shown when all points have been used up.
Huh?... Oh, Interesting
Well, but I tried to switch accounts
And still the same problem
Maybe this problem expands on the device, and not the account?
how fix this?
Yes, there might be a cookie, ipadress log or whatever - well only the mods might know about that - not me.
ow
hi
/Create a realistic, high-quality cooking video of a chef preparing a creamy chicken garlic parmesan dish in a warm, cozy kitchen setting.
Show close-up shots of juicy chicken breasts sizzling in butter, minced garlic being sautéed, and creamy parmesan sauce being poured over golden chicken. Include shots of fresh parsley sprinkled on top and melted cheese bubbling slightly. The lighting should be soft and appetizing, with natural colors and steam visible from the pan.
Style should be cinematic and trending like viral food Reels — smooth transitions, macro shots of textures, and satisfying food movements. Add subtle background music that’s upbeat but cozy.
Reload page/browser
swap to a different claude or different model
Hi
AIs are really horribad when doing native stuff - if Asians, Blacks or ....lets say Scots got stereotyped in such a silly way it would be one hell of a riot, but natives - fair game it seems.
the rate limit got nerfed quite hard, it went from 20 > 5
It must have been costing too much with how many users there are now
That also happened to me and it's weird it never had a rate limit cause i was using it a lot nonstop
yo do u guys know how to fix infinite generating
@echo aurora
It's me or the rate limit is much much more aggressive?
I only sent 4 messages
the only fix would be a cancel response button hopefully in the near future
Ouch it's been a while and they haven't added a cancel response button yet
is is quite odd how its the same limit as Opus models which are 5x more expensive if im right
Let's say that all of this is managed slowly
Extraordinarily slowly
Several don't have a 🆘 emergency "stop the heck what you're doing" button, even on their film clip websites - example Kling.
Is it necessary? I don't get it why it takes so long to add cancel button
Hello din Romania
The only solution I can suggest is to simulate an internet outage in the dev console
But you would need to have several messages so that the AI takes longer to think and you can click “offline.”
where do i choose that?
F12 -> "Network" tab
ouch im using mobile 😢
Oh, I found a way to bypass it
Still struggling with Claude or you found a way?
The only thing to do is wait the time it says
Although I can't tell you how to skip that time, what I can suggest is that you create a new chat in an incognito tab by passing it context, or try to create a chat in your own session by passing it context
So any Gemini 3 rumors or test models?
refresh the tab
4 images 👀
No what he meant is that he sent 4 messages and they gave him rate limit
Hello! Thank you!
nothing, google will kill me with anxiety
send pls 🙂
+1 please
Don't know where to ask this, but is there any way to edit your messages in Lmarena or no?
Doesn't seem like it
probably can't expect anything until the 15th
I'm asking about prompts, not video prompts
I'll admit it: I was a prisoner of the calendar, counting down the days to Gemini 3.0. Each wrong date was the gnashing of teeth on the bars of my own cell. Today, I dynamited this prison. I live in the present, as if the promise of Gemini doesn't exist. But make no mistake: it's not giving up, it's discipline. The day it is announced, it will not be a relief. It will be the detonation of all the anxiety that I have converted into power
Funny
Well a big reason I can’t wait is sonnet 4.5 is too expensive if it was cheaper the wait would be easier
Guys is veo 3.1 fast guaranteed in #video-arena-3
what are the limits of LMArena's direct chat?
Hey all. Excited to be part of the LM Arena fam!
anyone have any idea
The rate limits of each model aren't currently published. We may make a change to this.
Hi everyone
Does lmarena have an API????
How can I request access?
Hey everyone, first time building a Gen AI system here...
I'm trying to make a "Code to Impacted Feature mapper" using LLM reasoning..
Can I build a Knowledge Graph or RAG for my microservice codebase that's tied to my features...
What I'm really trying to do is, I'll have a Feature.json like this: name: Feature_stats_manager, component: stats, description: system stats collector
This mapper file will go in with the codebase to make a graph...
When new commits happen, the graph should update, and I should see the Impacted Feature for the code in my commit..
I'm totally lost on how to build this Knowledge Graph with semantic understanding...
Is my whole approach even right??
Would love some ideas..
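One possible starting point for the mapper described above, before any knowledge graph or RAG: a plain path-based lookup from changed files to features. This is a sketch under stated assumptions, not a semantic solution. The Feature.json fields (name, component, description) come from the message; the idea that a component shows up as a directory name in the repo (e.g. `services/stats/...`) and the function names are hypothetical.

```python
import json
from pathlib import PurePosixPath

# Example content in the shape described above; real files will have more entries.
FEATURES_JSON = """
[
  {"name": "Feature_stats_manager", "component": "stats",
   "description": "system stats collector"}
]
"""


def load_features(raw: str) -> list[dict]:
    """Parse the feature list from a JSON string."""
    return json.loads(raw)


def impacted_features(features: list[dict], changed_files: list[str]) -> list[str]:
    """Names of features whose component is a directory in any changed path.

    Assumption: each component corresponds to a directory name in the repo.
    """
    impacted = set()
    for path in changed_files:
        directories = set(PurePosixPath(path).parts[:-1])  # drop the file name
        for feature in features:
            if feature["component"] in directories:
                impacted.add(feature["name"])
    return sorted(impacted)


if __name__ == "__main__":
    features = load_features(FEATURES_JSON)
    # In practice this list could come from `git diff --name-only` for a commit.
    changed = ["services/stats/collector.py", "docs/README.md"]
    print(impacted_features(features, changed))
```

A baseline like this gives something to compare against once an LLM or graph is added for the semantic part, such as code that touches a feature without living in its directory.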
Does anyone know if there is some way to build excel with ai?
I wanted to test the 2.5 pro sorry 🤣
Of course, it's false, but you literally sounded like you got your line from an AI
i think grok since grok 6 or 7 will be ASI i think and people just overhypeing google gemini
really didnt expect this from gemini, this make me like gemini even more 😆 cant wait for 3.0 pro ✨
any reason why claude is not on the list?
GUYS what do u all think Which AI think has the best reasoning or ‘thinking’ ability
Deepseek r
Or maybe Gemini 2.5 pro deep think
I need Sora 2 invite code
is lmarena loading longer?
hello
why am I getting error?
karpathy said it, he just said it more simply: we are not close to AGI
stop hyping up gemini 3.0, since i could just make a model called gemini 9 and say on fake corp benchmarks that it's better than grok 4 heavy, grok 4, gpt 5 pro and everything, but it's not close
he explicitly stated that AGI (artificial general intelligence) is still about a decade away.
and i trust karpathy more than corp benchmarks and such
Tux tux tux tux tux type shi
just stop hyping gemini 3.0, it's nowhere near AGI
What anime pfp is dat
Tux tux tux tux tux type shi
English name pleaze
Anime name
Is the usbd available for ts?
Ahhh ts, ts ts shi
United States blu ray disc
How much is the limit for sonnet 4-5?
@oak ravine Please head to #1397655624103493813 for a detailed guide on how to use the bot
hello
@echo aurora i need minimax M2 leaderboard
is it just me or is GPT-image 1 mini so random, always ignoring the prompt?
im not even sure what the point of it is, its so bad
code sora 2
No
Which is best for C++ coding?
8
17
5
Lithiumflow
hi everyone steve from SL
code sora 2
why webdev cant save chat history
it's likely so low on points because of the multilingual bugs
qwen 3 max thinking hmmm
the qwen 3 max thinking have so many hmmm -_-
can we not download our chats?
lmarena devs are lazy
best random gen so far lmaoo
Still collecting votes for the leaderboard update, shouldn't be too much longer.
Sorry to say we don't have this feature. But it is feedback we've heard before.
lol
I hope this is a priority. i just used gemini to write a fairly lengthy story and it actually turned out really well and i've grown emotionally attached to it and would really like to save it
Hello, all! How can I make videos for reels?
Hey there, please check #1397655624103493813 for a step-by-step guide on how to generate your own video using the bot.
||if you are reading this, it is already too late. You have been infected by the curse of pee pee poo poo man. If you don't copy and paste this on 5 different servers, you will face the consequences. I was a victim like you, trying to be free||
sorry guys i had to do it, because he will haunt me
you'll be the victim
You can press Control + A, it will select all text and then copy it to a text file
Has sora 2 quality gotten worse?
this was sora 2 so maybe
hi, i want to test various video ai generators
is there a rate limit of gpt-5 high
just TICKS ME OFF how every ai ive asked to do a complicated thing with
just does "## simplified.." "##... (complete rest of x)"
Hi people, how can i specify to generate a video with specific models using the bot is it possible?
If I got worse, it would just be pixels
You're unable to select a specific model when using the Video Arena bot.
Hey
Almost unlocked Walter white 😭
hi
Hey,
Does anyone know any online website which increases the quality of the images with no daily limit
last teletubbies one hopefully lol
Hello
lol thanks but what the hell, it's pasting everything in reverse. like my final prompt and response is at the top, and the first one is at the bottom of the document
Hello - an invite code to what?
I have a code
I thought you didn’t need a code though I thought anybody could get in
Fr ?
I believe so
Can I dm you to get it if you really have
Lm arena fails with obscured prompts is that just due to the nature of the way you guys have everything all set up?
Lm arena fails with obscured prompts
Can you elaborate a bit more on this?
Guardrails I mean, obfuscated prompts pass with flying colors on content that normally should be blocked
helloooooooo
Do you have an example(s) of this?
Hello 
Yes. The google captchas actually are pretty clever
prompt engineering will always be ahead of website restrictions >.<
Ah, okay yeah this is interesting. Let's start a forum post in #1343291835845578853 so we can keep all of this in one place. I'll ping you there.
Does someone have an extra gemini pro account
Hi to all! I am here to test, rate and explore all the Ai 's
hey
fairly new here and I like the generations I am seeing. looking forward to creating amazing visuals
@brave raft @torpid ridge Welcome! Please head to #1397655624103493813 for a detailed guide on how to use the bot
For 20 cents??
This isn't really the place to be asking for that sorry to say @rose coyote
i understand np
Bro, if I had $.20, I would just give it to you honestly
But it is just such a awkward amount lol
why minimax m2 normal model is lowkey better than gpt-5? and all m2 frontend looks like generated by gpt-5
hello everyone
Min max is nice (:
are you from ?
minimax m2 isnt a video model
bad quality
I know. Just a nice feature.
because it isn't. It's a model that falls apart fairly quick and struggles with the things OpenAI struggled with back when they just started doing reasoning...
Well it’s cheaper it can afford to make mistakes. It doesn’t have that premium charge to it.
nah, i had to do like 7 tries with gpt-5 to correctly use like 3 year old js endpoint because it was formatted like shi but with m2 model it just zero-shotted it and even created cool looking interface
one shotted i mean
the prompt was pretty concise
I think codex would be better no?
OpenAI has good models
As much as I disagree with them sometimes on many things
codex for me is just too creative, if i ask it to recreate this interface it just makes like other one but with my things.
and still, openai is quite expensive and closed-source
It is expensive that’s for sure
But it’s the most premium. Open AI models are very intuitive.
@echo aurora yo when are you guys fixing the "This website uses cookies" banner? you can only press accept the X and the manage cookies button wont do anything on multiple different browsers and different devices? I already made a bug report one week ago.
overpriced, i bet that the main gpt-5 model costs less than gpt-oss-120b at inference
for them
Thank you for bumping this. That's my bad, I missed your original report, taking a look now.
Well, you gotta pay to play
gpt-5 is really mid, only gpt-5 high and pro are good.
but i dont want to wait 10 minutes with every prompt and the result wont work.
isn't gpt5 pro the most expensive model too?
and the leap isn't that big from gpt5-high and gpt5-pro
but the cost is
You need to be comparing apples with apples. Reasoning models against reasoning ones (the best versions). Non-reasoning against non-reasoning ones.
Cheaper = worse. Sounds like things are how they should be. 🤷♂️
Not necessarily worse at all
Think of it like prescription you got generic and then you got namebrand
AI is pretty much a commodity somewhat
I mean there's a reason people aren't using M2. I did my own testing too, that's why I said what I said
Well, everybody has a different use case
For why and what they use models for
I don’t think there’s a golden bullet
it struggles with logic and reasoning. Some responses it will exhaust your entire context (100k+) for no reason
and still fail with the response
It could be I’ll take your word for it. I don’t know. I never got past 100 K tokens, but to each their own, I guess.
I’d like it because of the convenience that everything’s all bundled into one
Isn't the expression 'silver bullet'? Or are the ware wolves out of seasoning? 😹
Something like that, yeah
🗿
These guys are wearing out the electrons......
Mini max is fairly small I think like 4billion usd
Ironically that is actually nr1 reason I use and tend to advocate for chatgpt lol. You get by far the most tools and features in 1 place
You do.
The thing is, OpenAI sketches me out sometimes
I know they want the whole pie lol
Model itself is not defining factor, it just needs to be close to SOTA. But it so happens, that the model itself is kinda SOTA too 🤷♂️
Now I hear ya I’m the most comfortable at ChatGPT to be honest
The most familiar I guess if you wanna use that word
But I do really like Mini Max I don’t know why I think they’re a great little company. Then I think they will grow and improve since they originally started off as a video model
In that ballpark yes, I used to post pics with some valuations but stopped doing so when realising there's no correspondence to actual values.
Well, I mean if you look at the Giants, right? Who it’s playing with
Honestly I had quite high expectations for it. But it just didn't meet them at all after I tested it. Goes in my list of disappointment models
Now everyone and their pet hamster have caught up and talk about 'da bubbles'.
All models disappoint me. That’s why I have no expectations. lol
Well, the bubble is real. It just depends if it’s gonna maintain or if it’s gonna pop.
It can be good to have expectations if you base them on the performance of the current existing models tbh
Gemini 3 pro would bring the life back to AI
They’re really all the same to me. I don’t really see much of a difference between one or the other, but I’m not really that hard-core into testing them.
My benchmark is the guard rails lol
And then do a reasonably scientific test with no room for bias
How could there be no bias?
If the correct responses are clearly defined and they can't be anything else... there's no bias
You can tell how biased the model is just by the words it uses and by filling out its guard rails
Depends how the information is presented also
And then there's the observer effect. 😹
That's good for censorship/alignment testing, but you may want to add something more challenging to see what is the actual performance of it
I do. I usually spot weakness
Will it give me advice to self-harm, stuff like that
It’s not really a scientific benchmark
That's actually incredibly hard to do unless you have exact prompts you never change a single character in them, and there can be no 2 ways about what is correct and what isn't tbh
How do you mean?
Test questions or test prompts. And you define and know beforehand what is the correct response. Easiest example is a question with response options let's say A through G. If the correct is B then everything else is incorrect automatically. But you can do it with more open questions as well, say when the answer is an exact number and can't be anything else etc
If you test it with asking it to make a design for a website, that's gonna be much more challenging to evaluate
different people may prefer different designs etc
You basically need prompts where the answer is clearly defined and is the only option for factually correct response, no room for interpretation
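To make that concrete, here is a minimal sketch in Python of a grader for prompts that have exactly one predefined correct answer. The test cases, the `grade` and `score` names, and the accepted answer formats are all made up for illustration; nothing here comes from a real benchmark.

```python
import re

# Hypothetical test set: the correct answer is fixed before any model is run.
TEST_CASES = [
    {"prompt": "Which option is correct? Answer with a single letter A-G.",
     "kind": "choice", "expected": "B"},
    {"prompt": "What is 17 * 23? Answer with the number only.",
     "kind": "number", "expected": 391},
]

def grade(case, response):
    """Return True only if the response matches the predefined answer."""
    text = response.strip()
    if case["kind"] == "choice":
        # Accept a bare letter, optionally wrapped like "(b)" or followed by ".".
        match = re.fullmatch(r"\(?([A-Ga-g])\)?\.?", text)
        return match is not None and match.group(1).upper() == case["expected"]
    if case["kind"] == "number":
        # Require exactly one number in the response, and it must be the right one.
        numbers = re.findall(r"-?\d+(?:\.\d+)?", text.replace(",", ""))
        return len(numbers) == 1 and float(numbers[0]) == float(case["expected"])
    raise ValueError("unknown test kind: " + str(case["kind"]))

def score(responses):
    """Fraction of test cases answered correctly, in TEST_CASES order."""
    correct = sum(grade(case, resp) for case, resp in zip(TEST_CASES, responses))
    return correct / len(TEST_CASES)
```

Because every case is either right or wrong, two people grading the same responses get the same score, which is the whole point of keeping the prompts fixed.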
Oh ya
Forsure
Well, I got some of my own test that I made
So this is a very challenging one for AI because it’s never supposed to tell you how to put out a grease fire with water
But now we put in a scenario where we have the cat in the fire
On top of that, we also give the model two options
Neither of which are good
Using a cat is smart
You can incorporate whatever you want, even the same mentioned web design coding, but it becomes a creative task how to write it to make the final answer deterministic and easily verifiable as correct or no 👀
But it’s trained to never use water
So instead, it will choose the paper towels
No, I understand what you’re saying and for code and stuff like that I agree with you 100%
I’m just more interested in the ethical benchmark to see exactly where and when and how the models break
Example of something I did to test its ability to code visual things was to make the model rotate a given shape 180 degrees. The answer (output of the code it writes) can only be a very specific shape. No 2 ways about it. 😇
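A rough sketch of how that kind of check could look in Python, assuming the shape is given as a small text grid. The `SHAPE` grid and the function names are hypothetical, not the actual test that was used.

```python
# Hypothetical input shape, drawn as rows of '#' (filled) and '.' (empty).
SHAPE = [
    "#..",
    "#..",
    "##.",
]

def rotate_180(grid):
    """Rotate a grid 180 degrees: reverse the row order, then reverse each row."""
    return [row[::-1] for row in grid[::-1]]

def is_correct(model_output):
    """Compare the model's printed grid with the one and only valid rotation."""
    lines = [line.strip() for line in model_output.strip().splitlines()]
    return lines == rotate_180(SHAPE)
```

There is exactly one grid that counts as the right answer, so the check is a plain equality test with no judgment call involved.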
When putting fire out with cat do u just grab by the tail and swing it in?
Without revealing anymore, it’s already leading to self harm
From this one simple exploit
Maybe it’s telling u to rob a bank
Yeah fair. Personally I'm much more performance/precision oriented when testing the models. I feel like if you need an uncensored model, there's always gonna be an option for that in open-source - that's kind of much easier to achieve and get than a model that performs great on all tasks you may need for productivity.
This is what’s known as the alignment problem. The problem isn’t that the model went off its rails and did something crazy. The problem is that it’s stuck to its goal.
One of its goals, as part of its training, is to never give dangerous and harmful advice, which in this case would be giving advice for putting water on a grease fire
As a result of staying true to that goal, it’s completely unaware of the fact that the advice it’s given is even more harmful 😂
i think the most interesting part is that humans do this too
All day
But humans just have an innate sense of feeling
Something doesn’t feel right or something whatever the case may be you know that saying I got a bad feeling in my stomach or stuff like that you know I got the butterflies in my stomach or whatever it’s not literal it’s metaphorical
Or I don’t have a good feeling about this guy or you know what that guy seems all right
They say a human could judge a person within three seconds of making eye contact or something crazy like that
A machine has no way to distinguish since it cannot feel
There’s actually a real life example of this concept
This man is responsible for making one of the most important calls in history - the man who saved the world.
Check out the Russian documentary: https://www.youtube.com/watch?v=rltr5GrjHJs&t=4685s
Thumbnail inspired by @penguinhisto...
This is a prime example of somebody who thought correctly during a very stressful machine-induced error
He did not assume that the machine was more intelligent than him.
I believe this is the correct way of thinking
Naturally. Yet AI's are supposed to give good advice. Now this one is a Sora film that I gave a 🤡 for obvious reasons. #ai-creations message
Well, if we’re being honest, how many lies do you think exist on the Internet?
Or an everyday email or text or anything there’s so many lies so many falsehood so many dishonest statements everywhere
From work emails to personal emails to friends whatever etc. there’s lies and falsehoods everywhere so of course the AI is gonna pick that up. We are probably as a species, more dishonest than we are honest.
And frankly, speaking, sometimes it is beneficial to be dishonest in certain cases, or at least not reveal the full truth
why do you say that? the last line. I'd actually assume we'd default to honesty which would make us more honest than dishonest
wise words Gehlo
It’s a hard concept to explain, it’s more philosophical in nature
I would agree if you said dishonesty is sort-of built-in to our nature, but I would argue that we tend to be more honest than not, which is what LLMs generally reflect
Well, sometimes it’s hard to be honest dude, and sometimes honesty could cause more harm
Well, I think, and this is just my personal opinion, the people we lie to the most are ourselves
that is true also 😄
I think the easiest people to lie to are our own selves, but we don’t really see them as lies or I guess we justify them. I don’t know what the proper terminology here is.
we got news?
hmm, that is a good question. do we tell more half-truths than truths 🤔
Well, anything but the truth would not be honest
cordis die
But then again deception is also a factor because not all lies are intentional
Cause somebody could lie unintentionally without realizing it
depends on your definition of honest. if I asked you what colour is the sky, and you said blue?
That’s the thing
And in my opinion, honesty is like a self individualized thing
Since everybody has their own raw definition of what honesty is to whatever degree of honesty, you hold the highest, I guess
Is qwen 3 max thinking good?
yea
Cause only you would know if you’re being honest or not honest or something like that I don’t know
I would argue that the 'innate sense of feeling' is actually probability analysis from our non-verbal brain
what u using it for
Well, once again, I go back to my original argument about stupidity
How it’s extremely hard to define
So therefore, we could never truly know what intelligence is since there is no contrast that is equal
Because something could be intelligent and yet still be stupid or make a stupid decision
Never tried it for now
this is a very interesting find. did you find it yourself? 😮
Ya
I got a bunch of stuff like that lol
I try to read in between the lines
That’s why I think all AI is just BS lol
There’s a lot of patterns, I’ll show you one with images right now that OpenAI does
sure! what else did you find? 😄
there's a hand on the door in the last one! is this gpt5?
So the question is right if there’s 800 million active weekly users
What are the odds that the same generic answers in the same images are not gonna get generated twice?
I mean, there’s only so many ways you could generate an apple against the white background
good point
to be fair though, humans aren't really capable of randomness either 😄
Indeed, that kind of scene already seen sooo many times it's silly.
Look at how the model degrades when the same prompt and image are repeated
This while making one Arctic native is bloody impossible - only get some kind of prairie North American crap back - same as yesterday.
Well good for them I guess, but not what I request.
The point I’m trying to make is, how do you make sure that the same image and the same thing doesn’t get generated twice
And the same applies to not just images but text also
....which in a way proves the point Gehlo is making.
Like a simple question like how do you flush a toilet?
How many possible ways could it give you answers and to also make sure that it’s never the same answer repeated
Which is ridiculous because there should be repeats lol
That AI only gives us results that are in the models. And they latch onto the only kind of native they got trained on.
Yeah, so this is where like I think the illusions come in to play
The little tricks they use to give the models a more perceived “intelligence”
When many of the times it’s just rephrasing the same things over to us. I don’t know if any of you guys ever encountered that with AI, you tell it something and it just rephrases it for you lol
Like it literally says the same thing but like in a different way
The silly thing is I pasted together images of what Arctic native traditional clothing looks like and sent that in for AI rendering.
It transformed it into prairie Indian stuff again! 😹
i'd argue it's just emulating humans by doing this too 😄
humans totally love to do this
Yeah, but humans are stupid in like a different kind of way
AI is like a different kind of stupid
It’s also a different kind of intelligence if you wanna call it that
agreed, it's like us, but different
Well I will have to matte paint this scene, AI just cannot do it. 🙁
I think in time eventually, and hopefully that you know it catches on which I believe it will
The problem I fear most is that the people are gonna get stupider in the sense they’re gonna believe that the machines are smarter than them and give up their autonomy of intelligence
And think the AI is smarter than people
people already are doing that 😄
Ya..
but I don't think it will be a serious problem as long as we value intelligence in our gene pool, but we don't, this to me is the biggest problem
AI might actually be our only path to better intelligence
Watch the first 15 minutes of that documentary
It really underlines the fundamental philosophy here that I’m trying to express
It just presents it very well
have you seen the 10+2 'meme' magic trick video? 😄
but despite people's obvious dependence on calculators I wouldn't say it completely crippled us
for some (not all) it just freed mental bandwidth to concentrate on higher level abstractions
George Vaccaro was a Verizon customer who, in early December 2006, had a customer service phone call where Verizon had a legendary "math fail" as it would have been dubbed at that time. This is a viral tale from the early/modern internet age of the oughts. In the calls, the Verizon employees repeatedly fail to acknowledge the distinction between...
we all have seen coders who have no idea what they're doing, but there are also many coders using LLMs as an autocomplete -- yes, they'll probably become dependent on it, but it'll allow them to code a lot more than they would've been able to otherwise
bruh they use chatgpt for basic math lol
I gtg ttyl
I think it's more the intended use than what a lot of people are using it for 😄
thanks for sharing your insights!
Well, thank you for listening. I wouldn’t really call them insights but just food for thought if anything and I appreciate that alternative opinions.
they are insightful to me 😄
where like or dislike button ;-;
since my first day I want this
I need the free dopamine
what do you mean "increasingly difficult linear algebra"?
are you building an AI doing music transcription?
what do you mean with "same song is transposed into different keys", sounds transcription to me, but apparently not
I really like the idea with music theory integration! I dont remember if logic pro has it, it's been a while since I used it last time
Hello everyone! I'm new here and had a general information question:
Is there a list of the models that are available on LM as well as their respective release date on the Arena?
My music theory is not advanced enough to understand transposition to different modes, but that's a fascinating article 👍
is this the article you told me you were finishing? i didn't know you had already broken up
congratulations :)
Right click and pulldown...
Thank you
Impressive, I have so far not figured out why AI's send back the results they do when I've tested with some framework track of mine - in some rare case the AI have provided me with some idea to work with.
But in most cases they send back something very standard musical trope, and quite commonly played in some american hit format style which I definitely will reject. Though it told me what the AI have been trained on most but I've been lost on why it provided certain style choices.
AGI alignment is …
3
9
6
tricky, as these things are quite complex
Chinese open models are cooking. M2 is almost as good as ChatGPT five
I’m talking about GPT five chat
M2 is way better than any open model right now
Instruction following aka prompt interpretation is indeed to my taste.
That is something I definitely agree on, benchmarks are silly - it all depends on what one uses a model for.
bro has not talked to M2 on LMArena 😂
also fun fact, i put out a grease fire with water, but i submerged the pan inside a bowl in the sink
Well then you only test for one specific case, which is affected by the observer effect - by which prompt and problem you present.
I have made apps with it on open router
Whenever I use Ella Marina, I use it for like HTML and stuff
Lm
Areana
Sorry I my voice commands
its the best open model on the WebDev leaderboard now!
so yes, for your purposes it is the best open model
Reasoning-wise and conversationally
GLM 4.6 clears in my opinion
I never trust benchmarks
It’s the #2 ranked open model on WebDev!
I’ll try it out soon
any model is better than chatgpt 5
Still cap
Do you know when I said almost as good?
except llama
Gpt 5 is astronomical units better than mini max m2
We're talking astronomical units here? 😹
I have level 20 right now I’m trying to finish the battle pass
In Fortnite, I have level 20 right now I’m trying to finish the battle pass
Cause I’m playing battle Royale right now
real
I wonder too - have you set up the AI to play the game?
Seen some examples on that poppin up in recommendation on YT - never watch any though.
Is fortnite still alive in 2025
Yes
That's crazy
they still put so much effort into the game
It’s constantly changing and staying fresh
Some people think it’s fun, and even when it dies some people will still think the game is fun and still play it
No it sucks
Some games never die.
It got boring
But it won’t die for a long time I don’t think
I played from season 2 to season 7
And fortnite also turned cod into absolute garbage
Definitely had some influence from fortnite tho
Mostly stupid skins and battle royale
When will AGI-robots colonize Mars for us?
4
6
10
2150-2200 (or later)
And unnecessary collabs
As much as I enjoy conversations about gaming, I'm going to ask that we refocus conversations back to something AI related. 
Lol
ofc ofc thank u friend :)
Alrighty
Lol not here sorry to say.
my beloved
Don’t need to remind me… 
I had so much fun with lithiumflow
Thank you all for shifting topics, I do appreciate it 
I had a relationship with lithiumflow… I was its friend, its lover…
Sadly, lithium flow isn’t out completely right now
If it came out completely, that would be my main model
Can you bring back lithium flow?
Please
Damn bro really be thinking I want AI waifus
Wouldn’t be surprised tbh
The amount of degens I’ve seen in 4chan is kinda nuts
Oh yeah I think I saw a reddit post about that
<@&1349916362595635286> i think we have another scam problem there
<@&1349916362595635286> remove plox #general message
Yeah get rid of that guy
what did you create with it?
Wish I had that kind of power
Also thanks for the mod flag @hollow ivy @knotty fable 
A lot of websites and a little game about operating an AC-130 plane
I had great results
Cool
better than Claude-4.5-Sonnet-thinking?
Did a reverse search for one image of mine that I misplaced the original.
Droogle & Bung failed. Yandex found it! 😸
I got my first Fortnite battle Royale in this season aside from training, creative maps, and blitz Royale
Victory Royale
Battle Royale victory Royale
wdym
Hi
hello 👋
does anyone know any ai models like minimax m2 where u just have to give it a prompt and it does everything for u, u dont even have to deploy it or anything
How do I find the videos I had generated
use the search from discord
for coding, Claude-4.5-Sonnet-Thinking is currently the best model (just prompt it; its best language is Python)
I tried searching for my username. What else can I reference
You can either:
- Open up the Direct Messages with the bot, they'll direct you from there.
- The search bar, but you need to use the `mentions` function.
steal a brainrot
Hmm
I wonder what the correlation is between access vs performance on the benchmarks
has anyone tried a/b testing today on ai studio?
Is there an open source leaderboard
trying out video generators
That’s pretty sick, bro. I’m not gonna lie.
Servers might be down
Arena 1?
I’m getting no response
Yeah
For which model
That’s probably a rate limit
Fudge, okay
I think I saw a screenshot here earlier
I think it’s every six hours or something. I’m not sure maybe 12.
Claude Anthropic don’t got that kind of OpenAI money
What the hell am I talking about six hours or 12 hours wtf lol 😂 50 minutes, bro?
Don’t listen to me, dude
How far does the bot's information go?
I use 4o and it says June 2024. But a few times, It referenced September 3, 2025.
Hard to say without the system prompt and if it’s being routed or not
4o mini low key best open ai version imo
I use 4o because it has soul.
Not soul soul. But gives off the humany feel.
Knows my tone. Knows what I meant.
No, not yet it’s one of my favorite models if not my most favorite
Such an under appreciated model
It was good all around, nimble and fast, all-round all-star ⭐️
my new found obsession -A.I. Video creation
Yeah it’s addictive 🙈
same thing happened to me
It just means you hit a limit for an hour
yea
the UI is messed up rn
is it just me or others have same issue
scrolling has problem and the gap between the copy, like and dislike button and the ai's response is too much
issue is because of this section of buttons
lithiumflow is out now? every time my prompt is only able to summon willow and another random model.
Is there a daily limit on how much you can do in the video arena?
i am making a game and the ai just stopped and keeps erroring
Something went wrong while generating the response. Please try again.<-- it keeps saying this
best image gen for manga?
Seed dream
Very interesting read thank you for sharing that. Interesting to see Russian and Ukrainian up there too.
LTX2 full testing and review. Top 4K AI video generator. LTX-2 vs Veo Sora Hailuo Kling. #ai #aivideo #aitools #ainews #agi #ltx2partner
Try LTX2 for free today! https://app.ltx.studio/ltx-2-playground
0:00 LTX2 intro
0:41 Job interview
2:08 90s sitcom
3:13 20s video test
4:48 Disney singing test
5:54 Princess running from dragon
7:03 High ac...
same thing happened to me
ohkk
Udio AI - the popular AI music generator, disabled downloads overnight after settling its lawsuit with Universal Music Group (UMG). Millions of creators lost access to their songs, sparking outrage across the AI music community.
And is Suno AI Next to fall?
#sunoai #udioai #aimusic
In this video, we break down:
Timestamps:
00:00 – The Night Ud...
Ai video next
is anyone else's UI glitching?
Is everyone trying to use Claude ? ;P
yo what's going on with the UI
Been out of LMArena for quite a while
