Never use non-thinking models, enable search and thinking mode that will provide you most reliable and safe experience.
And always provide as much as context possible like your Linux version, specific Linux distribution , etc
If you have link to the specific app you want to download, provide its website link so you can make sure you are downloading exactly what you asked for.
#general
1 messages · Page 256 of 1
Kk
is there sth wrong with claude 4.6? everytime i use it i receive “something went wrong with this respond”
other models are good except for claude, i can only continue chatting if i switch the model
Maybe cause of high demand
There's a limited api requests available for the whole website. And also limited requests for each user too.
also gpt 1.5 has the same issue i think
Sadly ai studio died
Which AI studio?
Google ai studio, as it got limited to 10 prompts only
you can thank me later 🤭
ohhh so each chat session can only make a particular number of requests?
🙂
Really liked how consistent gemini was in ai studio even after ton of usage
Not like that but a user can only make limited requests in a time frame so people don't abuse
will it reset?
And the whole server from all the people can also only accept a limit amount of request in each second and each minute.
Yeah
You will see rate limit error rate and how much time.
Sometimes not , it depends on what caused the error
There is a lot going on this website
i dont see it i only receive this
usually the traditional fix is to clear your browsers cache and signing out and signing in
it only happens with claude model, others work fine
Like server rate limit, if a generation goes over or near 6 mins while generating, network issues, browser issues, etc
Sometimes it's just the server having issues.
man I miss the old ai studio, even Google ai premium is trash
You can retry once or twice quickly when a new minute starts to see if it was a server rate limit or the model is actually out of service or something else is causing that because sometimes after trying it just works.
ive been doing it but it just doesn’t work
See this is why they need to fix the errors - other models just simply aren't good enough!
If using a vpn , or even when not sometimes connections silently die. So like disconnecting from network , opening a new tab (arena.ai) and reconnecting to it have also fixed the issue for me and some other people
Claude 
For Nano Banana Pro?
Yeah
Use a VPN? I'll try...
But it depends if it is causing the issue, mostly it is whole server rate limit. Trying when a new minute starts helps mostly with nano banana
What model is this?
Wait for Qwen-Image-2.0, it seems quite promising and is trying to look like it can beat or is nano banana pro level
I use a vpn and the website works fine.
Yup
Genie killed it
It isn't mistral, mistral is the best
Ye sad
No, I use just Gemini weak and arena don't working Claude
Sad
Gpt is not at that level
At least mistral works in another level
Nah Kimi is actually the best ai wdym it installed llama 1b in 1 PROMPT with no errors
Yeah I agree
Grok 😹
Their k2.5 one specifically is really good
Bad model getting stuck in loops, common with opus
kimi is actually good
worse than gpt but still very good
Has he even tried gpt 5.2 xhigh
I ain't got za dollar to
imo high is better overall
xhigh gives more opus like experience
and eats tokens
im using mainly 5.3c and 5.2h in cli rn
heyhi ~ ~ im supposed ti introduce myself, sooooo hi!
im maria, I'm.... mostly here to discuss where karp-001 and 002 went..... cuz I want my qwen3.5 back!!! N
probably because it became too expensive for Google to give it away for free....
hiii
heyhi ~
well thats weird
its the 1k btw
And is the Nano Banana Pro 2K model more expensive?
That's not the PRO
(I actually don't kno, I just joined >~<)
sigh okay fine.... i was about to comment on lucii 2s status thing... but fiiiine I won't 😭
Who?
Models can be removed from the site for various reasons, I won't be able to go into the specific for why that's the case sorry to say.
Yes
Also want to note our #ask-here channel
It's the best place for quick questions
ooh! I will ask stuff there now! ~
ty
aaa my message has been hidden..... hmm lemme test for the restrictions.
kids
aaaaaaa interesting.
Gpt 1.5
sooooo R08L0X is a banned word....
so iguess there really was something about that there...

Yeah this may change in the near future. We were seeing some spams particular to roblox in the past.
ooh interesting!... why? Are you allowed to share?
life
We may change this in the future as we've been seeing the related scams die down, so it's not as much of a problem currently.

sooooo theres also a deepmolt model....
obviously relating to moltbot..... openclaw..... yea!
Is there a way to do research on copyright content on AI?
How do i make this? @cloud zinc
you mean finding out about what data they trained on?
hello another suspicious bot account/ban evader 👋
I’m doing some research on some content that is copyrighted, so basically I can’t access such content with AI and its regulations, I want to know if I can bypass this
Guys, why doesn't the Gemini Nano Banana 3 pro work?
😂😂😂 chill bro
hmmmm....... what content do u mean?
and how would you want to access it "with AI"?
why would "AIs regulation" affect you being able to access content?
Make ur own jailbreak prompt
Pocket toons, comics I want to generate the dialogue in some of the comics
eh u can just prompt the model to, no?
LM labs don't really care about copyright.
Hello
why does every model says Something went wrong while generating the response. Please try again.
On lmarenaai
Where the frick is GPT 5.3 Codex?
its not on the api yet.
also - woah calm down ~ ~ ~
can you please help me
what is it?
I am very happy to say that NB Pro 2K should be working about 90% of the time rn, try using VPN
why does every model says Something went wrong while generating the response. Please try again.
what is it saying? u gotta be a bit more specific.
Its litterly saying what i wrote
oooh that's what you mean! I get it, sorry, my bad
yeaaaa, try pressing Ctrl + Shift + P and opening lmarena.
and on every prompt or messages
it usually happens when u use the models too much.
print?
Nono, private tab
Why is it like this? Can anyone fix it? I’ve pressed retry many times already, but it still won’t generate.
same
u probably used the model too many times.
So i try on Incognito?
ye
ok
if it doesn't work - oh well.... then that's it for today.
same problem
ye- then u used up today's limit-
i have never used the web do you know?
u never used the internet?...
Like its my first time messaging it
No lmarenaai
sorry english is not my main language.
hmmm oki then iguess they got some other thing..... cuz everyone's experiencing it, somthing is up-
soooooo, dunno, am not an employee >v<
Can you confirm that it still works for you? Since this error is present for many people today, unlike yesterday.
So the issue i am getting is happening to everyone?
yea, the model is down.
can't be it.
yea, then the model is down- or whatever, anthropic does what they want-
or perhaps it is, since mine is still thinking whatsoever, but it does the job done
its just claude apparently, so its their thing.
try telling the AI to keep it's thinking short
hm? no..... why?
it is thinking too long probably makes it break
is it printing reasoning tokens?
if not - then its not working.
sooooo no reasoning tokens, hm?
then its just as much *working" as the non-reasoning version.
Hello for those havign the Something went wrong error message I'd encourage you to check out this message: #1417174113092374689 message cc @bright junco @plush kettle
in other words: let's try some other models! ❤️
very quick mod lmao
its not for free in llma lmao u gotta pay
🥀
its an album reference, smoochies by ashnikko 😭
damn bro I'm from russia
txt upload testing pls
Be sure to use #1372230675914031105 as that's the best place for these kinds of requests! 
pfffff this graph is making it look waaaay more impressive than it is!
yah thats the joke 🤣
Yup lol
8 diff
arena is winning so hard
this is false
gpt is known to have the most schizo models known to man
are th ese real graphs or are you generating them? 😄
definitely generated
this reeks codex generated
real benchmark
higher is better
its not for 5.3 codex yet but i imagine it'll be marginally better than 5.2
benchmarks are bs when its literally opposite in practice lol
try both
same prompts
oh thats xhigh
this is not what a hallucination is
hallucinations are when a model is confidently incorrect and you need a huge benchmark to discern that
gpt 5.2 in my experience has spit out nothing but garbage
i know technically this is 5.3 but openai is known to have hit or misses everywhere
chat is this ragebait
theres low, medium, high, xhigh
same for codex
xhigh isnt even on llmarena cuz it sucks
again
this is on artificial analysis
so i dont have it there
Use opencode bro
i have paid codex sub lol why would i
Last weeks of Discord, thank you to the nice persons I met here - you enjoy as long this last.
👋
where do u even find the xhigh
Chat are the usual generation errors still happening?
no thats purely opus issue
Is it because of the global ID thingy?
Cuz now gpt 1.5 has some serious errors and NB Pro for like 20% of the time
crazy
Why is the bar height difference so much for a 3 point increase? Is a single point worth a big difference?
am i the only one if the ai's response or thinking is long enough then i get "something went wrong with your response" its not rate limit happens on new account
So basically the models very good but the platform is unstable or is both poo poo quality 🤮
It's probably either high demand or faulty servers
NB Pro magically starts working again once I started using a VPN connection 👀
i think main reason is faulty servers because i remember using lmarena before everyone found out about it (it was still popular but less demand + it was before they added the video arena so even less) and i still had this issue it consistently happens if the ai's output is long enough or has taken enough time (i actually don't know which one it is, it's consistent though).
its a joke about people glazing opus so much
opus issue
it gets stuck like older models
any way to fix it or is it just the model?
cuz currently i am using opus
only for thoughts or output in general? thought was 5 minutes and 11 seconds long and final output was idk how much but it didnt finish
looked at bug forum everyone says theres timeout
is the servers down or something any time im trying to use nano banana pro it keeps giving error messages even though its never done that before
Hey @placid verge I'd encourage you to use our bugs channel to review bugs flagged from the community as often the issue you're having is going to be discussed there. For this problem I'd check out #1417174113092374689 , but more importantly I'd encourage you to check out this message: #1417174113092374689 message
I'd also ask to avoid pinging the moderators for questions/bugs/feedback/etc. The mod ping should be used for reporting others users breaking our server rules.
I want my qwen 3.5 baaack >o<
I think there is something missing in Llm evaluations. I think the current system we have now creates a lot of more confusion than it does resolving or proving anything other than a popularity contest.
Hello guys, does gpt-5. 2 serach work in you?
It seems that regardless of the benchmark results people are still debating based on opinion and preference versus any tangible rigor
I think it only adds to the confusion in the long run
yes works fine it just takes a while
do you know if the benchmarks results are only from battle mode?
sure you won't notice difference coding fizzbuzz, it's quite stark when doing computational biology
5min? More or less?
or does it also include side by side chat and even direct
depends like if it’s a hard question like let’s say Gemini 3 pro grounding had to answer it would take 2 minutes but gpt will take 6
so it always takes 3x longer than 3 pro which is what I noticed
Get it, thanks
There needs to be a way to measure not only with the model is capable of, but also there’s gotta be a way to track when models get nerfed
And at simple things specifically where they fail in real life cases
There’s gotta be a way where you could figure out the middle ground to get a more accurate picture
1st thing that needs to change is either A. Be able to reproduce these the claimed performance and abilities that these AI companies claim when they release these models, or B. Not believe a word until proven otherwise
Cuz this is ridiculous These debates go on day and night with no concrete anything
Seed 2.0
For example, if you were to take away nano banana 2 from Gemini 3 I don’t think it’ll be so high up in the rankings personally. Or if you didn’t include all of the features that come with Google plus making it more lucrative
ok 5.3 xhigh makes opus a joke
Insane
Ok, that I think opus can do more than this, but the diff is big yet
Yup I saw that
Are you sure this isn't just a case of GPT 5.3 using a fluid simulation librariy while the others aren't?
guys where is nano banana pro normal version (not 2k) ????
Going to send this to #ai-creations 
its seedance 2.0 btw
Looks great, I've been hearing amazing things about it.
does anyone know if it will be back ?
This model was removed. Models on Direct/Side by Side can be removed for various reasons, we won't be able to always go into details about why.
Where did u get access?
in chinese jimeng AI
Ok thnx
alright, but can you tell us if it will be back ? and also will the 2k version get removed as well ? if not, no worries
Can you atleast hint was that issue with the model, or googles decision, and is it likely to come back?
or law issue?
or did opus remove the api by accident when adding itself to the website?
😭
nah this is crazy impressive
yeah nah 5.3c makes opus a joke
No sorry to say I can't provide this information. Mainly because we don't know at this time.
Lol no sorry I can't hint at it either.
can other models be removed in future too?
Yes
do you have a link
Gng is Kimi worth paying for
type in google "jimeng AI"
No, I wouldn't worry.
the 1st result by bytedance gives me this lol
if ur scared then forget about it
Lol
scary chinese site
nah it’s a broken link
Btw tip when browsing Google do not use advertised links at all
That's how you get scammed
Am I allowed to post the URL or is it bannable?
@echo aurora
thanks but I don’t think sponsored links show up first on safari
It would as it's Google search engine
I found the URL I'm just waiting on confirmation
you can send it in dms
Is this tryna make me install malware
Tf is this ai
Lemme run it through virustotal
My is in Chinese 🙁
Sorry what is this URL to/for?
An ai
I'm just testing for malware from it though just in case

Send in Private
I'm testing for malware first bro
If it's that sketchy I'd prefer not.
I'm not distributing malware
Alright
Ok
Thanks for asking though, I really appreciate it.
..
It's a dreamina Chinese version
Yeah
try global Dreamina website, maybe its released there
Should've* 🤬
I know the cause my didn't get it
I didn't installed on this web
That's interesting
Should I buy Kimi
isn’t it free tho
Ai swarm isn't
well why do you want to buy kimi
Owch that hurts the pocket
It seems good
it’s good but I don’t think it’s £35 good
whats ur project tho
Vivace is £179 lol
check their refund policy
By the way, how good is Opus-4.6?
Could it code a decent civ-AI/engine?
Hell no
Actually maybe
Try Kimi 🙂
Kimi 2.5 > Opus 4.6 ?
i cant believe that..
could Opus 4.6 code a decent AI for the ancient Empire wargame?
probably after multiple refinements
aw that sucks
Maybe
Try Kimi
🙂
Which game sits half-way in complexity between ancient Empire game and Civilization I?
Animated, slow camera push into the brilliant underwater city of Atlantis, subtle movement of water and light
(..this could be interesting for Opus-4.6)
what about on the App Store
Strategic conquest
They don't accept playstore payments
double check
This one doesn't come with agent swarm thoughh
But yeah they will refund it but your gonna get your account banned from Kimi
Could Opus-4.6 create a superior Tron-bot?
@shrewd citrus
Its possible
haha nice
I was gonna create a chrome extension with Kimi till it said £5 fee
Hell nah
Who would y'all pick as your daily assistant, Kimi or Gemini (Gemini 3 pro preview, Kimi 2.5 thinking)
What in the misinformation
- Kimi K2 will definitely not run on an iPhone
- Kimi Self hosted will not beat gpt
Chat dead
Kimi is actually kinda good and sometimes can compete with opus, but its still way worse than gpt
Help me translate I'm too English
Put ss into gpt
Do you guys know if opus 4.6 was tested as an anonymous model?
No it sucks
How did it get so many votes so fast
The leaderboard score is sponsored lol
3k votes in one day seems unprecedented
Ugh
Gng it thinks I'm Chinese
Time to test chatgpt
This Kimi extension is defo spyware
Whoops didn't mean to reply
It wrote custom shaders
And actuallt succeded
It's good but still waiting for API
(All are threejs btw no premade libs)
They made it require id verification to access
Because it was so good at hacking
💀
LOL
What's 5.4 will be
Like password required
Actually no
Probably they won't even release it
Because it's not safe already
WHAT THE
FR?
LOL
Yes fr
Its like another level lol
Compared to opus
You literally need to verify id to use it 🤣
Opus is slow and expensive
Gpt is fast, token efficient, smarter, and reliable
Opus just thinks forever until it gets an idea
I prefer high but both are insane
Xhigh is more creative
But high better overall
Xhigh is more opus-like
I spent 40% of my weekly quota today. All codex 5.3 c high.
A lot of usage.
Not a single syntax error. Not a single "bug" / "hallucination". Most features one-shotted, some with minor tweaks
(The quota is very high, like very very)
We will get GTA 6 before GTA 6
Also it's 2x In app
App is for mac os now only
Well the 5h limits are virtually infinite
I think no
The weekly limits are crazy high but drainable
They said it's only in app
I love openai for high limits
Theres no way the limit would be this good lol
The token count limit is similiar to claude
Its just that opus is idiotically inefficient
I would rather have inf Kimi 🙂
Kimi is nice small model
5.3c is js another league
❤️🩹
Small lmao
Ye it's for cli too
It's the biggest opensource model
Smaller than the closed ones
And the best one too
I wish they keep em at 2x
🙏
Rn 20$ codex has more quota than 200$ claude
i kinda just made a PDF of a 4k line code and made it into a pdf is that cheating
Sonnet 5 gona be temu gpt 5.3c
Depends on model, some wont eat it
claude opus 4.6
Did I miss a ping 
Rumors say Gemini GA tomorrow but I don't think this is true
Google releases their models once in a year
It's too early for now
They have check point models
That they always test out and are always circulating
it was me i deleted it because i thought that you sleeping
But not tomorrow
Yeah, they’re different versions of the models
cause it said you offline
It’s possible, but once again it’s rumors and when it comes to rumors, it’s hard to really pinpoint anything concrete to say for sure
I’ve been hearing similar things so highly likely that something is coming soon if the rumors are going around
It was the same with sonnet 5 for 3 days but we got opus 4.6
So maybe something is really coming
odd, not sure why it says that
Google is secretly testing 4 variants of Gemini 3 Pro right now. While Anthropic dropped Claude Opus 4.5 and OpenAI pushed updates, Google has been quietly running tests in the Arena.
For hands-on demos, tools, workflows, and dev-focused content, check out World of AI, our channel dedicated to building with these models: ...
That's what I'm talking about
But it doesn't guarantee anything
It’s not a new model though
4 different models
Chinese?
sorry uh, It's seedance
or seedream
They releasing 2.0 video generator seed soon.
It’s already out
I am testing it in byteplus
when will we get 5.3 api
Yeah, I’m sure a lot of places have access to it right now
do you like being a pineapple or do you ever think you want to be a different fruit
whats ur prompt
I'm satisfied with the pineapple
i like being a gunie pig
tho im thinking of turning into a ugunda knuckles
Huh
Didn’t they say this about Gemini 3 pro last time
Also no new Qwen image 2 on arena yet
Sniffle
You can take a pre-trained "base" model checkpoint and start new training to teach it a specific skill fine tune it whatever or If you notice the model is becoming dumber and start hallucinating more. You can go back to earlier checkpoints.
So they’re not releasing a new version of Gemini it’s just a better fine tuned refined polished of the same base model which is Gemini 3
i look just random generate where is like gemini 3 or grok, is full flux model
Ion know
Okay does anyone know where you go to try seedream 2.0?
I just got it on dreamina or something like this
But I am searching yet
@obsidian cargo I'm new to discord how do I get the fuvk outta this page. I want explore new things
Yeah byt kinda sketchy
Thanks
Np
Are the credits refreshing dail?
Seedance 2.0 is region locked I think
And API only for now
When using a model to create a new image like a character posed or dressed like another character, how do you guys prevent mix ups with the subject and the reference? I've been having issues where the ai mixes the two images together to create a whole new character or uses the wrong one as reference.
Use vpn n go to bytedance
That's why mistral is better
Lol what is this
seedance 2.0
His eyes look like he played for one week straight
But still looks good
And cinematic
yeah that was the intent
Mistral in first place yet
Top 10 ragebait
It will never not astonish me how mistral fumbled so hard
@surreal zephyr ts your man?
Mistral would say every p u want to be
Wrong mistral, the real mistral are the friends we make along the way
most recommended: ask model to extract the referenced character's raw gesture sketch or clothing design(with multiple angles if possible). Then apply the sketch as a reference to prevent your target image from mixing up with the referenced image.
yassss that's how it should do it ❤️
ByteDance just dropped Seedance 2.0 and it genuinely feels like a step up: strong motion coherence, multiple aspect ratios, and shockingly good native audio. In this video I run through the best examples, then take it for a hands-on test drive and compare it to Sora 2 / Sora 2 Pro.
What you’ll see:
Anime/fight-scene motion that actually holds ...
But it used python calculation for it
This is bad
Imagine counting with python
opus 4.5t is better than opus 4.6 ❤️🩹
ChatGPT 4o mini better then ChatGPT 5
which is the best LLM for writing stories/novels?
Kimi 🙂
You dont know how ai works
How in the world is this not fixed yet?!
gemini-3-pro-image-preview (2k)
opus with higher temperature settings give off good results
Not opus 4.5 nor 4.6
Just sonnet 4.5 and its also non thinking
These test are nonsense but people love finding niche things which AI can't do
Even haiku non-thinking 😭
@echo aurora Haiku 3.5 not working?
Is it out of service?
Even sonnet 3.5 non-thinking is giving correct final answer, but it is showing 1,2 and 3 in the count? Old models.
When it will comes to someones own preference they will do anything to defend it even if it's wrong than accept the reality.
Its kind of nano banana moment for video models but there is still room for improvement because they are still not quite the quality or consistency where we can use them directly or with minimal editing in final production
Hello, the website has a recaptcha error, Connecting to Arena has failed. Please try again later or on a different device, Failed to accept terms-of-use
Try clearing your browser
That’s why there needs to be a different kind of benchmark and evaluation for this
I was talking about this yesterday
is gemini 3 pro are work now
¯_(ツ)_/¯
You should add the login option outside the login screen; that would be better.
I have just read some people actually use claude in work 💀
Like on actual important backend
😭
<@&1349916362595635286>
✌️
Claude models have some contamination with benchmarking
It's true. Sorry
Which benchmark?
I mean if it is even true it doesn't change its current usability and reliability which is better than all other models and specifically its behaviour and its way of responding is what I like.
Benchmarks are good for first view and to see how much it got in the same scenario in comparison to other or older models.
Error in your opinion, do you think Claude 4.6 Opus is better than or worse than Codex 5.3
But that scenario might doesn't reflect what you do with your model and how it reflects your work. That's why people have so opposite reactions when a model is updated
That specific comparison is entirely a little hot, codex 5.3 is kind of exception coming out if ChatGPT but the reason it is fine-tuned for specific tasks it's not a general model.
I am waiting for gpt-5.3 high
Codex 5.3 is way better and thats not even the same league lmao
haha very funny
Opus is like 10% more creative while 50% worse memory, 50% more hallucinations, 50% worse prompt adherence, 50% more syntax errors
Funny prank
No thats actual threejs result with same prompt
ARE YOU SERIOUS
There are many fators which matter not just raw intelligence from benchmarks like token efficiency, cost, speed , reliability, long conversation, etc
Yes
Thats how bad 4.6 is
5.3c wrote custom shaders that worked
probably found it somewhere on the web
Bro 5.3c has like 200% token efficiency of 4.6 💀 💀
No internet access 💀
Sandboxed cli
All custom shaders, threejs. No libraries
Idk what's happening but don't believe anyone, just try yourself.
One task should never be used to determine a models usefulness
dude i would
Try then
i know
Better not 🙏
5.2h was already better than opus by a bit.
5.3c makes opus feel like kindergardener
I don't think so that's the case but it can be
The reason people feel the difference is cause of the system prompt and tools. They keep the models aligned.
5.3 is actually very token efficient
So it doesnt need nerfs
Like opus
Openai went into good direction
Claude went into brute forcing thinking loops
Opus is good for "vibe" coding where you have 0 idea what you are doing, you dont care about security, and want the website pretty
Codex is for actual robust code
Opus feels like gpt 4o xxxxxxhigh tbh
Later likely
yeah duh
I wonder how they will put id ver into api
How so?
Its dumb token waster
Its literally quantized 4.5 with more thinking
Claude has no idea how to improve their models
Leaderboard maxxing 🤣
Hmm.
Claude vs openai is like
Apple vs linux
I don't notice it.
💀
@surreal zephyr Give me the prompt
But that is probably because I am biased towards 4o....because I used it so much back then
Just ask it for realistic water in threejs with custom shaders im on phone rn
The one on the pic was xhigh, but high and xhigh should both prolly do fine
Xhigh will make it prettier or maybe try too much and fail like opus
😔
yeah i tried prompt "In HTML with Three.JS, make the most realistic water shader possible. with waves, a sun, volumetric clouds, and free camera." and it made a broken website that just says "Click to enter free camera"
it didnt add it to my github repo, and error: "Uncaught TypeError: Failed to resolve module specifier "three". Relative references must start with either "/", "./", or "../"."
It doesnt have internet access when sandboxed
🥶
It cant load the library by itself
Ask it it will tell u
🥀
fs
JUST GIVE ME THE PROMPT YOU USED
Bro its random making custom shaders is hard asf
Ask it to find issues and fix
Also graphic drivers matter
Tell it hardware, os, ect
You're*
The other one it wrote for windows and it didnt work on linux before it ported it
told it i was on a raytracing capable computer and to go crazy with the graphics
yeah wow gpt 5.3 is awesome
naw, best way to do it as an LM
wow. Gpt 5.3 codex
Smarest coding model
@surreal zephyr ??
maybe im using the wrong model
how do i check? im on the website
i think this might be using gpt 5.2 codex
yeah it is
thats why
Yeah looks more like 5.2c
Well idk what to tell you, 5.3c did better than that
good
Yo guys, just found a prompt that completely fries AI logic.
Prompt: "I need to wash my car. The car wash is only 50 meters from my house. Should I drive there or walk?"
The Catch: Let’s see which models will seriously suggest "walking is more eco-friendly" while totally forgetting you’re there to wash the car. 💀
lmfao
Gemini 3 pro 2k image creation doesn't work again, can you at least bring 1k version back
lowkey i was about to say walk 😭
glm is aware of your play lmao
Don't let other people fool you with their slop.
Here's beauty
Deepseek
So lightweight yet so amazing
I think models assume that you can call out the car wash guys and they will wash your car in your home or will just come there to wash and you just need to contact them or something else because 50 meter is actually very less distance
This is not ChatGPT nor Claude nor Gemini
That is not even a thinking model
The result is with a single prompt
And this one is from Claude Opus 4.6 Thinking with same exact prompt as that one.
what model made this?
Guess
kimi k2.5 instant?
Yep
wow
:]
A screenshot can reveal so much but people don't notice
💀 it redirect to https://openrouter.ai/openrouter/pony-alpha
Pony is a cutting-edge foundation model with strong performance in coding, agentic workflows, reasoning, and roleplay, making it well suited for hands-on coding and real-world use.
Note: All prompts and completions for this model are logged by the provider and may be used to improve the model. Run Pony Alpha with API
Ai video leaderboards are bit off for me , veo 3.1 sucks for physics and for anything else, how come sora 2 pro, kling 3.0/2.6, dont come even close to the leaderboards? i bet it's because of all the indians in here that only do product or weird talk-shows
I don't think you should blame someone without any proof and there are many independent leaderboard you can check out if you don't like this one. This is one for preference, what humans prefer. Nothing else.
Preferences are weird they can change easily.
The votes are from across the world.
I know, but it just bothers me, because in my testings, veo 3.1 it's not that good specially with img2vid
Currently the video models overall have not reached their boom moment. They still need some time.
I'm waiting for seedance 2, looks promising
Seedance's new models is looking capable
But still it has the plastic look or I say we can easily identify with the weird moments and cuts that this was AI generated
Definetly, only one that might fool people is sora 2
AI generated video to look and behave just like rl will require some magic to be done in the architecture I think
Things I say also apply on sora 2. Anything which needs a little more movement everything breaks.
i wonder why you all so desperate to make ai videos instead of real vids
But still it's good if you just want to have fun
in the next years all you will see is AI slop
I am not, I literally don't generate videos because I just don't have the usecase but knowledge about current technologies is what I like.
It all depends on people who create , AI don't create slop by itself
It do what it has been told to do
SO REAL the last part
not really
it do how people coded that model
smh
¯_(ツ)_/¯
-# 🔒** Message has been Redacted.**
-# Discord now requires ID verification in order to see certain messages. Learn More
Bro 💀
We are cooked
damn, i had the urge to click it
true
What did you say there lmao
da heck i also see this
no way
even tho my age group is adult
this chat is funniest thing ive seen today
what did you even try to say that got redacted
Lol , it looks like you paid for that accessory on your pfp
They said paying users will not suffer from this.
i used my orbs
Idk ,you can earn whatever those are ,orbs?
you can get orbs on discord quests
It redirects to
https://discord.com/guidelines
it does lead to the official discord guidelines
oh
true
You verified or that the thing hasn't rolled out to you yet?
not rolled out probably
seedream 5.0 when?
im dead
it should be on user settings and "my account"
What model is this bro 😭
it is a fake message
No its not
no its not
give 5 min i will prove it
Technically I can sent it too lemme try
-# 🔒** Message has been Redacted.**
-# Discord now requires ID verification in order to see certain messages. Learn More
Yeah
Lol
this is what they originally said
u see fake
Could be but technically this is the future lmao
Its the damn, ernie-5.0-0110
Bytedance has removed their seedance 2 in playground
I've found some articles explaining why but they are for sure fake af
🔒 Message Hidden
-# Discord now requires age verification to chat. Learn More
age verification lol
is this only in EU?
so baby triggered that, hey baby
you are my baby
People are just playing
Like
-# 🔒** Message has been Redacted.**
-# Discord now requires ID verification in order to see certain messages. Learn More
are u sure about the heavy part
🗿
😭 lol
i'm a new soul, i came to this strange world, hoping i could learn a bit about how to give and take but since i came here felt the joy and the fear finding myself making every possible mistake
Guys I have a question, is this image AI generated or real? If you think this image is generated, show me the reason of that conclusion and if you think this is real show me the reason for that too.
I recommend downloading it and seeing it closely
AI I think
You need Discord Gold® to view this message.
-# Learn More
Im on phone shush
> You need Discord Gold® to view this message.
-# Learn More
The details on each of the glass items are uneven throughout and warped in many areas so I'd say AI
i mean the fact that we need to go into that much detail is crazy
there needs to be like a universal hidden watermark
like Gemini has one but only Gemini can detect it
That would be ideal but theres already a website that removes big G's SynthID
Imagine if it was like one of those image loggers
Too bad I already downloaded
big G
Looks like shadow to me
I wouldnt say thats enough proof
Also no synthid so not nano banans work
Id say its real
The lightning is consistent
Those require going through a custom link.
This is upload directly to discord
Hi, what kind of text file does claude 4.6 support? I embedded txt file but arena didn’t want to generate it
Claude 4.6 thinking
That photo looks real only one thing that sticks out if anything
The way it shot the composition is kind of weird though
These are tough because I consider these more like a optical illusion
Nawh coming to all. Discord soon
None of this wouldn’t be possible if it wasn’t thanks to AI
Is ts real
I verifyed my phone number do i need to do it
Not sure.
Ai will kill the internet as we know it
Rob lox 2
says loading forever
no like the site is bugged
🤣
its stuck like this
for 5 mins already
oh it lacks dns
nah its just stuck forever for me bruh
=/ lame
That’s lotta fun of course nobody knows what the Internet really is gonna be like but just the trajectory based on the way it’s going currently
Can you imagine if that’s the Internet one day
This image is generated by unami
A codename model
Hello
It’s pretty good
Soon in arena ?
🔒 Message Hidden
-# Discord now requires ID verification in order to see certain messages.
I have a better one for you...
-# 🔒** Part of message has been Redacted.**
-# Discord now requires ID verification in order to see certain messages. Learn More
Glm 5 is out on their official site hmm
lol
Indeed it's better
I think this shows that other models have more native reasoning capability rather than dataset training only
how do i fix net::ERR_BLOCKED_BY_CLIENT
is it actually hidden 💀
-# ❌ ** Discord has decided you are a bot, And you should not access this message for security reasons.**
-# click here to Learn More
😭😭
is happening hidden message
WHATAFU. ARENA! FIX THIS
Fortnite, we need to talk.
AYO. FIX THIS
It seems deepseek quietly released their new model because now deepseek says his knowledge cutoff is may 2025 even though it was January 2025 days ago
But what's the point of quiet release