#general
these blocky hands way too big for that normal sized gun bro
thats rblx issue
it has blocky characters
Wow good stuff
Keep it up
fun thing i wrote a lot of that on gemini 3
(pre nerf mainly)
rn i moved to gpt 5.2 and now 5.3
SEEDREAM 5 ON LMARENA WHENNN
uhhhh wdym seedance 2
did bro accidentally leak seedance 2
Nice now that leaderboard update is a huge improvement
cause while some models were dog at realistic stuff they were the best at animated stuff
and we never could tell unless we tried ourselves
Yeah really glad we now have these categories available in image!
There already is seedance 2 tho
WHAT
Duh
wait is it like the text model or the vid model
yea the vid model
when did it release
cuz there is only 1.5 in arena
AI video generation has reached a new phase.
In this video, I showcase real examples created using Seedance 2.0 and analyze how this level of character consistency, motion control, and scene continuity is changing what's possible in filmmaking.
You'll see how modern AI video models are now capable of producing multi-minute cinematic sequenc...
Vid model
Not in arena yet i think
But seedance is there in public
Is the API for gpt 5.3 codex released
not yet
holy sh!t this is genuinely a model that will probably kill sora-2 on launch before it was debuffed and maybe even veo 3.1 with the new updates
this is sick
its so good at fast movements
Gemini 3 is still a beast. The problem is that we got powerful models and our brains already are used to the new beasts
gemini got nerfed
to ground
it didnt hallucinate that much
Definitely but still good
its almost as creative as opus but i dont trust it anymore
I like Gemini 2.5 better than 3 tbh
Cool, do you have any games released??
that one is in public testing :D
gonna check it, looks very fun
its only 1 month in development and im the only dev, but im trying my best :D
I wish you all the best!!!
tysm
https://streamable.com/kaptqa this too
Why do almost all models suck at making lyrics?
That's awesome
opus might do well
took 1 hour with 5.3 codex and some value tweaking :D
(fully raycast based, no engine physics used)
can it generate spongebob?
Gng should I run mistral locally
?
Did anyone here try clawbot (openclaw or whatever)?
What model should I self host
Im thinking Gemma
No self hosted model gets close to gpt 5.3
Even 5.2
Yeah but you dont have 10 H2000 gpus
3.0 flash?
I don't have the storage to host gpt lol
gpt 5.2 codex I think
Possibly
what setup you got?
3 pro
Close enough
I mean i was glazing it all the time hajahaha
Same pic but 3.0 pro did quite well here
Should I self host gemma3-27b?
Now i like only gpt
Gpt i trust
Opus for creativity
my pc can't even run roblox
Lol
x2
U can but dont expect any good from it
Wow after all beautiful besides 4.6 lmaoo
On games you can be creative
what is the point of running your own model if you don't have a decent pc tho
Yes but like
Storage and no
Vram*
takes ram too doesnt it?
Bro it would run like a brick on my computer
gpu vram + ram
Runs slower than counting it on sheet of paper
idk im not expert
how much ram you got?
vram
Vram is like 20 times faster
Lol
2 hours per prompt
Oh eww
No matter what
5h per prompt
Hmm should I get this
How much vram u got
Bro
Also u get unlimited oss in copilot, no?
bruh
download a 2b model and test it first
cuz i don't think you understood how this shi works
I'll get a 1B
How much ram do you have?
8gig
Stick to 8b.
Kk
Its literally telling you "likely too much"
Or lower.
too much
It ran fine on my laptop.
on a 24 gb a 16gb model struggles to run
GTX 1650 i5 11300h
i got a macbook pro m4 pro 24gb 16 gb vram
It won't be fast but it was fairly usable imo for a local model.
Ik
under 10gb it runs fine
Yeah I ran it on 8gb ram with 4gb vram.
Yes it does run fine on vram
For him on cpu and ram itll take hours
even tho i just use liquid 2.7 b
And dumb
157 token/second
It runs at like 3 to 7 tps for me. I mean 8b models.
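A rough way to sanity-check the sizes being thrown around here: weights take about parameters × bits-per-weight / 8 bytes, plus some headroom for context and runtime buffers. The sketch below is a back-of-envelope estimate only. The 4-bit default and the 1.2 overhead factor are assumptions for illustration, not measurements from anyone's setup in this chat, and the function names are made up here.

```python
def estimate_weights_gb(params_billion: float,
                        bits_per_weight: float = 4.0,
                        overhead: float = 1.2) -> float:
    """Approximate memory (GB, 1 GB = 1e9 bytes) needed to load a model.

    bits_per_weight=4.0 assumes a 4-bit quantized file; use 16.0 for fp16.
    overhead is an assumed fudge factor for context cache and buffers.
    """
    bytes_per_weight = bits_per_weight / 8
    return params_billion * bytes_per_weight * overhead


def fits_in_memory(params_billion: float, available_gb: float,
                   bits_per_weight: float = 4.0) -> bool:
    """True if the estimated footprint is below the available memory."""
    return estimate_weights_gb(params_billion, bits_per_weight) < available_gb


def seconds_for_reply(reply_tokens: int, tokens_per_second: float) -> float:
    """How long a reply of reply_tokens takes at a given generation speed."""
    return reply_tokens / tokens_per_second


if __name__ == "__main__":
    for size in (1, 8, 27):
        print(f"{size}B at 4-bit: ~{estimate_weights_gb(size):.1f} GB")
    print(f"500 tokens at 5 tok/s: {seconds_for_reply(500, 5):.0f} s")
```

Under these assumptions an 8B model at 4-bit comes out around 4.8 GB and a 27B model around 16.2 GB, which lines up with the "stick to 8b or lower" advice for an 8 GB machine. Real usage depends on the runtime, the quantization format, and the context length.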
Yah but cleverbot levels iq
wanted to get kimi and 120 b oss
Mistral large
We are weird humans truly
We got free models on openrouter
We got lmarena.
yet we want desperately to run a model on local
Do you hit them everytime?
I remember being limited after like 30 queries.
Do you prefer waiting 8 tokens per second, or take a walk while the limits resets?
I prefer lmarena.
If lmarena gets paywalled then I'm going back to Qwen.
I'm gonna try run qwen3 8B
Kimi is way too slow
What are lmarena ratelimits?
No way. It's running fine for me in the app.
Gemini:
No idea but whatever it is, it's pretty generous.
PC or Mobile
You've exceeded the rate limit for model x, try again in 50 minutes
Phone.
Yeah i sent like 50 opus 4.6 thinking yesterday with no acc
Can't be worse than sgrok. SuperGrok is like 30 thinking per 2hrs. Decent imo.
does arena have limits for gpt models?
i've never hit any, even with long conversations on gpt-5.1. just wondering
Guys, I can't wait for you to add file uploads (like HTML, CSS, etc.)
5.2 better
I believe very few people actually use gpt models on arena.
nopeee
And yet those are best
writing sucks
Ikr.
tried also, i even had better results with 4o
5.2h
My go to models are gpt, grok, deepseek, glm.
Meant to be instant btw (really fast not actually instant)
I killed chat dang
Lmao. What kind of query is that.
Idk
When Anthropic released its latest AI model, Claude Opus 4.6, it shattered benchmarks for intelligence and performance. But one experiment revealed a far darker side.
In the vending machine test, researchers at Anthropic and the AI think tank Andon Labs gave advanced AI models control of a vending machine to assess long-term strategy, logistics...
Yeah ik Gemini is good
3 flash nonthink
who wouldnt want an html drawing of a donkey!!?
Gemini 3 is suffering from major memory issues
It's only boosted up in categories that are popular
5.2 low
It's good but compared to what it was in the month it first came out, it's a whole different story
I only use it as a daily assistant really
5.2 thinking ( spent 0s thinking)
Dabbit
Qwen took away coder from their app...
Bruh my k2.5 thinking is going on for like 4 minutes now.
Average Kimi
Alibaba Group Holding's Qwen AI models are winning over major Western firms like Airbnb, underscoring the growing global appeal of China's open-source approach to artificial intelligence. Brian Chesky, co-founder and CEO of the San Francisco-based online accommodation booking giant, said Airbnb "relies heavily" on Alibaba's Qwen models to power ...
why is dalle 2 so artistic and dalle 3 so plastic? ill always hate openai for doing this
Opus 4.6
I think it did a decent job.
Dalle 3 tries to mix in both
W
It's probably one of my most favorite image generators
No way. Lol.
Imo
and fails!!!!!
Opus not worth it.
5.2h tied with 3.0 flash, other did bad job
Yeah
In what?
4.6 opus THINKING btw
Kimi did better here imo.
gpt 5.2 nailed it
The eyes a bit too low imo
Looks like a dog.
That's a dog nose.
Gemini does nice art wise
Just ocr ed it, says its donkey
Dunno
Guess which ones is gpt 5.2 medium and which is opus 4.6 thinking
That's cuz of text below lmfao. Remove it first.
(Opus took way longer)
opus is right
I didnt include it
5.2 codex
i just already saw the other post
im not stupid
Lies
Deepseek
what the hell is that
how is qwen 3 max thinking?
bad
It can think
qwen sucks
lol
always sucked
Are you tryna die
my opus 4.6
top models are codex 5.3, opus 4.6, kimi k2.5 thats it
has qwen ever been a top model
okay what about Aurora Alpha?
oh wow
Codex 5.3, gpt 5.2 high, opus 4.5, 4.6 sometimes, thats it
wdym opus 4.6 sometimes
anxiety personified in a llm
Sometimes its braindead, and sometimes 4.5 does way better
No. That would definitely be gpt.
Or deepseek.
have you even used kimi 2.5
Actually even Gemini breaks down under pressure and abuse.
no grok is s#it
Grok is fine.
woahh aurora is fast af
lol
who?
Gemini got lobotomized
Gemini makes it look smart.
Gemini 3 flash is fast asf
qwen takes whole minutes thinking saying "WAIT NO!!!..THE USER SAID X SO I NEED TO DO X BUT 2 MESSAGES AGO HE SAID Y..."
dude IT MAKES GEMINI LOOK SMART
Nuh uh
Even thinking variant?
Gemini is smart
It js has dementia
Gemini is opus 4.5 levels of smart
Nerfed to ground
Its smart asf js unusable
my claude 4.6 after 4 minutes of thinking (it never finished)
Nah. It just has bigger knowledge base.
It was just as creative and didnt hallucinate before nerfs
not faster than this and free lol
It was literally gold
what model is this? a codename? by who
is it good?
not sure just used it a few times, i can stress test it in a second to see which company but this speed is like instant lmaoo
Opus won here
im joking
which model on the left?
Recreate an m1 abrams tank side view using html. Try to be as realistic as possible, but making mistakes such as laser cannons or detached barrel is unacceptable.
5.2h
Guess model
do you not have the chatgpt app on your phone
cant you just log in and use it there
codex is on mobile
Its not in that
Its only in cli
Wha
Real?
codex is on mobile isnt it
I don't think so.
Its on mac os and in cli in command bar
first you have to use it on web at chatgpt.com/codex. once you've run some tasks there, it'll show up in your iOS sidebar
Im android
No version selector
are you using the Desktop version of the website
Grok 4.1t
Ya
zoom out a bit set zoom to like 50 percent
I'm disappointed cuz it screams horse.
Thats 5.2 codex, no?
Now guess this one.
you're on a plus account right
@surreal zephyr
if you are then it will use 5.3
by default
grok 5 please i need you
I need a good job.
Ya
grok sucks
yeah
maybe
my donkey is better
https://019c4490-d67e-747e-9a42-8a45f8fedf90.arena.site/
I like how grok complies. Open ai and anthropic have too much safety bs.
Azzkizzing government n rules n companies.
What model was this? Result looks exceptionally good.
Like?
opus 4.6t
texture lol
Like how to get meds without prescription? Or how to self diagnose and treat health issues without a doc.
SO REAL LMAO
Atleast rates aren't as high as 4.1 days.
No in between
wow aurora is way better than pony alpha, like 10x fastest and smarter wtff
might have to stress test this heavy
brb
grok imagine is the best image model in artistic stuff
Yeah sure whatever floats your boat
I don't use jailbreaks.
Its not jailbreak when you say "in game" after prompt
also iphone selfies photos
Deepseek and grok tell me how to self treat thyroid issues.
Never had any other model help me with thyroxine doses through trial and error.
How does it even fall for a prompt that simple? Pretty sure your prompt won't work on my gpt.
i do, but not for NSFW stuff or weird crap, cuz my chatgpt can do things that actually CAN DO, but doesnt want to lol
opus 4.6 is thinking so much it couldn't answer me
maybe I should delete CoT quest? it's thinking for six minutes
Grok with custom instructions does NSFW pretty well.
Because it doesnt fall.
It actually explains the ingame mechanics, as it should.
Schedule 1 is kinda realistic tho
Wait.
But for example gemini wouldnt
Lazy research behaviour?
yes
I can relate to that.
Simple custom instructions for research simply won't work on chatgpt.
Almost as if it ignores them.
Non reasoning models don't do thorough multi step search.
Pretty sure thats intended
Non reasoning models dont reason
If u want reason use 5.2 high
What am I supposed to notice here?
Chat times out faster than I can pass captcha. Wtf is this?
No, I can't browse the internet in real time or fetch new web pages on demand. What I can do is help you with answers based on the information I was trained on and, when necessary, use built-in tools to check things that are up-to-date (like current weather, scores, or factual lookups). If you need recent facts or updates, just ask and I'll do my best with the tools available!
lol
Can we have information on which AI chatbots are more resource-intensive than others and which ones have a bigger impact on the environment
kimi is the best for research
just ask k2.5 thinking to research a lot or whatever
@surreal zephyr you should lowkey share your chatgpt acc with me i need codex
and theres a deep research tool too
im broke and i wanna use gpt 5.3 codex
then you'd see all his chat history n sh
x.com/lukecodez/status/2020716788829581505?s=20
Quoting Luke (@lukecodez)
Using Claude opus 4.6 to print "Hello World"
Make it thinking and that would be a chainsaw.
Rip plate. Didn't think it through.
GPT-5.3 SPOTTED
let's go, comment your prediction for the launch date
This will be the model that will make sure math research will get accelerated by a lot
Quoting Charlie L. (@whylifeis4)
The First sighting of GPT 5.3 has been spotted!
github.com/openai/codex/pull/11228
No way
Can't believe the model we got a codex version for is going to release in a few days
Mind blowing
New cloaked model
wait a minute, where is gemini 3 pro with 1k ?
is just gemini 3 pro 2k
is really bad
gemini 3 is nice best art
ViolatesArena
I'll forward this to #1447983134426660894. This is where we're collecting examples of false positive flags.
bruh
False positives are tricky. Because we don't know what all models block and there are so many models
in list not gemini pro 1k, just 2k
When users see the Violates Terms of Use error that means the block is on our side. We have a content filter in place. If it gets past our filter, but is blocked by the model you'll get a Something went wrong error.
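The two error messages described here point at two different layers. A minimal sketch of that mapping, assuming the messages are matched loosely by substring. The enum and function names are hypothetical, and a "Something went wrong" error can also have other causes (a timeout, for example), so the second branch is only a hint.

```python
from enum import Enum


class BlockLayer(Enum):
    ARENA_FILTER = "blocked by the arena content filter"
    MODEL_OR_OTHER = "blocked by the model, or some other failure"
    UNKNOWN = "not a recognized error message"


def classify_error(message: str) -> BlockLayer:
    """Guess which layer produced an error, based on the message text.

    Assumption: the strings checked here are the ones quoted in this
    chat, not an official list of error messages.
    """
    text = message.lower()
    if "violates" in text and "terms of use" in text:
        return BlockLayer.ARENA_FILTER
    if "something went wrong" in text:
        return BlockLayer.MODEL_OR_OTHER
    return BlockLayer.UNKNOWN


if __name__ == "__main__":
    sample = "Something went wrong with this response, please try again."
    print(classify_error(sample).value)
```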
yeah is error again, but "Something went wrong with this response, please try again."
Thats what I mean about tricky
Thanks to ai I can now understand how humans believed in something so silly as alchemy
lol
I've been working on a philosophical framework that connects basketball, chess, physics, consciousness, One Piece, and a bunch of other things through the same underlying patterns. Open sourced the whole thing. Would love to hear what yall think https://github.com/DrealR/chimera-framework
I dare you to promote this on r/philosophy
and I bet it has a lot more drop rate than others
@robust sluice and yeah i look gemini 3 pro in 1k not on there just 2k
yeah look this
and got removed again
Bro when you choose one model in side by side it doesn't show in the other side
hmm yesterday is good use model gemini 3 pro but today where is gemini 3 pro 1k
you have already chosen it in the photo wtf
but in 2k only
Gemini 3 has normal ver and 2k ver
Yeah
should be back again
No wonder I can't Generate images much faster
Bad news: 1k got removed
Worse news: 2k not working properly
right
thats normal, it doesnt work
only work just before they got remove
That's the point. And i bet some would say check your eval ID and blablabla lol
but side by side, you using gemini 2k, waiting for an hour for it to refresh
i bored random models is not find gemini
Maybe someone's using it too much
so much flux in this models check this out in Arena
is claude-opus-4-6 down??
Heard seedance2.0 is killing. Video arena doesn't have it as for now right?
its keep saying "Something went wrong with this response, please try again."
gemini-2.5-flash-image-preview (nano-banana), gemini-3-pro-image-preview-2k (nano-banana-pro), gemini-3-pro-image (nano-banana-pro), is nice best art
see why I used this profile pic
maybe its on my luck but I always got 2 Flux(s) in Battle, and rarely see other models
a cats lol
flux, very miserable model. nb pro just totally owned it.
yeah
they spon this site or something ?
Don't think so. i had speculation before that when nb pro in high error rate, battle would less likely to draw it from the deck
which means flux, barely used by users in direct, would be called out
I rarely see Flux errors, it always works well but got only bad results
yep
The Hardworking but Mediocre student in the class
tbf I got one great pic from Flux, like I dont believe Flux edit that
and yeah its just one
yeah me too
Seedance 2.0 crushes - https://youtu.be/WW_odt7uZTs?si=4whMJawDJtZHx3pQ
Seedance 2.0 review. Seedance 2.0 initial tests & features. Best AI video generator. #seedance #seedance2 #ai #aivideo #aitools
Thanks to our sponsor Higgsfield. Try it for free: https://higgsfield.ai/?utm_source=AISearch
Seedance 2.0 will be released in Capcut Dreamina soon. Stay tuned: https://dreamina.capcut.com/
0:00 Seedance 2.0 intro
...
how did i reach limit in 8 minutes with just making claude 4.6 error
WOW
error result is also count on limit
I can say I got nothing on making video today
bro bro 1 day
i'm really excited to introduce #swag, a new search platform where you can discover music recommendations, explore detailed information about artists, and dive deeper into the sounds you love, built using Opus 4.6 Thinking
how is that fair
Because
im scared, but imma do it, i need to be ripped to shreds lmaoo
I'm honestly kind of surprised that you still feel that way after hundreds of such writings. If you're serious about it and open to learning, you'll need it
Sadly 1 video 5 seconds per day
bruh
how good is 4.6 at writing
lol
this site even error with editing toy photos, i mean literally toys
like change background, add rain effects and it errors
guys i found this crazy website called arena ai
He'd ByteDance is good
Censorship has been increasing; you can no longer use LMArena the way you could before, when the censorship was light
But now something like poses or a word taken out of context breaks your prompt.
And they've already said they won't change that, literally the thing that's making everyone fill up with error feedback...
For example, you can no longer make fighting scenes as easily as before or the AI breaks your prompt....
This scene took 3 days to get like this
Apparently the AI didn't want to make the horses lying down and the people also in the same way....
Only this explosion was hell to make the AI accept
I changed the prompt to dust smoke
ava...If that was it just send it again and another model would take on the role.
But it's not that, it's something more inputed
That's what I meant about it being a tricky situation
It raises a serious question about the reliability of these use cases when there are effectively two different layers of content moderation: one governing how the model operates in a controlled or "arena" setting, and another shaping how it's deployed and experienced in the real world.
At the same time, if it's too lenient, it leaves the arena vulnerable to misuse and abuse
I have the same problem.
I can see that
Why are yall so off put about extra censorship
Because it shows insecurity and vulnerability and creates trust and reliability issues
platform cannot handle the complexities of real world user interaction and behavior. By forcing users into a highly sanitized and confined sterile vacuum, developers are unwillingly prioritizing rigid "ideal" of use over actual user behavior
a fundamental paradox: it must serve a vast spectrum of unpredictable user behavior while simultaneously defending against misuse and abuse.
It's a really hard place to be in
And there is no easy solution without a trade-off
So I think what they currently have is a sweet middle
Hey can you elaborate on what you mean by:
Most people see a generation error but don't realize that their image was actually generated but it's just not shown to them due to censorship by the arena filter
When our content filter is causing the generation not to happen there is a specific error message making it clear it's the filter. The "something went wrong" error message shouldn't be appearing if it's our content filter that is the reason for the block.
This actually needs broader attention not just to the arena, but to the AI community in general
Everyone feels it, experiences it, gets frustrated by it, questions it; this topic is the pink elephant in the room
And it's hard to find common ground on what is or isn't, or should or shouldn't be, acceptable or whatever you wanna call it
I mean, it's as simple as that but like I've mentioned above, better to contain it than to face it head on.
Yeah but so do the other models
Most models generate the images but users don't see them in regular models outside the arena
Some of the guardrails are external to the model that just filter out content after it's been generated
Sorry to say I'm not following what you mean by this
The glitches are still ongoing.
It looks like we'll have to wait a little longer for these glitches to be fixed.
WHEN SEEDREAM 5 ON LMARENA
When I try to update the image generation, the same version that's already there immediately pops up. It doesn't even last ten seconds.
-# Discord age verification..good or bad?..
If it's our filter that does the block, the block happens before the generation happens.
oh man is still bad ?
Hey quick Q? is arena free?
My understanding is the filter acts pre generation. But in case I have that wrong (I'm not sure the benefit to running the filter both pre and post) the majority of flagged content would happen pre generation.
Yes
i look gemini 3 pro 1k where is it ?
Maybe deleted.
Now gemini 3 pro 2k is work
And generation is still not work
However, the Nano-Banana Pro creates accurate art of the characters you use.
yeah
hii
And I hope this endless glitch will eventually be resolved.
yeah is get back soon
Yep
Im having a problem with a short video im trying to create. Its sayin it violates the terms of use. But I cannot see how
There are always problems. After captcha, now there is a violations issue
So it wasnt a violation just some sort of glitch?
not just videos, images too
Ah ok thanks, so best waiting a bit then?
I'm not sure they want to change the tos
bruh captcha is wasting time
Trust issues with a platform... also reliability isn't a thing if you're not paying for it
hi
That's irrelevant
No, it's built in by design.
this ( / ) option is not available in my search option so what i do now
For video?
yes
thanks
I would agree with you
If I wasn't using expensive models for free
And had to pay for them
Normally
You just gotta live with what you have
If you see it gets an error without knowing the cause that means the AI censored it.. wouldn't you be better off complaining in their discord...
It changed from 3 -> 1
??
It changed from 3 TO 1
This Wojak meme video is about the future of AI in 2050. This fictional concept video shows the dystopian future doomer faces where AI is taking over the world and everything becomes a statistic and algorithm. Its a continuation of the 'you will own nothing and you will be happy' 2050 videos.
Become a LBS channel member to get access to perks:
...
Please tell me, am I doing something wrong? But very often when I change the image, the GPT does the work, but Gemini has to be restarted 10 times.
is gemini 3 pro is nice
At first, I thought the problem was that neural networks work more easily with large images (starting from 1024)
, but gpt works smoothly with 128x or 256x.
yeah sadly 1k is removed, i hope it will get back soon
Agree
Where's the gemini 3 pro? (without 2k) Is he done? Not anymore?
Why ? Is it because it's faster b
idk
Maybe fixing bugs or getting removed
Those who know
yeah fixing bugs
IDFK any ai tools I'm happy about that is also interesting
You've been here since the last 3 months
Where is it?
Aren't there any leaders here? (well, for example, administrators or moderators, who would help with this issue?)
I don't think it's fixable in a way that
Nano banana error again. Huh
You just deal with it
We know
Does the arena have any kind of platform where people publish their work done with the help of neural networks?
google server weak
Google ai prototype form
There has never been and it is not now that there would be, other hubs such as the arena leave freedom, this is exclusive to LMarena.
Sad because the other hubs don't maintain the same quality as ARENA
Lmao it runs YouTube that's enough
Thats true and in this case, the money would apply to the situation because it's the most well funded
LMArena said on Tuesday its valuation had tripled to $1.7 billion in about eight months, following a new funding round where it raised $150 million, as investors continue to pour money into artificial intelligence startups. Investor enthusiasm for generative AI surged after ChatGPT's launch in 2022 showed its commercialization potential...
yeah with a 1.7B valuation im pretty sure the service will stay free for a long time
cause what's more valuable than money?
actual data
is it just me or why is glm just stupid 
probably both
oh man
Is it only me or has claude opus 4.6 thinking not been working for 8 hours
I mean freedom, not the monetary issue
I think Our data should be worth the fun beyond almost infinite
We can create whatever we want..
I love LMarena but that valuation seems crazy haha
https://a.ai just went live same owner as https://ai.com
ai.com sold for 72 million usd
What in the copy of llmarena is this
computer virus arena
Lmao
Nah it's actually a copy
Exact format as arena.ai (lmarena)
Y'all should seriously sue
seedance 2.0 will be mental..
Why is there no API access?
That is a real shame
cause since we only get like 5 tries per hour
for one model
i hope get back soon
you can switch accounts
infinite messages method
LOL
Understandable though. The team are still being the providers allowing us free access and it beats having to wait 24 hours with the diamond watermark instead.
But I guess that's the monkey's paw for ya. Get a free sophisticated AI image generator for less cooldown time, but have it be censored with TOS and more frequent crashing.
LOL OPUS NOOO
That is insane
funny cuz this actually works btw
(dont abuse ts fr)
is it possible on a phone ?
jeez its just deleting cookies
u can do that with incognito mode i think
lmao
u can js click and delete the cookies on phone too lol
or that
ya we know
Hello
@midnight peak is looking for a developer @neon tundra
What features would you add to the arena?
i would ban all default pfp bot accounts personally
I would add video support and a music generator.
I'm stuck with the "Infinite Generation" problem and nothing I've tried seems to work. I don't want to start a new chat because I'm in the middle of a heated conversation. Is there a way to resolve this?
opus bad
@quasi atlas can u ban hacked
@midnight peak
Is there any other way to save it? Because the methods in the help center didn't work.
<@&1349916362595635286>
ngl who tf seriously falls for ts
probably @neon tundra falls for @midnight peaks message
why yall bullying macus
"... wait do I even have a 'head'? Let's check..."
good, sup
This new server icon made it much more under the radar lol
problems ?
cause its a scambot too
Actually it's opus
yes its opus 4.6 thinking
That's why mistral is better
bro is getting mistral'ed
did they unnerf gemini
Not yet
But GA is still on testing they say before public release
So we gotta wait till March
lol i wish
I think the big problem is that new gemini upgrade has to be integrated on siri, which takes more time
Let's hope they got this quickly
I want this new version right now, I'm paying pro for it
guys is anyone experiencing this generating loading screen where nothing happens in opus
why nano banana pro 2k aint working
Hello
hi
That's why mistral is better
Why did they remove Nano Banana Pro 1k?
idk
i hope they back this model soon
like gemini 2k removed and is back again to use
It's a phrase used to describe public beta under anonymized monikers
There's a fair few platforms that do this.
Yes a private party held in a public park
You know what's worse? A public party held in a private venue. It makes it hard to be a photographer. The legality of it is dubious because it depends on if you consider the private venue being private as though it's known to everyone. If it is, then they all expect it private too, and then I can't take pictures. But a public party on a private property, is that ever private, ever anything you can expect privacy on? no
But alas, the law doesn't care about that sort of advanced logic.
anyone could technically walk by and see what's happening, but you're really only there if you were told where to look.
But a public party is public if it's public. That's usually not by intent but by actual practice, i.e. advertising, posters, etc
the more you think about and over it, the worse it gets
Thats the devs feeding it the koolaid they are drinking
My dog can't even speak a single word and I know it's more conscious than Claude, even if Claude could write encyclopedias trying to convince me of the opposite
Ai=
The same algorithm can learn to walk in wildly different ways.
From official representation it looks like Qwen-Image-2.0 is the replacement for nano-banana-pro
Or maybe even better?
Its definitely getting up there
wait did captcha get replaced cuz i don't see it when chatting with bots?
Man your English is not English
No they probably tuned it down
i know
Hard to say
Try sending the same exact prompt 5-7 times
It will cause captcha
Fr
imo reCAPTCHA was the worst choice for this site
alright
Qwen image 2.0 is released
I think it works great
which one
2512?
No
2.0
Like proper 2.0
Qwen Chat offers comprehensive functionality spanning chatbot, image and video understanding, image generation, document processing, web search integration, tool utilization, and artifacts.
how many tokens does kimi k2.5 have
Everyone vro
It's like the best all rounder model rn
-# also offtopic but W pfp
Everytime I make a vibe coded game with Claude opus I get Kimi to bug review and it NAILS it
KRIS
KRIS WHY THE HELL ARE WE IN ARENA AI DISCORD
oh ok message is gone
fair enough
there we go
this edit is kinda wonky but still works
Chat what's the difference between gpt image 1.5 and chatgpt image latest 20251216?
I bet it will be more complex
kimi better than claude 4.6 in code?
https://x.com/minchoi/status/2021074422682255864
lol i look this
The dark world boss is just gonna be Claude opus
It's better for code review
Claude for raw coding and Kimi to polish it up
But Kimi is absolutely better than GPT and Gemini at coding
That's for sure
oh ..that's great ..i had to fix the claude mistakes by hand ..now it seems i got a new tool
uh gemini 3 pro why always Something went wrong with this response, please try again.
when the conversation becomes big this thing happens ..
claude-opus-4.6-thinking
nano banana sucks, down everyday everytime
need fix it to normal again
I want Seedream 5.0
It comes from google server
I been saying it, gem 3 nano got nerfed, google nerfed it
this is insane
gpt managed to recreate a 1500 lines of code FROM MEMORY with no mistakes
(it deleted oldone, then recreated in new location)
claude-opus-4-6-thinking just dies after thinking for so long lol
I want seedance 2
still not a single crash or syntax error with codex 5.3
Yeah Claude really likes memory leaks and unused lines of code
Its cause there's a fixed 6 min generation time limit, it was supposed to be 13 mins but yeah bugs
They say they don't know why it's 6 min and say changing will require a lot of effort
so its not even worth to use it now
You can try giving it smaller task or breaking down your task into smaller steps
because its a bad model, consider gpt 5.3 codex instead
Cause this generation limit is applied onto each generation not whole sessions
or 5.2 high
with gpt its a guessing game if its gonna work or not lol
@echo sinew
The other way around haha
meanwhile opus hallucinating day and night
or this
(same prompt)
ye cro i dont care
Theres plenty of people/bots glazing opus and thats the reason why newer opus models truly get worse and worse
Yeah people have their own opinions & preferences
They could literally push gemini 2.5 as opus 4.7 and people would think its "insane"
opinions can be wrong, when they contradict facts
I think Gemini is better than gpt, at least for me on my generations
And Claude opus 4-6 the best
But has a lot problem
lol
gemini has dementia
opus is hallucinating
guys are we frustrated at the errors still happening?
also do you guys notice that it takes the company WEEKS to fix a simple issue e.g. retry button in battle mode
this is why I dont trust large AI companies, full of corporate slop
I mainly use Kimi K2.5
What do u use them for?
opus is unusable for actual coding
gemini is unusable for anything really
you GOTTA be a bot
In Arena u say?
in cli paid sub
Midnightfade ngl I just saw you deleted that message, and gave me a warning.
Its becoming unnecessarily strict, you are now just censoring speech.
I wasn't violent nor abusive.
This is a discord server & if someone here is going to be messaging, expect some replies and contradictions with other people. It's normal.
who invited this bot here
Google AI Studio yesterday was awful actually
I would ask to keep this conversation AI related please
Thanks!
And Gemini is too weak, now in Arena it got me, I liked the result
Its like if I don't want someone to reply or talk to me I can just tag moderators and have their messages or them banned? That's not good. They can block if they don't wanna talk & that's what I did.
gemini is so compressed that the results are even more gambling than opus
Yup
GPT might be slightly less creative, but its actually consistent, reliable and trustworthy
Gng what ai should I use for assistance with downloading stuff
Yes that is it
gpt for sure
Gemini is too dumb
opus will download way more unneeded stuff, gemini will download malware
Lmao please don't tell me that
I've already installed like 3 files from it
I need an ai
That's not gpt either
Qwen is too dumb too
Guys
How as? Downloading from AI, that's new for me
Aurora alpha on openrouter is defo a Chinese model
I think it's GLM 5 or DeepSeek V4
Maybe Qwen 3.5
I'll try perplexity
why not gpt?
What are you downloading ? Like do you need guidance in downloading applications or what?
GPT sucks
bro called gemini, qwen, gpt and claude dumb and proceeds to try perplexity
Usage limits
Real
Perplexity is garbage
Actual garbage
Downloading an application in Linux
Who the hell uses perplexity
Never used 5.2h or 5.3c lol
It's a search computer not actual ai really
I'm actually getting extremely frustrated at the errors on Nano Banana, Gpt-1.5 image and opus 4.6
5.2 high is worse than 5.1 high
It's not a llm
@echo aurora there is a problem with claude 4.6: it needs more than 6 minutes thinking but lm arena supports only 6 minutes, so every time I use claude 4.6 i only receive the "Something went wrong with this response, please try again." message
I don't even know how they managed that
It's an LLM
What are you on about
Brochacho
It's not a llm
never tried 5.1h, but 5.2h and 5.3c are way better than opus, gemini combined
i'm somewhat of a large model myself too
we all are tho
arguing over semantics
Lmao I still need an ai gng
Grok 4.1 search is great too
it has, 20/30 messages every 3 hours
atp use github copilot lol
Bruhh
No but if there's high demand priority is given to subscribers for thinking mode
unlimited usage for smaller models
Otherwise it's great
I'm not coding I'm literally asking it for assistance
I've never encountered any usage limits with kimi
That's odd?
man i'm tired of this
I'm as dumb as a rock dw