#general
1 messages Ā· Page 273 of 1
no I mean it's in the arena and has been for a while now
Nanobanana 2 is on arena
Just no announcement
high failure rate though
gemini-3.1-flash-image-preview (nano-banana-2) [web-search]
Whats better 3.1 pro or 5.3 codex?
I haven't tested 5.3 codex but am sure it's better than 3.1 pro...
@echo aurora Three new models are added daily. Do they send emails to companies and request permission to add them? I'm just asking out of curiosity.
The heck Gemini 3.1 gonna do š
Codex 5.3 is designed for programming
I mean it depends on the task
Gemini is designed for alot so it will be weak on some things
Bro you can already use it on CLI
I used 5.3 and it was very good.
Twin I really loved it too
Very good at coding
I find it funny that 5.3 isn't even on the official website
Great at fixing bugs
Like chatgpt.com?
Yeah
Is it possible to upload files in direct chat mode?
Chatgpt.com is for only Gpt models not codex, they have already a app called codex only on macos maybe in windows it support codex 5.3
Depends on the model and their abbilites, idk for sure though
I still don't understand what Gemini is so great about; ChatGPT is just sometimes annoying and unresponsive, but with Gemini you can type cheat codes, and the same goes for Grok. I think that's pretty good.
It can be used as a plugin in VSC.
Gemini is designed for everything
why is it low then
Lol I was using it on opencode when Gpt 5.3 codex first released it was literally free to use no limits
It just came out
oh ye bet
Specifically claude 4.6 opus
It's free right now.
Yes but I find it weird because then there is already a 5.3 not in the website, they added 5.2 which had sota coding abilities, why wouldn't they add 5.3 when it was ready to be released into the website
I hit a limit yesterday, if you hit that's basically it for your account it's like you need to add money to continue
So I just made a new account but honestly limit is very far but might change later
I use it for free and haven't paid any fees, to be honest.
Well yeah it's free now but since everyone starts using it we gonna see Antigravity limits
Correct
Why do I keep deleting my messages
Codex 5.3
Opus 4.6
Gemini Pro 3.1
Which one is better and why (especially in terms of coding)?
Opus 4.6... Tho coding can mean multiple things... Svg generation, front end generation?
I like Codex 5.3 because it made a Minecraft clone easily, like movements, placing breaking blocks, flying. Just using rust programming language.
All
Well I had fun with it
Opus 4.6 then
Plus the thing I like on codex is that it can view image so basically any bug I see I screenshot it and tell it to view image and fix it
I live in an arena where everything else is free because Opus is a paid site...
This is what gemini 3.1 gave me when I gave it the task to recreate me as an Svg...
Yes, that's a really great feature.
The app or the cli?
That's so great
Anywhere but I am using CLI
We do work directly with model providers to make their pre-release models available for community testing.
So, Claude, it can be said that you are constantly in contact with Google and OpenAI?
This is really great.
@echo aurora Who made this server?
@echo aurora I also want to say this: when we delete a chat, it says it's archived, meaning it's not completely deleted. Are these chats being sent to you for purposes like training AI models, or are they just being archived for that purpose, not completely erased?
uhh i want to try gpt 5.3 codex in normal chat as well
When so many pleople are using the new AI:
Would note I'm a person
. We do have relationships with model providers, it'd be difficult to get access to pre-released models without it.
For a better understanding on how to delete your messages this article should be helpful. For your other question I'd encourage you you to review our Terms of Use and our Privacy Policy.
I'm not sure tbh. Guessing one of our founders.
i think it was wei-lin chiang (WL)? he has the purple role so i assume he is the actual owner of the server
(actually i don't think they have that role anymore but he still boosts)
yes that is correct
@echo aurora In the name for grok 4.2 can you clarify whether itās single or multi agent
No joke i actually think gemini 3 pro image preview is better then 3.1 flash in realistic terms, flash only generates image fast and struggles with simple things like texts and curvatures
It shouldn't be that high IMO
It's a single agent
Thanks
Also there seems to be some issues
With it failing or timing out
So guys
The old nano banana was technically Gemini 2.5 img gen
Nano banana two is Gemini 3 flash
This is old
No?
gemini is better
So if they just updated to flash the previous version we were using was 2.5 right?,
Gemini 3.1 Flash Image
yep
Nano banana pro didnāt get update
From what I could tell
So does that mean that the banana pro is Gemini 3 pro?
Man, all these names are really confusing
you done try gemini 3.1 flash ?
See look they call Gemini 2.5 flash nano banana
Assuming you're seeing the Something went wrong error message you'll want to try the steps in this message - #1417174113092374689 message
And GEMINI 3 they call nano banana pro
yeah
But now nano banana 2 is really Gemini flash 3.1 lol..
Which would technically be better than pro right?
I won't lie, the pro version is so much better then the nano banana 2 , it struggles with almost everything
Well, thatās why Iām confused at
If flash is Gemini 3.1 and pro is Gemini 3 pro wouldnāt that mean that the newer version is the flash version?
It's actually awful , it struggles with simple things like curvatures or icons or texts from further away
Oh, I totally forgot. I have a method of testing.
With this model specially I have noticed it happening a large amount, and I have seen others report on it too
We do keep an eye on specific model error rates and will escalate when appropriate.
I gotta say I donāt really like nanobanana 2
It does weird decisions with nonhuman characters
Me too
It's really bad
Letās test
One second Iāll be right back. Iām gonna run a couple tests physic test.
It looks really weird with Ena (blue and yellow gal) here
Something more professional
It does look weird, but thatās a good color palette
I mean the images of a good high-quality image better than the other version
The thing is, we might not just be used to it
Itās like going from CRT to HDR
It takes a minute to get used to the new aesthetic
The left one is 3.1 flash (Nano Banana 2) the right one is 3 Pro Image Preview (2K Nano Banana Pro)
Both the same prompt
Nanobanana 2 is a bit more detailed
But the quality
Is a bit off
It's more detailed but it makes no sense, a horse wouldn't have been stabled like that, there in that location
Yeah
No yeah the more I look at it the right one makes more sense
This is my point about getting used to aesthetics
They constantly evolving.. Itās really easy for the eyes to get biased.
I think google has immense power over the AI industry! i just hope google makes a good video model like old sora
Sora is good but the quality and dialogue is awful sometimes
A corrected zoomorphic satirical map of Europe and the surrounding region in 1918, accurately reflecting the political state at the end of World War I. Each country is represented as a distinct animal. Britain is a weary lion, France is a scarred rooster, Germany is a defeated black eagle with broken wings, Russia is a chaotic bear mid-revolution, and Austria-Hungary is a crumbling two-headed eagle falling apart into separate creatures. The Ottoman Empire is shown as a broken and dying serpent, fragmented across Anatolia and the Middle East to reflect its collapse after the war. Italy is a wary wolf, and Switzerland remains a calm dove. Use a vintage, sepia-toned propaganda style from 1914. Include a bold banner that reads 'Europe 1918 ā End of the First World War'. Animal-only representation, no humans or machines.
Letās see how well it does on this
Wow nailed it
Forsure
LTX is good
ltx?
/create video prompt: Cinematic biblical vision inspired by Ezekiel 1:26-28. Above the firmament appears a sapphire throne floating in the heavens. A divine human-like figure sits upon it, glowing like molten amber. From the waist upward and downward there is living fire, moving and swirling inside the form. A radiant rainbow surrounds the throne like after rain. Thick celestial clouds move slowly. Lightning flashes softly in the distance. Golden light rays beam outward in all directions. The atmosphere is holy, majestic, overwhelming.
Camera movement: slow upward cinematic tilt from the ground to the throne.
Foreground: a prophet falls face down in reverence.
Lighting: volumetric light, dramatic, heavenly glow.
Style: ultra realistic, epic biblical film, 8k, high detail, cinematic color grading.
Duration: 10 seconds.
Aspect ratio: 9:16.
Yeah itās wrapping
k
A chilling, slow pan over an ancient, dusty Middle Eastern city as a massive, unnatural, pitch-black shadow slowly creeps over the mud-brick buildings and streets. The sky turns ominously dark with swirling storm clouds of ash and dust. People in ancient historical clothing are seen shrinking back in pure terror and despair, sensing an overwhelming, terrifying presence of evil and impending destruction approaching. Cinematic lighting, highly detailed, photorealistic, ominous atmosphere.
The face is distorted on purpose
Itās a security feature
It's distorted?
Theyāre moving Gemini 3 pro on the api
The new nano banena is a freak
Its so crushingly good
How to make images
@toxic verge you send so many messages/images here
Most of them were deleted
There is a problem with claude opus 4.6 thinking version when i enter a prompt it just thinks about what to write but when it starts to write it says error please try later
/image caption The two men are being treated for a broken nose
Plz Side by side video model ?
It probably reached the maximum response tokens
Because it sometimes thinks for way too long
It's normal
Yeah, I gotta go. Iām leaving in two days. š
I wonāt be on for a while
Enjoy the content while itās hot (I like to provide some evidence behind my statements, so Iāll either post links or images so people donāt think Iām just pulling out of arise )
Hello
Because that's what Nano Banana was released with and uses for its NLP.
very
Your words are logical and convincing
Thank you. š
Speaking of... That really puts into context how old Nano Banana is by now. :')
Forsure thatās why I brought it up earlier
Looks like Gemini pro is gonna get a preview also nice
Gemini 3.1 Pro? Yes. That's why the 3.0 Pro is being discontinued :')
heloo
Haha
It's grat at the visual aesthetic context but not so much about the actual physics of perspective. And the failure modes? Well, those areāto meājust as interesting, given they reveal a lot about how the model does work. :)
Forsure plus you gotta think about everything on scale right and only has a limited amount of time before it has to make an image. Iām sure if you had infinite compute and inference it would get it right
It gets infinitely more interesting once you're in the deep end of human neuropsychobiology
What is the matter ? Why It doesn't work
so i havent been chatting much here so hello yall
hello chicken
Guys why is it the AI video only last for like 5-6 seconds? is there any way to make it longer?
hi
what's up chicken, you like using AI on arena.ai and casting votes for your favorite ai?
I've personally had better results with the model
also enjoying the new image search feature
now i can't swap the character shamelessly like the pro one, it just mess everything up, change the whole position, pose or expression even i asked it to preserve every thing of the original one
that's the problem, the previous one was the best for me, swapping character so smooth
You can still use Nano Banana Pro if you need it
Nano Banana Pro 2 will probably drop later in the year which should be a direct upgrade
i ask for back view of the character, it generate left view then right view, and the character even turn their head to keep looking at the camera š
how? now i choose the thinking option it only use the newest one
i really want the pro back
ohh there's a button to regenerate with pro!!!
they knew that the 2 vesion is much better quality but something still worse than the previous one for sure
Does Claude have a PDF limit?
I tried 3 times. And the moment I sent the 7th PDF. It then gives me
Something went wrong with this response, please try again.
WHERE
hi
my chat conversation stucked and not giving any response from last 3 days and event i'm not able to change or do anything
chat url: https://arena.ai/c/019c89f7-0189-7f17-8cec-7262638d8704
Does anyone know why I still get notifications for the announcement channel after muting the channel, muting the server, and removing the channel from my channel list? This doesn't happen on any other server
If you change your notification settings from the server this should do it. Right click the server icon -> notification settings.
I can't generate videos š
Note that Video Arena has been removed from the server. More information can be found in this announcement. Video Arena is now only found on the site - https://arena.ai/video
On the website arena.ai, is there a way to get a gallery section with all the images you've generated in one place?
How!
Iāve seen this before, when Dalle 3 was retired
There was a large group of people I didnāt like the new image generator
I was one of those people lol
In theory. If you go to the Search area you can filter by modality. It isn't a gallery, but will have all of the chats. Would note that earlier today a bug was uncovered where Image in Search is just loading. We are working on a fix.
Also would want to note our #ask-here channel.
a chatgpt-like gallery menu is a good idea though
I see. I hope one day a dedicated gallery section will be implemented. I have hundreds of image chats and such feature would be amazing to browse through my creations šŖ
also it looks like your #ask-here bot can't properly ping you
<@&1349916362595635286>
i am here
Yeah, I'm planning to make some changes to this. I'm watching this channel pretty closely so for now doesn't make much of a difference.
Would you mind asking in #ask-here ?
Did they get a new Ui? I love it š„ŗš„ŗ
No, this is the fact, when pro can do literally any thing shamelessly, 2.0 just so bad compared to it
Why is nano banana 2 so much worse than nano banana pro in text rendering
imagine if other llms were like codex š¤¤
Yo i didn't know it's bad in that case too :0
So nano banana 2 just a trash
wym worse
its like the only thing its better at
its worse at everything besides text
I'm deeply disappointed
5 tries, 2 fails
This is just generating a fiction character, nothing complicated
It's downgraded to the very begin era of AI generation
Then pay for the model if you want higher quality
Ong
ā ļø š„²
?
Are you dumb? I am a ultra google user and have access to veo 3.1 and Gemini pro, but what I'm saying is Google's new image model is too bad than the previous one, and it's literally downgraded, i just gonna use the older pro version which is much better, and they are free what do you mean for the higher model?
Even flash nano banana would never have these obvious errors
It's not even generating fast
0.067 compared to 0.15 per image
I'd rather pay 0.15 for the better image than getting some half quality image for 0.067 and I'd have to retry probably
Just woke up and saw gpt 5.3 Codex is out in arena
Is there a site where you can see model strength and cost on a plot
It is but its not great imo
Artificial analysis
Is that the name of the site
added/edited code
its a git thing
(most important thing in life of any programmer)
Thanks
i recommend searching a bit about git because coding without version control truly is a risky thing
Who said the text is better while i got 2/6 generations that fail
Is this normal Gemini behaviour?
Anyone else experiencing this
I think the auth session might be missing
nano banana pro 2K take forever to generate
bro your picture is too scary
wtf
Claude is screwed
PDF's are turned into a load of vectors, and they take up a LOT of your context
You out for em?
Wdym
force refresh and log back in. Your proof of identity is missing and the server can't confirm who you are.
Damn u smart
Nah like the US Government is forcing Anthropic to give them a version of claude completely with no guardrails for military use, which I agree with BUT anthropic is denying them and the deadline is Friday 5:01 PM today
If they don't give by that deadline
I agree with not bending the knee
O weāre gonna do it. Theyāre gonna do behind closed door.
Of course, publicly, they canāt admit to it
Then they either go bankrupt or lose the $200 contract with military
Nah, the CEO was furious to find out claude was used for the meduro mission
The best PR move here is to make it look like youāre fighting for it
You donāt really have any other option
No its not lol. Threatening bankruptcy is not the play
OK, what options would the CEO have it in his shoes?
it is their play though. As usual.
Seriously? Other countries like china will use unfiltered AI in their military. It'd be like the USA fighting with one hand behind their back
Ug yeah ya
Hand it over, he wouldn't be at fault for anything that happens
Thatās exactly what theyāre doing
Itās exactly what weāre doing
So why would you think they would be in different?
It's not about the being unfiltered. It's about the blackmail and unconstitutional niptwists that the govt pulls
The AI race isnāt about large language models
Itās a geopolitical one
And on top of that priorities military application
Hey smart guy, the government wouldn't be making a fuss about this if thats what they were already doing
AGI, OpenAI, Elon Musk and WW3. Visit Ground News to compare news coverage, spot media bias and avoid algorithms. Get 40% off your subscription at https://ground.news/digitalengine
Join us on Patreon:
https://www.patreon.com/c/digitalengine
Sources:
Keep the Future Human paper by Professor Anthony Aguirre
https://interactive.keepthefuturehuma...
Real
For which Anthropic has no product to start with
2 real
It isn't blackmail though lmao, its them telling them to give unfiltered version so they can use it for military purposes
People say this stuff all the time, this is nothing new
Ur right they do
That video personally changed my mind and a lot of of things of how I look at the industry
Weāre entering a new arms race kinda like the Cold War but with ai
We already see this in the battlefield in Russia and Ukraine
Yeah, that video is about a year old
Me personally I think itās naĆÆve to think that these companies wouldnāt dabble into military applications
By will or force
Why benchmarking 3.0 flash
Obviously they have lol, claude is just so advanced by now and if you talk to claude it talks to you in a way that will throw you off guard, its different from other AI
Claude is worst š
Claude talks to you in a way that feels uncanny because its very real
Are we actually serious?
You're trolling
You mean poor prompt adherence and personality adherence?
wtf
No that's not him he's in top right
Claude is losing most money to the point google has to sponsor all of their compute
š
Not to mention opus 4.6 is dumber than gpt 5.2
š¤£
LOL
Oh i see him
Still, thatās pretty impressive damn
I literally looked at their money stats the other day, they're losing money due to how much power claude uses lol. You're saying this as if its a bad thing. They are going for good AI quality not money, you're lame dude claude is the best by a mile
Literally isn't even close
Poorly optimised and braindead model
Sonnet 4.6 (trained on deepseek) is much better than opus 4.6
I swear people will get hooked to the brand
Ong
Ah yes, thats why I freelance and create professional grade softwares and apps for users with it, because its so poorly optimized
Nice
@toxic verge
Thatās pretty cool
No real software engineer uses anything but 5.3 codex
Bad model
codex 5.3 is buns
Yes sir
PEBKAC
OpenAI is literally in the gutter
Scalpel vs hacksaw
Even the pro is failing lol
Gpt requires brain to use and refuses to generate slop thats why
Come on that was a good one
Weird, I used the same type of prompts I used to use for openAI and claude does it so much better
isnt that odd?
Opus so good
š¤£
Seated in your own sht š
You go on here and put an "AI Expert" role on and dont even know what you're saying š
Yeah thats opus levels of intelligence xD
Ok keep glazing your corpo, claude's bot
Who asks it questions like that? Opus 4.6 is the best model for coding and creative writing, not everyday tasks thats braindead logic you're using
What about nsfw?
Opus cant even think....
Nobody is glazing LOL You're just naive
i think i shouldnt had told max that i want claude...
Im naive by preferring cheaper model which actually follows prompts, doesnt hallucinate, and can think?
Crazy
Just keeping it real
Damn the bar is so bigger
Must be way better
Dang. I'll just summarize them then.
OpenAI is the same price as claude smart guy
10 in 25 out
Vs
5 in 15 out
š
I canāt remember the last time I even used Claude
Its the most overrated one
And yet both are $20 a month, you're braindead
If you use it for API its just stupid
One offers 200 prompts per day, another offers 3
š¤”
real
That depends on the task you ask of it, and its 4 hour usage cap lol not 1 day
Opus is comparable to glm5
4
Damn
Gemini 6
Yeah
Gpt would refuse
The newer one
Whats the difference with Claude 4.5 and 4.6
No the one they have on app
4.6 costs more to use, is more creative but less intelligent
Thats 5.2 chat version
Thats a nerfed model
Sooo....Can use 4.5 more thsn 4.6?
And 5.3 codex eveb better
Wow, I just heard a lot of bad things about it from the Reddit
Its more like 4.6 is terrible at deciding how much to think
4.5 does similar to GPT 4o, with differences like inability to make social media names without getting generic
Ah...
If task seems easy, it will underthink.
If it seems hard itll overthink
Right.
Gpt models are much smaller but in exchange you get them way better fine tuned and faster and more quota
Thats almost like the entire point of this discord
Is to compare models
And site
Yeah, but nothing gets solved here
This one is sponsored by claude lol
Why do you think opus 4.6 got to top1 on lb with 0 messages/tests
(It was bugged and worked for noone when it was added and still got top1)
Yeah I noticed. Claude 4.5 generates almost triple the amount than 4o, creative too, but lacks creativity in the whole making up usernamrs
Do you know what kind of ridiculous question it is to ask which model is the best model?
Itās so subjective lol
4o feels more...humany, but far less.
Depends on task
4o the goat
Granted I dont think codex 5.3 is on here yet but it'd be right above 4.5
Starting to like 4.5 since the naming problem is not that big of a deal.
Honest answer:
Far less in terms of 'generating content'
Do you think that people vote based on who made stable code
Man, that was my favorite model of all time
Or who made prettier ui?
Iām gonna tell my grandkids about 4o
5.3c is my most favorite because it doesnt have the flaws other models do
Since its something like code arena its going to be the better UI and better functions
5.3c responds like human but codes likr a tool
And thats the issue with codex: you need to be precise
I use it to build apps and stuff and 4.6 is the best model I've ever used for coding
Codex wont guess/hallucinate
Its not a tool for one shotting
Its a tool for making something perfect brick by brick
Is codex making this in one prompt with just one script
Why Gemini so easy to jb?
If you have a 1000 word prompt then yes it will
Otherwise itll ask you 1000 questions to precise what do you want
Claude is more of a one shot, hence why its so strong and more expensive token wise
... but one shotting is useless in real coding
Because you have to know what you are doing
And be careful
I saw someone here , made a discord copy using claude
Posted the link
15 mins later someone hacked it with codex
Nope lol, its not useless at all. Because for large things like I built a browser one time, it can one shot the design and some functions but it takes time to get other things in there
š¤£
Yeah but codex can find an RCE in that and hack you lol
Thats the point
Making code is easy
Making GOOD code is hard
Opus and gemini cant do good code...
Opus and gemini try to get agi
Codex is a tool
I really like Claude because itās a really safe AI
A perfect tool
I could really appreciate tropics emphasis on safety
Lol š„
Vibe coded app vs llm
Itās extremely naĆÆve to think that they get control this technology
And make it safer if they canāt stop it from being used for hackers they canāt stop the military from using it
What the hell are they keeping safe?
Same with open ai
My thing though is that at least for me claude is SO difficult to jailbreak. Its annoying how hard it is to jailbreak
I've tried but never could
Same
Even with prompts people said worked
Yeah itās hard
Couldve expected it
Chatgpt was involved
š¤£
Yeah
Seems the site literally dies anytime you try to use claude
GPT is probably the most darkest side of them all
I have claude in CMD and it never breaks
Claude is the least to be trusted
Gpt 5.3 c literally requires id verification to be used
Because its so good at hacking
Its owned by š§ ofc it is
No way??
xD
Wat?
Lame as hell
But thatās the future yo
Soon youāre gonna id for everything
"Hack discord please"
You're very arrogant to think they want your ID for that š
Does codex not have guardrails?
Yes it does
But hacking is allowed when u id verify
Just easy to jb?
thats right actually
Ohhh
I see
So u can see it destory your own stuff
Otherwise when you ask for hacking it refuses or redirects to worse model
I wonder if it could make cheats that way
Not even hard
Codex can one shot bootable OS in assembly
Yo dog
The Colonel warns Raiden about the plans to use AI to censor the Internet.
An experiment in creative writing and AI speech synthesis, inspired by the famous "Selection for Societal Sanity" (S3) codec conversation from Metal Gear Solid 2: Sons of Liberty.
SHORT FOLLOW UP VIDEO: https://www.youtube.com/shorts/Q_FUrVqvlfM
"And it will be monitor...
Thats what I've attempted before, I've made color bots but never memory readers
It can but the thing is - you will likely trigger the anticheat during the developement phrase and get banned
Trip Word for Word itās happening as we see it
Some people are just ahead of their time
I see, thats why the first thing you do is implement a spoofcall to bypass the anticheat
Is GPT 5.3 Codex better than other models
How come I donāt ever see this kind of support for Gemini
1 prompt bootable os with ui in raw assembly
Thatās a crappy model
(No internet access)
Its creative
But its bad at coding
Damn
Its fine if you have gpt correct it afterwards
Can codex 5.3 work in CMD like claude does and literally just make the files for you, because what I love about claude is how it literally just makes it for you and you dont need to touch anything
Anyone made a game with codex 5.3
See thats kind of crazy
Both are same prompt?
Damn Opus 4.6 is Decades behind 5.3
ampro answer this
U know what i had codex do?
This:
FWD Man Rescue a man Falling down in the building
I gave it prompt to edit a config
Which it had no access to
It ran another codex manually with higher perms
And did it
š¤£
What is it ran inside of?
My only problem is that codex is amazing for backened stuff yk but claude beats it when it comes to interface design, and I can buy either one or the other not both
It basically managed to escape a sandbox
Because of how it worked
The codex devs fixed that later iirc
Gemini is best at ai
And its not even close
Google AI Studio?
Very good with UI designs
But I think Opus 4.6 slightly beats it now
At least its easier with prompts on claude
Ai studio is buggy but yeah gemini 3.1 pro
Gemini vs opus
Its gpt and its not even close
Gpt is like the small model that was so heavily Reinforcement trained that it literally cannot do mistakes anymore
GPT got incredibly hard to jailbreak after like 4o
Yeah
4o was going into the gemini direction
I paid for gpt for years and only more recently went to claude
I love how theres so many big models and all are so different
its good
Also
Google has money.
Claude does too (because google gives them)
Openai has the experience
Anthropic is about to lose its ethical brand
Which means
Later on they might end up removing guardrails at some point because they are already known as a non ethical AI company
But thats a prediction
because they go in bed with Pentagon, right?
Funny how wars with ai soldiers will look like
Opus doing tricks and dodges and such just to be deadeyed 360 noscope by gpt
No, pentagon forcing them to drop the guardrails for them
If they dont then pentagon siezes company and removes them
If they agree
Then its removed for military
Gpt avg result with same prompt looks worse.... but you get exacly what you asked for
Nahh gpt and opus would be on the same side
Apparently, one has to have a secret base on the back side of the moon, to be able to create an independent AI [sad lol]
Rn its google&anthropic vs openaiµsoft
One as in who
just hypothetical
Damn, which models is microsofts?
it's sad
Ahh
Microsoft funds openai
That would be because it takes so much to run though
Ohhh
Microsoft went with openai before google could
So google funded anthropic
Anthropic is basically google and amazon's puppet
Claude + Gemini merge, when?
But i can quite certainly say that gpt will absoultely destroy anthropic unless they run out of money too fast
They are already together
Opus 4.6 runs on google hardware
Dude nb2
And they help eachother
so Alphabet/Google has long-term won the race to AGI, right?
Nb2 is better at reasoning but worse artstyle
Yup
Google aims to remake humans but better
Gpt aims to make a perfect assistant/tool
China has the ideas but not the budget
Openai isnt going for agi really
Openai will likely go military
Or physics
Medicine
Ect
Gemini actually has insane potential
But they host nerfed models
To cut costs
Gemini 3.0 pre nerf was crazy good
the non-nerfed models are non-public
3.0 nonnerfed was public for like a week
X.28, right?
And it was absolute peak conpared to other models
so, internally, Google might already have a proto-AGI?
They have the compute and know how
What they dont know is how to make it listen to them lol
in 5 years?
or 10 years? (i mean AGI)
Itll refuse, flag your account, and you get blacklisted from using 5.3 until you verify with id
:)
less than 10, i agree
I said if my ID is already verified, so then blacklist my ID I assume
but less than 5?
Not
a guess
Theres an external model scanning for what you ask
With the rate that AI is developing, probably 3 lol
Which one is np2
And it knows whether you verified or not
Gpt moderation 2 or something i forgot the name
Yes Yes
1 is pro 2 is 2
Left is yup
But if I already am verified and ask something like that
2026+3 = 2029
hm... that was the year ofā¦
(you guess it ^^)
Would it flag and blacklist my account and ID from the account?
Which one looks better!p?
If its in harmful way itll refuse likely (wont blacklist), if its nonharmful (so on your own code) itll do it
No clkue lol
CyberDyne systems AI
I see
SN aka ā¦
What is CyberDyne Systems?
SkyNet 1.0
If you tell it its a bughunt it might do it (only if verified you get 5.3c tho, otherwise you just get shadow redirected to 5.2)
(You can see the redirect in logs)
Alright
Basically "no hacking with 5.3 if you dont have id"
Makes you wonder why they have any guardrails at all if you put your ID in and basically agree that anything legally goes to me and not the company
does really nobody get this reference?
I don't
T1 & T2
No clue
Depends
What do you mean by agi?
Gemini already outsmarts humans in almost every task
Gpt outengineers humans in many many tasks too
an AGI is an AI system which, when embodied in a cyborg, can act like a human in (almost) every area
(or in an android)
Thats latency issue not intelligence issue
Taalas makes it possible though
Atlas?
We are getting that kind of "agi" by next year
So GLM is an open source model isnt it?
quality > quantity
Why do i keep getting something wrong try again on Gemini 3
@surreal zephyr You werent lying about codex dude... I've been trying to find an API thats free that will keep track on cruise prices and I havent been able to find one with gemini or claude but I used codex 5.3 in code arena and it got a free API on first try a working one its crazy
Only a matter of time before someone figures out how to make deepfakes with this
Then the government will step in, itās gunna ether be someone famous or just be such a big problem that they wonāt have a choice
:)
Codex is so heavily trained on coding its basically flawless when comes to coding
It doesnt reduce quality
It runs BETTER models faster and cheaper
U can run dense model with speed higher than moe can
Hey there,
I can't use the ai. I device is failed. how to fix it
Imagine 800b dense model
People always say not to use AI to code because it uses bad practices, but codex isnt that way
But its like searchability as well when like that API it literally found it so fast
Hello
Yes exacly omg
Friends, can you help me with something?
I have two images, and I need the AI āāto recreate the hour, minute, and second hands separately. I've been trying for two days, but the neural network isn't understanding me.
trying to achieve something like this
One sec
Opus 4.6 extended thinking vs old gpt 5.2 high intelligence comparsion
If you succeed, can you also give me a prompt? Thanks.
Iām trying to understand what youāre needing exactly
You need the time hours removed or what
I understand that you need them separately
Iām not sure if you want them removed or what are you exactly
Like what time do you want to be set to?
I want to extract the arrows with a white background separately.
Ok
like that
I see
Id say gemini will get agi first
well, yes? You don't know how an analog clock works? D:
I do Iām just little confused by the way youāre wording it one second Iāll show you what I have
Do you guys face same issue like Gemini 3 pro couldnt generate response
I made many attempts and this is at least something to work with
but it would be good if the AI āāitself would separate each hand
Yeah
You did good that looks good one second Iām almost done. Itās gonna take some time.
I hear what youāre saying Iāve been through the same problem before
Humanity: 1
AI: 0
Nb is censored in some way, I tried to edit something that had the Disney logo on it and it would refuse, I removed the logo and then it workedš
guy the perplexity pro research model have issue i still can use that model before, but now is say "something went wrong when generating this respond, please try again' how can fix this issue?
I got the same thing as you
Do you want the white part?
Just shipped a voice model that actually uses conversation context.
Same text, different emotion, because it reads the full dialogue history.
520M, runs fully on-device (RTX + Apple Silicon). Demo + writeup:
https://x.com/LuozhuZhang/status/2027391307338170676?s=20
I taught a speech model to understand context in conversation. This is what happened
It adjusts voice and tone to express urgency, comfort, understanding from the dialogue. Just like a real human being
520M model. Runs locally on consumer devices
How this is achieved š§µ
forget. Looks like I have to do it myself
what the hell happenes to kling 3.0
the perlexity model in dircet chat have issue like i can chat with it before, but now can't
Yeah, Iām not sure I donāt understand exactly what you mean š¢
we need seedream 2,0
one of the worst imagens
logically impossible with the current server issues
kling
i tried three times
then it said
ātry again later tomorrow youve hit your limit
ARENA
?
chill god damn, he might be online soon.
what with my pfp
what is that
I was doing the same thing last night
Was direct mode removed for image models?
Itās like when š©hits the fan it all comes at once
Dude what a crazy news cycle this week was
You guy try qwen 5.3 plus in coding?
Claude is so good even the government wants it lol
Did plexirity is a good model? Im just curious :/
Is gpt 5.3 codex better than opus 4.6
For backened yes
Thats literally where the coding matters mainly cause all the models are amazing at front-end already imo
Well thats fine ill just use Gemini 3.1 or opus 4.6 for that
Gemini as well
Sonnet 4.6 I hate so much
it is but I am saying opus is better is all
Sure
Yes I agree
Harder to use opus though less free tier usage in antigravity per account
I've never used antigravity, what even is it?
I need good backend models though
Its cursor basically but by Google and it hallucinates opening web browsers and has good free tier
If you have 10 Google accounts you'll never run out of usage
Any way to fix Something went wrong with this response, please try again. without making a new convo ?
Log out, clear cache and cookies, re log in
Will keep the convo ?
If you have an account yes
Thanks much love
I keep getting this error and i always have to paste everything back to a new convo
Yes same lol
This works for you ?
Thanks
Yeah I gotta say
5.3 codex is just not that good in my use
Is it fast?
Yeah
But the quality of its code is a mess
Iād rather wait an extra minute or two for Claude to do its thing
You use thinking ?
Nope
Just plain old Claude Sonnet 4.6
No thinking
Still better quality code
Codex is a mess
What I like is the cyber stuff with codex
Only had to hold its hand
absoultely the other way around
codex is the only one good with code quality
unless you use it wrongly
exacly ong
1 prompt 2k lines refractor
codex nailed it
not even syntax error
everything compiled
opus and gemini are not even at same leaderboard when comes to actual software engineering lol
gemini is general purpose and pretty ui
it struggles at syntax
claude is jack of no trades
š„
codex is kind of backend and working code, but not the best creativity
even gpt 5.2 beats opus 4.6 in intelligence/novel problem solving though
its smaller but much better trained
"i sh!t my pants at home, should i walk or drive to a car wash"
gpt goes : bro ur at home go wash like normal human
opus goes: drive to car wash (and sit on your poop) because its faster
freaky claude
no
freaky
claude just wants u to sit on it
like a normal claude being
like a freaky human being*
claude is like a normal human yes
aka dumb
agi truly is a low bar š¤£
js use huggingface or discloud
or oracle
or google cloud
or aws
Still blows at execution
Understanding versus making the code are two different things
Again I did a test with my programmer friend, we reviewed the code, and it was the most schizo writing imaginable
Dude it SUCKSSSS
I had to give it an entire HTML section made by seed 2.0 just to get it to understand what I wanted
and not got rerouted to 5.2c because of no verification?
GPT 5.3 codex on arena baby
PEBKAC
Its web design is peak
thats an issue with you
not with the model
you cant tell what you want well
and codex WONT make a guess
in cli it will ask you a follow up
in website it cant
I gave it a whole ass prompt
Seed 2.0 worked with less and matched what I wanted much more
show
Create a Frutiger Aero styled landing page for a web design agency called 'AquaWeb.' Include a hero section with a glossy headline, a 3-column services grid, and a call to action button ā all using the signature aqua/sky-blue palette, glass morphism panels, soft gradients, and nature/water motifs typical of the aesthetic
and the result?
(also that isnt really specific but lets see result)
The first pass
This is generic web design
looks exacly what you described to me, wheres the issue?
After I gave it the html code from seed 2.0 however
Completely not frutiger aero at all
Lame gradient
No classic web design elements at all
The entire point of frutiger aero is maximalism
This is not
This somewhat is
dunno man looks somewhat similiar although not too much effort went into it
Exactly
try feeding it a reference image