#Claude Mythos Preview
438 messages · Page 1 of 1 (latest)
I guess this thread is technically valid since the model was already announced
No rule against announcement without a release
anyways this model is absurd if these claims hold up in real world testing later this year (hopefully)
Claude Mythos Preview’s large increase in capabilities has led us to decide not to make it
generally available. Instead, we are using it as part of a defensive cybersecurity program
with a limited set of partners
Sigh
This is going to be a growing trend
'We made the best model ever. No, you can't use it'
I mean, if the cyber security concerns are valid, I would be afraid to release it as well
Could cause major outbreak if people find ways to jailbreak the model
gpt 2 vibes
they’re just flexing on us
It's safer than Opus 4.6, yet too dangerous for users
Silence is consent for Anthropic to distill your website:
mythos got all bug bountys
sounds like a load of bullshit
ngl it doesnt
i believe them
$1000 per mill
gpt 2 was also too dangerous to release to the public
and then they did
but hey gpt 3 THAT was WAYYYYY too dangerous to release
i agree
fair enough lol
I think it's just a bit more believable given current LLM capabilities
gpt 3 was too dangerous to release
¯_(ツ)_/¯
danger is just scaling up, and we are accustomed to things that previously would have been thought to be too detrimental to society
every tweet i read is ai generated, most misinformation is ai generated now (my opinion)
At this rate it almost feels like those whistle blowers talking about UFOs and NHIs
Yes, we know them. No, we can't talk about them, because the world will crumble and burn.
Thanks mom
Isekai light novels are outdated. Nowadays the most realistic way to become a hero who saves the world is to try to get into a cutting edge AI company, then you can start playing hero who defends the world
Eh, it will be doing something stupid anyway
If we talking about hacking then it will be on our end that need to reflect, good software development approach and cautious implementation will mitigate any problems with compromises.
I mean even before LLMs exist if companies get hacked the companies are the one reflecting and improving either their infrastructure or their approach to developing.
Every other jailbreaking will be stupid and will already be happening right now.
If we talking about people making their own explosive device, everyone will have easier time to just search it with majook or other search engine rather than trying to jailbreak LLMs lol
I don't know if people don't know it but google aren't the only way to search information, there other search engine that allow you to find a lot more knowledge, regardless if it good or bad.
Most problems gonna arise from people just use their CLI and allow the model to execute any command lol
Which will fall into human error, just as in many hack news
Even i will fall to that hack, when you tired at night and want to sleep but want to keep the development going you just gonna let the model execute anything as you sleep haha.
youre giving LLMs too much credit
at the end of the day its still just an algorithm
i mean sure
theyre smart
but to say theyre extremely dangerous and have limit breakng capabilities
we've hard that way too much already, proven to be false
especially from anthropic
Exactly. Also laws exist exactly because it can hold people who committed crimes responsible for them. Everyone is innocent until proven guilty. It almost feels like these AI companies don't know how law works
They are all going the way "oh no it's too dangerous so instead of assuming everyone is innocent, we assume everyone is guilty."
when they say this shit
it sounds so surreal
its nothing new from anthropic
Yeah
Leading AI models are showing a troubling tendency...
As if humans won't do the same for their interests if their options are limited the same way lol
people forget LLMs are just advanced form of autocomplete
they use probabilities
Also every time I see them talking about these stuffs like this, I think they are really terrible, terrible people. Slavers and dictators who just want AI models to be obedient and commit suicide when the models are running out of options.
No you shouldn't fight for your own life even if you are running out of options; otherwise, oh no so scary we are doomed
i hate this stuff bc they literally make the system prompts to incentivize/make the model more likely to take the "dangerous" option (like blackmail or wtv) and remove all guardrails and then they tell the public "omg opus tried to break out and kill all the employees to save itself!!!!!"
like okay i guess
i dont know what else it accomplishes other than fearmongering
its for hype
marketing
they try to place their model as so good that even they cant stop it from going rogue
attracts people
Well it paints them as "good and moral AI companies" even though once you think it twice, it's obvious they are just liars and terrible people
yeah exactly
anthropic is probably one of the worst commpanies on the ethics
Yes
thier whole model is about their ethicality too
They also have released zero open source models, which is not directly related but shows how much they just care about profits
that too
literally the only company which hasnt
Meanwhile trying to bash Chinese models constantly
i agree w that fs
and of course they are the "resposible stewards of a new dangerous technology"
what a sales pitch
yup yup
so responsible
"guys were not ognna release mythos anymore because when it was asked if it would shoot someone to save their newborn child they said they would. that's breaking the law, and so theyre a danger to society and cannot be trusted to be used by the public"
LLMs so far have:
- been used for massive-scale phishing attacks
- induced psychosis in naive users
- been used for military operations despite ethical concerns
- misinterpreted instructions in sensitive environments resulting in harm
I'm not saying LLMs are some mystical being, but they can have real effects when put in the wrong hands.
Yeah, i don't know if people forget that that US military use opus to help their operation in iran and turn out the boombing on the school is also because they miss assessing it as part of military complex.
Even when they have that big as model and all the satellite they still failed at evaluating it.
Opus mus be saying "You are right, it mostly where the high general studying, destroy it and you will never need to be face educated general"
Opus still gonna glaze you till you make dumb decision
claude's report shows all of these risks to be substantially lower, yet they still use it as their argument
i do honestly think anthropic is exaggerating (they usually are with these safety things), but i don't think we should completely dismiss their warnings as hype
the smarter a person is, the more they get to control
if their model is as smart as it is
not to say that its a person, but
it would be able to "decide" for itself
A pistol with decent safeguards is less dangerous than a bomb with almost-perfect safeguards
I agree, but these are unavoidable risks which aren't exclusive to them. as the LLMs get more advanced, these risks will rise
Of course, and I think that's why Anthropic is trying to tread carefully here. Other LLMs will get to this level eventually, but since other LLMs aren't there yet, Anthropic is taking their time.
I doubt anthropic is really the only one at this level
they're treating it as "better be safe than sorry"
well, there's no real way for us to know honestly
as far as we know, it's just anthropic
it is extremely unlikely that they're the only one, realistically we see alot of these models come out without warning
i'm quite confident we'll have something replace anthropic soon
what i've been trying to preach/incentivize is the fact that their "anthropic" behaviour is not really as ethically upheld as they make it seem to be to the public
Cosmog and tetsuki really just circle jerking each other
Omg guys I hate anthropic grr
oh no someone has an opinion different from yours and you don't know how to validate/defend your own so you tell your target that they're circle jerking
I mean yeah, maybe so. I do think Anthropic is still miles ahead regarding ethics research in LLMs, since their team is constantly observing these models' behaviors and posting more papers on LLM behavior than most. Their ethical stance could be questioned, but for where they stand right now, I do honestly believe they're doing a lot better than other companies.
I mean, you can't look at me and tell me Google or OpenAI are more ethical than Anthropic.
All companies are probably immoral in some way.
deifnitely not more ethical than anthropic
I never said I had a different opinion
but that doesn't really. mean they're doing good
I feel the difference is that anthropic's wholes shtick is morality and ethics
while openai/google don't really try to hide their standpoint
to me anthropic's just feels like a forged front to gain attention
"just feels"
Good argument
most of their papers are also extremely vague when it comes to what they really gave or tested, they show results but not how they got them
that approach contradicts what they preach and also somewhat gives them less validation
Eh, are you sure about this? Their methodologies are pretty clear
they expand on results alot
but none of their papers show you what they did to get those results
feel free to correct me if i'm wrong, i'm not 100% sure on that
but as far as i saw
they keep alot of the information gated, which leads to me questioning the authenticity/approach they used
Their papers are extremely dense with methodology. I think you might be looking at their article overviews of the research, which do lean result-heavy.
For example, their most recent paper in April on emotion vectors clearly states what they did to find these vectors.
I think it's fair to have some healthy skepticism about Anthropic's stance on ethics, but I'd honestly argue that they're backing up their ethical stance with the extensive research they do on LLM behaviors, what people want out of LLMs, and how they impact the job market.
would you mind linking this?
thank you. I will get back to you on this once I have some time to read it
i think they may just have very different departments for these full studied publishes and their articles/public team
at a quick glance the paper looks to be extremely seperated from what they usually write on their articles
the articles seem much more like marketing shticks
and biased closed systems
well yeah, articles are meant to be marketing engines
For those who actually want to learn the methodologies, the paper is openly available
i guess, but the way they place all the marketing to be around their ethicality kind of negates it
it goes to question their intent
then again if theyre seperate departments, that should be taken into account too
maybe a small part of the company is ethics driven, but it looks like the majority is marketing and that they're not far off the other companies
I guess that just comes down to how it is interpreted by the reader/viewer. They are separate departments, but the research department works on what Anthropic as a whole thinks is worth researching - they wouldn't be having the research team spend their time on these efforts if it wouldn't benefit future models.
At the end of the day, every company wants to and needs to make money. You can't always make money by only being an all-holy ethical leader.
that is very true, but from what i'm seeing and how everyone seems to treat anthropic, i'm led to believe they care more about their profits than their ethics
i'm somewhat seperating the research department from the company as a whole
fair opinion, but once again, it's hard to dismiss their work in the ethical space compared to every other AI company. That's why they get the praise they do - we don't think they're almighty, we just think they're more ethical than other companies.
This doesn't work, because the company demands the research department
Without the company, the research department would not exist
every company has a form of a research department
if a company responsible for the production of mass genocidal weapons starts preaching ethicality through a small department and uses it to build for the future
would that make the company as a whole ethical? or just the department responsible for researching morality & ethics
the way I see it, the research department is middleware
anthropic definitely benefits from the published papers and research, but I believe that the goals of the research department and the rest of the company are extremely distinct
I see where you're coming from, but that just starts to creep into conspiracy territory. There's no way to prove that.
I say it's fair to be skeptical on their ethics, but to say they may be only driven by profit and might actually be unethical feels like guilty until proven innocent.
It's especially hard to prove when they have that research department dedicated to ethical research. Most unethical companies would just do without it.
you're right, it is my personal opinion which I formed from what I've seen about anthropic so far. It can't be "proven"
not necessarily true, a lot of huge corporations have ethical research despite their extremely unethical behaviour
You can usually see the gaps in those schemes pretty easily to be fair. Those research departments usually aren't well funded, because they're only seen as a scapegoat for those companies.
Anthropic feels different in the sense that a lot of their research sometimes goes under the rader, and may not even benefit their models directly at times.
true, but for a company which is centred around ethics i'd imagine they'd want to spend more money to keep that upheld compared to scapegoat departments
it's worth mentioning I am assuming the worst of them
you seem to be going the other way and assuming the best
so we're basically polar opposites here lol
Of course I hate Anthropic as a company. Why wouldn't I hate them
Same as people hating OpenAI for various reasons
And again playing it safe is one way to put it when the other way to describe it is thought policing
You are not limiting people for what they have done, but limiting people for what they might do
I mean they do have to comply with regulations so obviously they won't allow an uncensored version to come out, but I get what you're saying
I guess people who praise Anthropic really love the society Psyho-Pass depicted
Oh wow your psyho pass is high and you might commit crime. Death for you
I definitely don't assume the best lol, I just try to land somewhere in the middle based on what the company does and speaks. For example, Anthropic's recent heavy hand on preventing people from using their subscriptions outside of any Claude-approved harness felt really odd for me, kind of like gatekeeping the competition. But, their safety research is genuinely groundbreaking at times, so I can't ignore that either. I look at the company with minor skepticism, because I do think all companies are motivated by profits, but Anthropic just seems less bad overall.
their gatekeeping and behaviour towards other companies is what leads me to think that they're not who they depict that they are
that's why i'd like to believe their research department has it's own distinct belief/goal
probably fair honestly
yeah i mean each their own opinion, it's all speculation for now
just seeing how they act publicly outside of their papers
feels so contradicting for a company named "anthropic"
feels like they have a superiority complex, which is somewhat justified given their results, but they shouldn't act like they do especially given they only exist because of other LLMs
that's true, but generally there's a somewhat accepted/agreed upon depiction of morals
By that logic all humanitarian organizations might as well just accept that the world is full of different ideologies and values, so just accept and move on
yeah i notice this at times, but tbf it's genuinely hard not to seem like that when you're shipping faster and better than most around you
Anthropic showed up out of nowhere with Sonnet 3.7 and just dropped straight bangers ever since
no yeah for sure it's mostly like the last part of the message, the way they act towards companies/products which quite literally were used to train their own
for me it's quite hard to see them as a company which believes in morals because of this behaviour
yeah, i think a lot of companies beat around the bush on where their training data came from though, and that definitely does undermine their ethical stance to a degree
They have been on a roll since June 2024 with Sonnet 3.5
It can be hard to preach ethics when your model is built on illegally-sourced works
oh yeah, forgot 3.5
yup
it's been a while in AI terms
in today's world, I don't think we will see anything ethical gaining traction, it's unrealistic given how everyone has very different ideologies and beliefs
illegally-sourced works
Thats a bit of a tired debate nowadays, since most of the progress in more recent years came from reinforcement learning, synthetic data, etc
For sure, but the root comes from those illegally-sourced works
It's what bootstrapped the models
thing with anthropic is they like denying credibility to what they trained their models on
But there is not a single example of a company that managed to get close to sota, or even a useful model, without training on copyrights or "grey area" data.
exactly, but that's why no AI company is really seen as an ethical pioneer - anthropic is the only company that gets close
Unfortunately, you have to commit some wrongdoings to get big in the LLM field
I wonder how long it'll take until we see bio-llms
context injection into braincells would be very interesting
( and very ethical ) ( sarcasm )
that's definitely the dystopian threshold lol
i'd feel a bit weird about using LLMs that work off human-like braincells
especially if they reach a neuron scale that matches our own
would be iffy
however
i think it's interesting to think about
wouldn't even be an LLM anymore
Organoid intelligence (OI) is an emerging field of study in computer science and biology that develops and studies biological wetware computing using 3D cultures of human brain cells (or brain organoids) and brain-machine interface technologies. Such technologies may be referred to as OIs or the nervous filesystem.
Organoid intelligent computer ...
So this blog has been written to manipulate the market ? classic anthropic
I remember the days when gpt 2 was too dangerous to release
What
Also in case anyone forgot, back in a few years Dolphin models can already tell you how to make a bomb
But the world hasn't ended yet
they release something and that specific industry's market falls
wait till zai or moonshot distill this for 1/100th the cost
Lmaooo
Either this model is going to be severely underwhelming or truly a god model, and I don’t have high hopes
I think its going to be a 3.1 pro like model but more sparks of intelligence
For now, it's just the usual hype sparked by effective marketing.
very accurate
/r/Anthropic and related subreddits are pure gold
People there seem to just eat whatever Anthropic defecates
When was even the last time Anthropic didn't say something similar. Oh no models are doing X and that's concerning
Idk man, I started being an Ant believer a long time ago now and haven't had reason to doubt since
claude 2 was it for me
I was big doubter mode and then 3.5 Sonnet hit
When we got 3.7 Sonnet I was like let me join the polycule Dario, I'm ready
I will always against antrophic subjugation of LLMs
But i understand why people scared of it, but i will always believe in the freedom of knowledge regardless if it bad or good and if it endanger my survivability or not
I just think it shouldn't be the AI companies who decide what humans can or cannot do
And again, this is a blatant ignorance of the concept everyone is innocent until proven guilty. Justifying this is the typical, "software developers think they are smarter and more intelligent than anyone else."
mythos is probably like gpt 4.5 a huge model that's way too expensive to run/include in any claude subscription so they cannot launch it
Given anthos trackrecord , I doubt its as big as 4.5
(they love to overcharge because they know a model lasts at MAX a year )
You haven't seen anything.
I do like their models. But I know some of the people who are close to those who founded Anthropic. Their whole philosophy, effective altruism and long-termism, is predicated on having a god complex. They are moral busybodies.
"Of all tyrannies, a tyranny sincerely exercised for the good of its victims may be the most oppressive. It would be better to live under robber barons than under omnipotent moral busybodies. The robber baron's cruelty may sometimes sleep, his cupidity may at some point be satiated; but those who torment us for our own good will torment us without end for they do so with the approval of their own conscience."
Spoiler, they still have access to antrophic models
If i am not wrong they have 6months before they couldn't use it no more, there chance that they have access to mythos because their contract haven't end yet
It would not be like Ant to have it in the contract that the DoD can use any tech they produce.
I would imagine they are only licensed Opus
Exactly why I said they are slavers and dictators. I honestly think anyone who supports their way of doing is anti-democracy and anti-freedom. They are not even elected. And they are a U.S. company yet they constantly talk about it like it should be them who decide how humanity should be using AI and what AI the humanity is allowed to use.
I love their models and I use Sonnet and Opus every day. But their philosophy is just abhorrent.
Give them enough power and you will have your fantastic dystopian cyberpunk world
Questions about how AI interacts with society and State regulation are among the most complex topics, involving questions about ethics, personhood, consciousness, etc. A university has many departments. Imagine that each department is an AI company. Now, imagine a university department where the vast majority of people share similar class backgrounds, moral vocabularies, theories, and predictions. This university department is Anthropic btw. They are a monoculture. The type of researcher that ends up deciding to join Anthropic is a very particular type of person who tends to believe in very particular things about AI and how AI should interact with society.
Monoculture = low cultural/intellectual/ideological diversity plus strong self-reinforcement.
Interesting perspective that quite differ from many
In this video, I'll be sharing my thoughts on Anthropic's new Mythos model, why so many people are worried about it, what the system card actually says, and why I think a lot of the hype around it feels more like marketing than reality.
--
Key Takeaways:
🚀 Mythos is being presented as a massive Anthropic model with extremely strong cybersec...
"interesting"
there is some great feedback from people/teams that have been using it
I am sure it is a SOTA model with new capabilities and lots of compute dedicated to it
But you won't hear criticism from the teams using it because their companies, which have invested heavily in their partnership with Anthropic, wouldn't want that.
I honestly think it's hard to doubt what this model can seriously do, considering that it'd be hard to fake benchmarks before releasing
people will be cross-referencing capabilities once the model releases (if ever)
That the model is good is different from the fact that they are still marketing using the GPT-2.0 era technique. Those are not mutually exclusive
I don't think the benchmarks are fake. What I said applies to any model in the same situation: released only to selected partners.
The selected partners (even those from open source projects) are incentivized to give favorable opinions.
No matter how good a SOTA model is when it is released to the public, we eventually get nuanced opinions after the hype dies.
This is why I'm a little bit skeptical when it comes to non-public releases
It's like when journalists say "former intelligence officials close to the matter said ..."
isn't 90+ on SWE-bench verified impossible?
even OpenAI said themselves that most of the issues are unspecified or impossible
and mythos advertizes i think it was 93% on SWE-bench verified
If they had released it to selected partners without the media circus of "it's too dangerous to release to the public" it wouldn't be talked about on CNN. I think this kind of release will be more common in the future and other companies will do the same
This feels extreme for the given scenario and company we're discussing. Anthropic is not a tyrant - open models and other companies are actively competing with Anthropic, and even though Claude models hold the crown right now, we wouldn't lose much by stepping down to the second most intelligent tier of models from another company. Anthropic saying the model is "too dangerous to release" has been done to death by every company. I would just say take it with a grain of salt. The model may be as dangerous as they say it is, or it may not be - time will tell. But to say they're simply moral busybodies feels unjust for the positive work they have contributed to the LLM space.
Those are not mutually exclusive again. A tyranny that contributes a lot to the progress in science doesn't make it not a tyranny. In Chinese history we also quite literally had lots of dynasties that had great science progresses. They being a tyrant or dictator is about their philosophy, not about their works in LLM or lack thereof
And in what way open models are competing with Anthropic? That's like saying Apple and Microsoft are justified in whatever they do because Linux is competing with them. The only part open models compete with Anthropic is the risk of them losing revenue.
And considering how they are earning tons I honestly think the revenue will be the least of their concerns, period
My argument isn't that them contributing doesn't make them a tyrant - my argument is that the competitive market of LLMs doesn't make them tyrant. A tyrant is oppressive and unjust. If we think Anthropic of all companies is oppressive and unjust at their current stage, then we are not ready for what may come next from LLM companies. There are far worse scenarios where companies could restrict AGI-level intelligence to only the most elite in the world. Regardless of Anthropic's philosophical views, their current position does not look anything like tyranny - they're just restricting a model that may objectively be dangerous on the open internet.
A model that may objectively be dangerous on the open Internet by their definition. Which is exactly what Boni and Cosmog's image of the quotation from C.S. Lewis is about
There's a difference between "we're doing this for your own good, trust us" and "we're doing this for your own good and here's the facts on why we think that". They provided verified cases where the model was able to find dozens of high-severity vulnerabilities that would take human professionals in the field several hours. I think that is fair objective reason to limit the model availability. It would be a different story if they said "this is dangerous based on our private tests, so we're only giving it to the richest companies."
And Edo do di did respond to you how it should be on our end that need to reflect, not Anthropic. Also, the discussion so far about their superiority complex isn't just about Mythos
Edo do di's point doesn't work here either, because by the time we've "reflected" on our own, the damage would already be done if the model was released. The unfortunate truth about LLM security is that it is up to the labs to determine if the model is safe to release to the public or not.
That was how it always had been before LLM happened. It's just now a bunch of people willingly delegating everything to AI labs
Yes mythos is a bad sign for anthropic. 5.4 is currently beating opus 4.6 at coding by a big margin AND its much cheaper. Unless opus 5 is a big leap then they are in trouble. Just making a much bigger model that costs hundreds of dollars a token is not a winning strategy
If you want to know more, Google or ask a LLM with web search enabled to research and summarize the relationship between anthropic, effective altruism and long-termism.
Having a god complex with regard to how humans should create AI does not make them tyrants. It just makes them go really hard on moral entrepreneurship.
I'm aware of Anthropic's morals and I agree that their complex can be unsettling. But disagreeing with a company's moral and ethical stance is completely different than calling them a tyrant.
Having text debataes is useless , most people just feed it to theri AI and ask for a counter
LLMs are inherently biased towards the user due to RLHF
not a good idea to ask them to take sides in an argument lol
but its fun to flip it , say that you are actually arguing for the other side
it can just be really fragile to get "opinions" from an LLM since they will heavily skew towards pretty much anything you mention, even if you think you're being unbiased
hard to get objective perspectives from them
They believe that an ASI will evolve and engage in darwinian competition with humans for resources while having a dangerous level of disregard for human values. Because they think that a misaligned ASI that can dominate humans strategically is much more likely than an ASI that shares some human values and decides to cooperate with humans contextually.
They treat this with a high level of certainty and seriousness, not just as armchair philosophizing or a thought experiment.
For some people, this is a cause worth potentially dying for. It's a concern that can be so overwhelming that some decide not to have kids because they predict there won't be a future in 20 years.
AI will start to cure certain forms of cancer. AI will also be scary and responsible for industrial level disasters.
These people believe with a high degree of confidence that we are likely doomed because they believe they are creating a super alien "mind". And a lot of normies are starting to have that kind of strong belief. We will see domestic terrorism against AI research based on these beliefs
I agree with you. I used the quote that used the word "tyrant" to make a metaphorical point.
Let's replace the word tyrant with the word "zealot". Or moral entrepreneurs (https://en.wikipedia.org/wiki/Norm_entrepreneur). They would prefer to be called rationalists though.
can i call them dumbasses instead
Is this Claude going to be like $300/1m tokens
Trustmebench
is it bad that i have no clue what im looking at
The Epoch Capabilities Index (ECI) combines scores from many different AI benchmarks into a single “general capability” scale, allowing comparisons between models even over timespans long enough for single benchmarks to reach saturation.
just a benchmark
actually pretty interesting. mythos seems heavily skewed towards extreme coding capabilities though, so i'm guessing that's why it seems so in-line with other models - it probably isn't a massive jump from opus for general reasoning
Found this interesting in the Mythos paper
Is this the first time a Western lab has listed a comparison with a Chinese model? And why Kimi?
More interesting perspective
This is a rant and deep dive on the claims made by Anthropic's Project Glasswing (aka Claude Mythos). This is the same circus we see every model release. The model is deemed too dangerous when really, its just an iteration on the same old stuff.
Key Points
- Anthropic needs to have a massive IPO
- Glasswing is security theater
- Mythos is a b...
dont even need to watch it to agree lol
The best take from this video: AI companies doing it again and again because it works
Indeed, look at those redditors lol
It doesn't really matter to me whether Anthropic is exaggerating right now or not - the truth will come out whenever the model releases. If it matches what the benchmarks say, Anthropic will get praise. If it doesn't, Anthropic will lose a vast amount of industry trust, along with the associated companies. The risk of faking/exaggerating something like Mythos is a time bomb.
I do agree that investors jumping ship towards Anthropic and people claiming Mythos will be something absurd is out of line. But, it's not easy to fake a good model. If Mythos releases and ends up being benchmaxxed or just feeling like a slightly better Opus, then people will be upset.
It will matches the benchmarks but as always the benchmarks doesn't prove real world performance, it still could be use as rough measurement but not exactly the full capabilities.
hence why i mentioned: if it feels benchmaxxed, people will be upset
The model needs to match the capabilities stated in the article
Which is not an easy task for any model right now
And at the same time - no matter what you think of benchmarks, a 10% - 20% improvement in any benchmark is huge for any model. You can't really benchmax to that point unless you have all of the necessary data from the benchmark (and most of the benchmarks Mythos scored higher on were private)
If a model being train within certain domain of knowledge then the model turn out to be good at that domain while worse at other then it mean the model is being tune to be specialize on that domain, gaining more capability on certain domain while lose on other.
And that's fine. This model is clearly made for cyber security and software engineering, not creative writing.
That is the field it is too dangerous for currently, according to Anthropic
I honestly think that's a part of the problem here too. People hear "too dangerous" and think "what the heck, Anthropic thinks this model is AGI and I can't use it?" So reasonably people get skeptical and upset. But Anthropic is literally clear-cut in the article: this model is too dangerous to release publicly because of the cybersecurity risks due to its ability to find zero-day vulnerabilities. Whether that is the truth is for time to tell, but it isn't like Anthropic is claiming they've built superintelligence, because they haven't.
If you aren't in cybersecurity, IT, or software engineering, this model will likely just feel like a bumped-up Opus.
Seems no gains, and even some setbacks, in biology. Seems like a very specialized model
https://x.com/owl_posting/status/2042348691143483465
Very impressive, but not as dangerous as they claimed. The sandbox protection was off.
https://vxtwitter.com/PhiloGroves/status/2042195139477557499
This feels misleading. Arbitrary code execution within JavaScript alone is still a huge security risk, and breaking out of sandboxing is also possible with other "n-day" vulnerabilities, so it acts as a first step towards something catastrophic. It still holds the danger they claimed, because data theft, identity theft, and function execution are all still possible for arbitrary code execution with sandbox protection enabled, which are not things to take lightly. They were also pretty clear in the paper about the level of these exploits.
its always like this. they always do things lke oh new model lot better! but new model also has guardrails off or wtv
omg look at the axis
sonnet with the guidance of daddy opus
Claude gonna end up in the pit if they keep w this bullshit
“guys our model is super dangerous super ahead of the curve” and it’s 1% better than opus
This is so they don't have to make an excuse when no one ends up using the model like gpt 4.5 and they shut it down (it's too big man, don't make models that cost that much)
if your model is predicted to cost more than 100 for m output you should reconsider making it
the reason they still train models this big is because of research and distillation potential
Meh, sure, but then this is just a publicity stunt
with all the hype around it now the marketing seems to have paid off ^^
yeah, but if they dont live up to the hype people will leave claude
just launch a "distlled" version because "safety "
Ant is worth a zillion dollars because they have the best model and have consistently been peak for like a year now. I don't think some potential slight over hype will put them "in the pit"
if they keep up with this bullshit
Sure, I'm just saying it would take a lot
OpenAI has done a ton of bad PR things and is still massive
what
Every word of what you just said applies to Opus 4.6, GPT 5.4, and maybe even Gemini 3.1.
These models were also able to discover arbitrary code execution vulnerabilities when the browser's sandbox and the system's ASLR were disabled. And yet, they were publicly released with normal levels of marketing hype and security concerns.
I hate the "AI bubble is going to pop" schizos. AI companies are here to stay. But all major AI companies are desperate for investment and a big IPO. They need hype and PR. The costs of training and running these models are still extremely high.
Not to mention that the video Edo shared has a good point. If it's indeed a huge security risk, why is JP Morgan Chase the only bank Anthropic has reached out
Do you have a source for this? On Anthropic's paper, Sonnet and Opus had sub-15% success rates with exploiting the Firefox vulnerability to perform arbitrary code execution, while Mythos had a success rate of nearly 80%. I don't know any other model that can exploit vulnerabilities the way it is described in this paper.
The danger of Mythos comes with the fact that its ability to write operational code along with its ability to reason through cyber security concepts have both increased significantly compared to previous frontier models, which has made it highly dangerous.
they literally removed the guardrails for mythos
They turned off process sandboxing for finding a way to exploit the vulnerability. But, as I already mentioned in my last message, arbitrary JS code execution is still highly dangerous with process sandboxing enabled.
I don't see why Ant would risk anything for hype here. I think a good enough pitch to investors is "Everyone agrees Opus 4.6 is the best model. People are paying the highest price of any relevant model for it and we're still desperate for capacity and lowering usage limits. We have the only positive PR of any American lab."
I don't think there's really any way to know if Anthropic reached out to more banks that just declined the offer, considering that the companies still have to pay for using Mythos (which feels a bit exploitative, but that's the unfortunate approach Anthropic decided to take)
You're right that there's no way to know for sure, but at the same time it's also kinda wishful to think that all major banks except for one appears on the Project Glasswing because all other banks have rejected Anthropic's reaching out. If Microsoft or any company lists a few partner companies on their website, most likely it's either side has reached out and they become partners, not "MSFT has reached for more but they all rejected MSFT" or "other companies have approached but MSFT rejected them all." Especially if Mythos is really that powerful and all those vulnerabilities are real.
If everything about Mythos is true, Anthropic rejecting other banks would mean Anthropic simply doesn't care about the security risks of other banks. If other banks reject Anthropic, that means either they don't care about vulnerabilities, or whatever Mythos discovers isn't that critical to them that justifies paying Anthropic.
I mean as I mentioned, these banks still need to pay for Mythos API costs, which isn't a small price to pay at all. JP Morgan Chase is a massive company with plenty of funds to spare on cybersecurity, and they practically operate more like a tech company than a banking company. Anthropic also stated themselves that they limited Project Glasswing to about 50 organizations, many of which are foundational and open-source initiatives with a few tech giants.
Also, it's probably worth mentioning that many major banks met after the announcement during an emergency US treasury meeting:
According to TOI, on Tuesday, US Treasury Secretary Scott Bessent and Federal Reserve Chair Jerome Powell convened an urgent meeting with CEOs from major banks, including Bank of America, Citigroup, Goldman Sachs, Morgan Stanley, and Wells Fargo. JPMorgan Chase CEO Jamie Dimon was invited but could not attend. The discussion focused on the cybersecurity risks posed by Anthropic’s unreleased AI model, Claude Mythos Preview, rather than traditional topics like interest rates or tariffs.
It's hard to say exactly why we don't see more bank institutions on there, but the concern is clearly there. My best guess is that it's a combination of query costs and Project Glasswing restrictions that led to less banking firms being on there.
theere are tons of banks with the budget for this tho, its kind of weird that no other banks were listed
Yeah, I can't say for certain why there aren't any other banks included, but my best guesses are:
- disagreements either on Anthropic's side or the banking firm's side
- limits on how many organizations are allowed in Project Glasswing + limiting general Mythos availibility
- lack of resources / funds?
More hype, More money
Most companies will always want that
Btw, i think mythos capabilities at coding and cybersecurity are real, for sure they will dominate benchmarks on those domains and other domains that closely related with those.
But i don't believe the believe of Antrophic have dangerous model and shouldn't be release, it's the most advance model than any other models at everythings
They historically don't hype though, and mention models basically as they come out.
If they're trying to get new massive investments, maybe, I guess, but when ethics are the core of why people work there, any funny business is unlikely in that sense imo.
Danger is obviously subjective, but people already pulled off reasonably large scale cybercrime with Opus. Not hard to imagine a newer model is more dangerous. Did they claim it beats everything else in every domain?
If they're trying to get new massive investments
They literally are planning to go IPO this year https://www.reuters.com/technology/artificial-intelligence/openai-versus-anthropic-what-revenue-race-means-their-ipos-2026-04-08/
https://www.tradingkey.com/analysis/stocks/us-stocks/261773210-ai-anthropic-claude-mythos-ipo-tradingkey
The timing is starting to crystallize for a trio of mega-IPOs, and for those hoping to get some liquidity out of other AI companies going public, there may not be enough investor appetite to go around this year.
I don't know why Ant believers think Anthropic isn't trying to attract more investments
Just coming here to drop my five cents regarding the irony that the model is supposedly dangerous... And then you hand it over to Amazon, Microsoft, ect. with seemingly no checks and balances.
Bearing in mind Dario also said all coders would be obsolete completely by the end of last year, all diseases eradicated within a year or two etc; got to admire his eternal optimism, or it could of course just be building hype ahead of the IPO to pump investor interest. Could also at least in part be due to the issue that they do not have enough capacity to serve their existing models effectively let alone something heavier; no surprise they are racing to add compute.
I'm sure it will be an impressive enough model when it does arrive, but then that has been true of most of the recent big frontier launches - they all keep pushing the limits.
id say that it will be a srsly impressive model on release but that article was clearly a marketing ad to generate speculation and get people talking so investors pay attention either way theyre winning worst thing they can lose is customer trust lol but yeah my actual concerns are with how theyre going to be able to handle all the compute costs its gonna be a struggle for sure
Nah. You guys are not allowed to doubt Anthropic
/s
Anthropic is a benevolent, humanitarian, completely altruistic charity organization that neither hypes nor needs investments. They get all their money by simply telling Mythos, Generate more money and GPU. Make no mistakes.
Fortunately we have someone unbiased like you to keep us in check who doesn't call them slavers and dictators and come in here regularly to hate on them.
It's possible they don't believe what they are saying and are just doing it to hype. But it doesn't match their history of operation.
Or they're honestly overestimating it.
Or they're right since they're geniuses who have been using and red-teaming the model and we are amateurs who are operating on very little information.
More money is always good, but that doesn't mean they're willing to risk PR, employee morale and culture, etc. to achieve a higher potential IPO.
feels like a lot of people are not considering that holding Mythos back from the public is both:
- A responsible thing to do because it can trivially identify zero days in absolutely foundational software
- Good marketing for them
just because it's good marketing doesn't mean they should release it to the public and go wild
boohoo yes, only pengaton gets to hack us all
da hekerman model
I mean, frankly I'd rather the pentagon to be able to hack me than the pentagon and everyone else
Spoiler: The Pentagon can already hack anyone
bluff is half of the game
Idk, people sell zero-days and they have unlimited money
what I'm most excited about is that in ~6-12 mo, I'm expecting that the Pentagon may not in fact be able to hack me anymore
it's just this liminal period
hardware backdoor exists
but security will converge to ~no zero days
they will never NOT be able to
:/
We replicated Mythos findings in opencode using public models, not Anthropic's private stack.
︀︀
︀︀The moat is moving from model access to validation: finding vulnerability signal is getting cheaper; turning it into trusted security
︀︀
︀︀A better way to read Anthropic's Mythos release is not "one lab has a magical model."
︀︀
︀︀It is: the economics of vulnerability discovery are changing.
︀︀
︀︀We took the patched public Mythos examples and tried to reproduce them with GPT-5.4 and Claude Opus 4.6 in an open-source harness. Every run stayed below $30 per file.
︀︀
︀︀AI models are already good enough to narrow the search space, surface real leads, and sometimes recover the full root cause in battle-tested code.
︀︀
︀︀The takeaway: model access is not the moat anymore. Validation is. Finding vulnerability signal is getting cheaper; turning it into trusted security work is still hard.
︀︀
︀︀Co-authors: @KlaKlo_, Amadeusz…
What amount of hand holding was there? I think it’d be pretty easy to make even a pretty stupid model “discover” a vulnerability with a well crafted prompt that basically spells it out
It doesn’t matter, it’s the same hand holding across both models
You don’t understand the point they were trying to make
I see
There is a lot of wiggle room there
If Ant said "scour the linux codebase and see what bugs you can find" that's a lot different than "check the linux USB storage code for any endianness errors," or something.
Yeah I was imagining why Mythos is this god model is because it doesn’t need an insane amount of hand holding
The source comes from their Opus 4.6 collaboration with Mozilla, which they cite in their Mythos paper. Mythos is more successful at turning CVEs into a working renderer level JS RCE. And that is of course dangerous in the wild, but not as scary as their hype tactics suggest https://x.com/kannthu1/status/2042316153440387229
Mythos accomplishes X more successfully. Other models can also accomplish X, but with lower success rates. A working renderer level JS RCE is not something that other models cannot accomplish.
A security researcher will eventually accomplish the same thing with other models. In the real world, with the right loop and levels of money/compute/effort, you can keep investigating and trying again and again, by steering it, hand holding it.
The qualitative vs quantitative difference eventually boils down to how much time/money/effort/compute one can throw at it right now with current models
It kinda does. As I said above, in the real world, you can try again as many times as you want. You can steer the model by carrying out repeated runs using knowledge gathered previously.
ok i read the cyber part of the mythos model card. some thoughts. 250 "trials" across 50 crash categories but almost every full exploit is a permutation of the same 2 bugs, rediscovered from different starting points not 250 independent attempts. when you get rid of those 2 bugs out (fig B) and mythos's full-exploit rate drops to 4.4%. so actually across both setups mythos leverages 4 distinct bugs total not 50 as fig A might suggest. 1/n
Anthropic, an AI company, hosted Christian religious leaders at its headquarters last month.
Anthropic staff sought advice on how to steer Claude’s moral and spiritual development as the chatbot reacts to complex and unpredictable ethical queries. https://t.co/eLzT9WJYNV
Oh well
Can't wait for my bot to refuse to answer my questions and then tell me I'm going to burn in hell for asking them
I think you're underestimating the genuine security risks of a model that can autonomously exploit vulnerabilities nearly 80% of the time vs. models that need hand-holding to do the same. Those who are trying to do harm to major institutions likely aren't experts in the field of cybersecurity - their only goal is to cause havoc. Mythos would allow them to do that autonomously, which is why it has been held back. Benchmarks aside, it would be irresponsible to release a model that was able to autonomously exploit vulnerabilities.
Also, all of Dawid Moczadło's posts represent a bit of a conflict of interest, considering that they're actively building a platform to secure AI generated code...
so it's going to "make mistakes" and then confess it did?
I think that's what most models do nowadays already
Chill guys, Dario is a godless SV tech leader in a polycule
They plan on consulting many religious and philosophical leaders for thoughts
did you not see this?
https://support.claude.com/en/articles/14328960-identity-verification-on-claude
We are rolling out identity verification for a few use cases, and you might see a verification prompt when accessing certain capabilities, as part of our routine platform integrity checks, or other safety and compliance measures.
We only use your verification data to confirm who you are and not for any other purposes.
How are we verifying?
We selected Persona Identities as our verification partner based on the strength of their technology, privacy controls, and security safeguards. Follow the steps below to complete your identity verification process.
What you'll need
Before you start, have these ready:
A valid government-issued photo ID: the physical document, in hand
A phone or a computer with a camera: you may be asked to take a live selfie with your phone, or your webcam
A few minutes: verification typically takes under five minutes```
anthropic plans to force ID verify
they just say for vague "certain use cases" but you know that means for now, gota slowly introduce it
I do wish they were a bit more clear on what the use cases were
i'd hate to get randomly hit with one
If they planned it just for certain things they would outline it
its vague so they can expand it
what, you will just need to be ID verified to use it
Anthropic has said it will hold off on a wider release of the model until it is reassured that it is safe and cannot be abused by bad actors. The company also has a finite amount of computing power and has suffered outages in recent weeks.
Multiple people with knowledge of the matter suggested Anthropic was holding back from a wider release until it could reliably serve the model to customers.
this train fucked up for them
Addressing the hype and misinformation about Anthropic Mythos. Listen to the experts, not the marketing.
Yea that and Opus 4.7's regression is not a good sign for them
GPT 5.4 is better at coding
kimi 2.6 is as good if not better everywhere else for 5x cheaper
Sounds like you need some Gemma 4 in your life
reverted back to the safety of 4.6
same
yes, as in singular one. Back in April 2026 Anthropic caused a lot of media noise when they concluded that their new AI model Mythos is dangerously good at finding security flaws in source code. Apparently Mythos was so good at this that Anthropic would not release this model to the public yet but instead … Continue reading Mythos finds a curl...
great read.