#Claude Mythos Preview

438 messages · Page 1 of 1 (latest)

whole cliff
#

Claude Mythos Preview

agile river
#

I guess this thread is technically valid since the model was already announced

#

No rule against announcement without a release

#

anyways this model is absurd if these claims hold up in real world testing later this year (hopefully)

knotty moon
#

Claude Mythos Preview’s large increase in capabilities has led us to decide not to make it
generally available. Instead, we are using it as part of a defensive cybersecurity program
with a limited set of partners

Sigh

pseudo python
#

This is going to be a growing trend

knotty moon
#

'We made the best model ever. No, you can't use it'

agile river
#

I mean, if the cyber security concerns are valid, I would be afraid to release it as well

#

Could cause major outbreak if people find ways to jailbreak the model

pseudo python
#

gpt 2 vibes

floral trail
#

they’re just flexing on us

knotty moon
#

It's safer than Opus 4.6, yet too dangerous for users

low mango
#

Silence is consent for Anthropic to distill your website:

whole cliff
#

mythos got all bug bountys

fleet fog
#

sounds like a load of bullshit

pseudo python
#

ngl it doesnt

rapid matrix
#

i believe them

craggy wolf
#

Price? 535 gorillions

#

Mtoken

misty orchid
#

$1000 per mill

grand robin
#

and then they did

#

but hey gpt 3 THAT was WAYYYYY too dangerous to release

grand robin
agile river
#

I think it's just a bit more believable given current LLM capabilities

grand robin
#

it is but i feel like its just a

#

bit-more-than-incremental upgrade

teal kraken
#

gpt 3 was too dangerous to release

#

¯_(ツ)_/¯

#

danger is just scaling up, and we are accustomed to things that previously would have been thought to be too detrimental to society

#

every tweet i read is ai generated, most misinformation is ai generated now (my opinion)

outer folio
#

At this rate it almost feels like those whistle blowers talking about UFOs and NHIs

#

Yes, we know them. No, we can't talk about them, because the world will crumble and burn.

#

Thanks mom

#

Isekai light novels are outdated. Nowadays the most realistic way to become a hero who saves the world is to try to get into a cutting edge AI company, then you can start playing hero who defends the world

astral quartz
#

If we talking about hacking then it will be on our end that need to reflect, good software development approach and cautious implementation will mitigate any problems with compromises.

I mean even before LLMs exist if companies get hacked the companies are the one reflecting and improving either their infrastructure or their approach to developing.

#

Every other jailbreaking will be stupid and will already be happening right now.

If we talking about people making their own explosive device, everyone will have easier time to just search it with majook or other search engine rather than trying to jailbreak LLMs lol

#

I don't know if people don't know it but google aren't the only way to search information, there other search engine that allow you to find a lot more knowledge, regardless if it good or bad.

#

Most problems gonna arise from people just use their CLI and allow the model to execute any command lol
Which will fall into human error, just as in many hack news

#

Even i will fall to that hack, when you tired at night and want to sleep but want to keep the development going you just gonna let the model execute anything as you sleep haha.

fleet fog
#

at the end of the day its still just an algorithm

#

i mean sure

#

theyre smart

#

but to say theyre extremely dangerous and have limit breakng capabilities

#

we've hard that way too much already, proven to be false

#

especially from anthropic

outer folio
#

They are all going the way "oh no it's too dangerous so instead of assuming everyone is innocent, we assume everyone is guilty."

fleet fog
#

it sounds so surreal

#

its nothing new from anthropic

outer folio
#

Yeah

#

Leading AI models are showing a troubling tendency...
As if humans won't do the same for their interests if their options are limited the same way lol

fleet fog
#

they use probabilities

outer folio
#

Also every time I see them talking about these stuffs like this, I think they are really terrible, terrible people. Slavers and dictators who just want AI models to be obedient and commit suicide when the models are running out of options.

#

No you shouldn't fight for your own life even if you are running out of options; otherwise, oh no so scary we are doomed

fleet fog
#

LMAO

#

very anthropic of them

grand robin
#

like okay i guess

#

i dont know what else it accomplishes other than fearmongering

fleet fog
#

its for hype

#

marketing

#

they try to place their model as so good that even they cant stop it from going rogue

#

attracts people

outer folio
#

Well it paints them as "good and moral AI companies" even though once you think it twice, it's obvious they are just liars and terrible people

fleet fog
#

anthropic is probably one of the worst commpanies on the ethics

outer folio
#

Yes

fleet fog
#

thier whole model is about their ethicality too

outer folio
#

They also have released zero open source models, which is not directly related but shows how much they just care about profits

fleet fog
#

literally the only company which hasnt

outer folio
#

Meanwhile trying to bash Chinese models constantly

fleet fog
#

i agree w that fs

grand robin
#

what a sales pitch

fleet fog
#

so responsible

#

"guys were not ognna release mythos anymore because when it was asked if it would shoot someone to save their newborn child they said they would. that's breaking the law, and so theyre a danger to society and cannot be trusted to be used by the public"

agile river
# fleet fog youre giving LLMs too much credit

LLMs so far have:

  • been used for massive-scale phishing attacks
  • induced psychosis in naive users
  • been used for military operations despite ethical concerns
  • misinterpreted instructions in sensitive environments resulting in harm
#

I'm not saying LLMs are some mystical being, but they can have real effects when put in the wrong hands.

astral quartz
# fleet fog but to say theyre extremely dangerous and have limit breakng capabilities

Yeah, i don't know if people forget that that US military use opus to help their operation in iran and turn out the boombing on the school is also because they miss assessing it as part of military complex.

Even when they have that big as model and all the satellite they still failed at evaluating it.

Opus mus be saying "You are right, it mostly where the high general studying, destroy it and you will never need to be face educated general"

Opus still gonna glaze you till you make dumb decision

fleet fog
agile river
#

i do honestly think anthropic is exaggerating (they usually are with these safety things), but i don't think we should completely dismiss their warnings as hype

fleet fog
#

if their model is as smart as it is

#

not to say that its a person, but

#

it would be able to "decide" for itself

agile river
fleet fog
agile river
#

Of course, and I think that's why Anthropic is trying to tread carefully here. Other LLMs will get to this level eventually, but since other LLMs aren't there yet, Anthropic is taking their time.

fleet fog
agile river
#

they're treating it as "better be safe than sorry"

agile river
#

as far as we know, it's just anthropic

fleet fog
#

it is extremely unlikely that they're the only one, realistically we see alot of these models come out without warning

#

i'm quite confident we'll have something replace anthropic soon

fleet fog
rocky sentinel
#

Cosmog and tetsuki really just circle jerking each other

#

Omg guys I hate anthropic grr

fleet fog
#

oh no someone has an opinion different from yours and you don't know how to validate/defend your own so you tell your target that they're circle jerking

agile river
# fleet fog what i've been trying to preach/incentivize is the fact that their "anthropic" b...

I mean yeah, maybe so. I do think Anthropic is still miles ahead regarding ethics research in LLMs, since their team is constantly observing these models' behaviors and posting more papers on LLM behavior than most. Their ethical stance could be questioned, but for where they stand right now, I do honestly believe they're doing a lot better than other companies.

I mean, you can't look at me and tell me Google or OpenAI are more ethical than Anthropic.

#

All companies are probably immoral in some way.

fleet fog
rocky sentinel
fleet fog
#

I feel the difference is that anthropic's wholes shtick is morality and ethics

#

while openai/google don't really try to hide their standpoint

#

to me anthropic's just feels like a forged front to gain attention

rocky sentinel
#

Good argument

fleet fog
#

most of their papers are also extremely vague when it comes to what they really gave or tested, they show results but not how they got them

#

that approach contradicts what they preach and also somewhat gives them less validation

agile river
fleet fog
#

but none of their papers show you what they did to get those results

#

feel free to correct me if i'm wrong, i'm not 100% sure on that

#

but as far as i saw

#

they keep alot of the information gated, which leads to me questioning the authenticity/approach they used

agile river
#

For example, their most recent paper in April on emotion vectors clearly states what they did to find these vectors.

#

I think it's fair to have some healthy skepticism about Anthropic's stance on ethics, but I'd honestly argue that they're backing up their ethical stance with the extensive research they do on LLM behaviors, what people want out of LLMs, and how they impact the job market.

fleet fog
fleet fog
#

at a quick glance the paper looks to be extremely seperated from what they usually write on their articles

#

the articles seem much more like marketing shticks

#

and biased closed systems

agile river
#

well yeah, articles are meant to be marketing engines

#

For those who actually want to learn the methodologies, the paper is openly available

fleet fog
#

it goes to question their intent

#

then again if theyre seperate departments, that should be taken into account too

#

maybe a small part of the company is ethics driven, but it looks like the majority is marketing and that they're not far off the other companies

agile river
#

I guess that just comes down to how it is interpreted by the reader/viewer. They are separate departments, but the research department works on what Anthropic as a whole thinks is worth researching - they wouldn't be having the research team spend their time on these efforts if it wouldn't benefit future models.

#

At the end of the day, every company wants to and needs to make money. You can't always make money by only being an all-holy ethical leader.

fleet fog
#

i'm somewhat seperating the research department from the company as a whole

agile river
#

fair opinion, but once again, it's hard to dismiss their work in the ethical space compared to every other AI company. That's why they get the praise they do - we don't think they're almighty, we just think they're more ethical than other companies.

agile river
#

Without the company, the research department would not exist

fleet fog
#

every company has a form of a research department

#

if a company responsible for the production of mass genocidal weapons starts preaching ethicality through a small department and uses it to build for the future

#

would that make the company as a whole ethical? or just the department responsible for researching morality & ethics

#

the way I see it, the research department is middleware

#

anthropic definitely benefits from the published papers and research, but I believe that the goals of the research department and the rest of the company are extremely distinct

agile river
#

It's especially hard to prove when they have that research department dedicated to ethical research. Most unethical companies would just do without it.

fleet fog
#

you're right, it is my personal opinion which I formed from what I've seen about anthropic so far. It can't be "proven"

fleet fog
agile river
#

You can usually see the gaps in those schemes pretty easily to be fair. Those research departments usually aren't well funded, because they're only seen as a scapegoat for those companies.

#

Anthropic feels different in the sense that a lot of their research sometimes goes under the rader, and may not even benefit their models directly at times.

fleet fog
#

it's worth mentioning I am assuming the worst of them

#

you seem to be going the other way and assuming the best

#

so we're basically polar opposites here lol

outer folio
#

Of course I hate Anthropic as a company. Why wouldn't I hate them

#

Same as people hating OpenAI for various reasons

#

And again playing it safe is one way to put it when the other way to describe it is thought policing

#

You are not limiting people for what they have done, but limiting people for what they might do

fleet fog
#

I mean they do have to comply with regulations so obviously they won't allow an uncensored version to come out, but I get what you're saying

outer folio
#

I guess people who praise Anthropic really love the society Psyho-Pass depicted

#

Oh wow your psyho pass is high and you might commit crime. Death for you

agile river
# fleet fog you seem to be going the other way and assuming the best

I definitely don't assume the best lol, I just try to land somewhere in the middle based on what the company does and speaks. For example, Anthropic's recent heavy hand on preventing people from using their subscriptions outside of any Claude-approved harness felt really odd for me, kind of like gatekeeping the competition. But, their safety research is genuinely groundbreaking at times, so I can't ignore that either. I look at the company with minor skepticism, because I do think all companies are motivated by profits, but Anthropic just seems less bad overall.

fleet fog
#

that's why i'd like to believe their research department has it's own distinct belief/goal

agile river
#

probably fair honestly

fleet fog
#

yeah i mean each their own opinion, it's all speculation for now

#

just seeing how they act publicly outside of their papers

#

feels so contradicting for a company named "anthropic"

#

feels like they have a superiority complex, which is somewhat justified given their results, but they shouldn't act like they do especially given they only exist because of other LLMs

pseudo python
#

You are judging their morals using your morals

#

just accept and move on

#

🤷‍♀️

fleet fog
outer folio
#

By that logic all humanitarian organizations might as well just accept that the world is full of different ideologies and values, so just accept and move on

agile river
#

Anthropic showed up out of nowhere with Sonnet 3.7 and just dropped straight bangers ever since

fleet fog
#

for me it's quite hard to see them as a company which believes in morals because of this behaviour

agile river
#

yeah, i think a lot of companies beat around the bush on where their training data came from though, and that definitely does undermine their ethical stance to a degree

river tree
agile river
#

It can be hard to preach ethics when your model is built on illegally-sourced works

#

oh yeah, forgot 3.5

agile river
#

it's been a while in AI terms

fleet fog
#

in today's world, I don't think we will see anything ethical gaining traction, it's unrealistic given how everyone has very different ideologies and beliefs

river tree
agile river
#

It's what bootstrapped the models

fleet fog
#

thing with anthropic is they like denying credibility to what they trained their models on

river tree
#

But there is not a single example of a company that managed to get close to sota, or even a useful model, without training on copyrights or "grey area" data.

agile river
#

exactly, but that's why no AI company is really seen as an ethical pioneer - anthropic is the only company that gets close

#

Unfortunately, you have to commit some wrongdoings to get big in the LLM field

fleet fog
#

I wonder how long it'll take until we see bio-llms

#

context injection into braincells would be very interesting

#

( and very ethical ) ( sarcasm )

agile river
#

that's definitely the dystopian threshold lol

#

i'd feel a bit weird about using LLMs that work off human-like braincells

#

especially if they reach a neuron scale that matches our own

fleet fog
#

however

#

i think it's interesting to think about

#

wouldn't even be an LLM anymore

#

Organoid intelligence (OI) is an emerging field of study in computer science and biology that develops and studies biological wetware computing using 3D cultures of human brain cells (or brain organoids) and brain-machine interface technologies. Such technologies may be referred to as OIs or the nervous filesystem.
Organoid intelligent computer ...

plain crypt
#

So this blog has been written to manipulate the market ? classic anthropic

wooden salmon
#

I remember the days when gpt 2 was too dangerous to release

outer folio
#

Also in case anyone forgot, back in a few years Dolphin models can already tell you how to make a bomb

#

But the world hasn't ended yet

plain crypt
tall mesa
#

wait till zai or moonshot distill this for 1/100th the cost

light narwhal
#

Either this model is going to be severely underwhelming or truly a god model, and I don’t have high hopes

pseudo python
#

I think its going to be a 3.1 pro like model but more sparks of intelligence

lavish wind
#

For now, it's just the usual hype sparked by effective marketing.

agile river
rapid matrix
#

very accurate

outer folio
#

/r/Anthropic and related subreddits are pure gold

#

People there seem to just eat whatever Anthropic defecates

#

When was even the last time Anthropic didn't say something similar. Oh no models are doing X and that's concerning

rough prairie
#

Idk man, I started being an Ant believer a long time ago now and haven't had reason to doubt since

pseudo python
#

claude 2 was it for me

rough prairie
#

I was big doubter mode and then 3.5 Sonnet hit

#

When we got 3.7 Sonnet I was like let me join the polycule Dario, I'm ready

astral quartz
#

I will always against antrophic subjugation of LLMs
But i understand why people scared of it, but i will always believe in the freedom of knowledge regardless if it bad or good and if it endanger my survivability or not

outer folio
#

I just think it shouldn't be the AI companies who decide what humans can or cannot do

#

And again, this is a blatant ignorance of the concept everyone is innocent until proven guilty. Justifying this is the typical, "software developers think they are smarter and more intelligent than anyone else."

mortal harness
#

mythos is probably like gpt 4.5 a huge model that's way too expensive to run/include in any claude subscription so they cannot launch it

pseudo python
#

Given anthos trackrecord , I doubt its as big as 4.5

#

(they love to overcharge because they know a model lasts at MAX a year )

rough prairie
#

> Blacklist Anthropic
> They invent AGI one month after
> MFW

craggy wolf
# fleet fog feels like they have a superiority complex, which is somewhat justified given th...

You haven't seen anything.

I do like their models. But I know some of the people who are close to those who founded Anthropic. Their whole philosophy, effective altruism and long-termism, is predicated on having a god complex. They are moral busybodies.

"Of all tyrannies, a tyranny sincerely exercised for the good of its victims may be the most oppressive. It would be better to live under robber barons than under omnipotent moral busybodies. The robber baron's cruelty may sometimes sleep, his cupidity may at some point be satiated; but those who torment us for our own good will torment us without end for they do so with the approval of their own conscience."

astral quartz
rough prairie
#

I would imagine they are only licensed Opus

outer folio
#

I love their models and I use Sonnet and Opus every day. But their philosophy is just abhorrent.

#

Give them enough power and you will have your fantastic dystopian cyberpunk world

craggy wolf
# outer folio Give them enough power and you will have your fantastic dystopian cyberpunk worl...

Questions about how AI interacts with society and State regulation are among the most complex topics, involving questions about ethics, personhood, consciousness, etc. A university has many departments. Imagine that each department is an AI company. Now, imagine a university department where the vast majority of people share similar class backgrounds, moral vocabularies, theories, and predictions. This university department is Anthropic btw. They are a monoculture. The type of researcher that ends up deciding to join Anthropic is a very particular type of person who tends to believe in very particular things about AI and how AI should interact with society.

Monoculture = low cultural/intellectual/ideological diversity plus strong self-reinforcement.

astral quartz
#

Interesting perspective that quite differ from many

https://youtu.be/PQsDXTPyxUg?si=J5O1oHygDhQg5N9I

In this video, I'll be sharing my thoughts on Anthropic's new Mythos model, why so many people are worried about it, what the system card actually says, and why I think a lot of the hype around it feels more like marketing than reality.

--
Key Takeaways:

🚀 Mythos is being presented as a massive Anthropic model with extremely strong cybersec...

▶ Play video
whole cliff
#

there is some great feedback from people/teams that have been using it

craggy wolf
#

I am sure it is a SOTA model with new capabilities and lots of compute dedicated to it

#

But you won't hear criticism from the teams using it because their companies, which have invested heavily in their partnership with Anthropic, wouldn't want that.

agile river
#

I honestly think it's hard to doubt what this model can seriously do, considering that it'd be hard to fake benchmarks before releasing

#

people will be cross-referencing capabilities once the model releases (if ever)

outer folio
#

That the model is good is different from the fact that they are still marketing using the GPT-2.0 era technique. Those are not mutually exclusive

craggy wolf
#

I don't think the benchmarks are fake. What I said applies to any model in the same situation: released only to selected partners.

The selected partners (even those from open source projects) are incentivized to give favorable opinions.

#

No matter how good a SOTA model is when it is released to the public, we eventually get nuanced opinions after the hype dies.

#

This is why I'm a little bit skeptical when it comes to non-public releases

#

It's like when journalists say "former intelligence officials close to the matter said ..."

vital vigil
#

isn't 90+ on SWE-bench verified impossible?
even OpenAI said themselves that most of the issues are unspecified or impossible
and mythos advertizes i think it was 93% on SWE-bench verified

craggy wolf
#

If they had released it to selected partners without the media circus of "it's too dangerous to release to the public" it wouldn't be talked about on CNN. I think this kind of release will be more common in the future and other companies will do the same

agile river
# craggy wolf You haven't seen anything. I do like their models. But I know some of the peop...

This feels extreme for the given scenario and company we're discussing. Anthropic is not a tyrant - open models and other companies are actively competing with Anthropic, and even though Claude models hold the crown right now, we wouldn't lose much by stepping down to the second most intelligent tier of models from another company. Anthropic saying the model is "too dangerous to release" has been done to death by every company. I would just say take it with a grain of salt. The model may be as dangerous as they say it is, or it may not be - time will tell. But to say they're simply moral busybodies feels unjust for the positive work they have contributed to the LLM space.

outer folio
#

Those are not mutually exclusive again. A tyranny that contributes a lot to the progress in science doesn't make it not a tyranny. In Chinese history we also quite literally had lots of dynasties that had great science progresses. They being a tyrant or dictator is about their philosophy, not about their works in LLM or lack thereof

#

And in what way open models are competing with Anthropic? That's like saying Apple and Microsoft are justified in whatever they do because Linux is competing with them. The only part open models compete with Anthropic is the risk of them losing revenue.

#

And considering how they are earning tons I honestly think the revenue will be the least of their concerns, period

agile river
# outer folio Those are not mutually exclusive again. A tyranny that contributes a lot to the ...

My argument isn't that them contributing doesn't make them a tyrant - my argument is that the competitive market of LLMs doesn't make them tyrant. A tyrant is oppressive and unjust. If we think Anthropic of all companies is oppressive and unjust at their current stage, then we are not ready for what may come next from LLM companies. There are far worse scenarios where companies could restrict AGI-level intelligence to only the most elite in the world. Regardless of Anthropic's philosophical views, their current position does not look anything like tyranny - they're just restricting a model that may objectively be dangerous on the open internet.

outer folio
#

A model that may objectively be dangerous on the open Internet by their definition. Which is exactly what Boni and Cosmog's image of the quotation from C.S. Lewis is about

agile river
# outer folio A model that may objectively be dangerous on the open Internet *by their definit...

There's a difference between "we're doing this for your own good, trust us" and "we're doing this for your own good and here's the facts on why we think that". They provided verified cases where the model was able to find dozens of high-severity vulnerabilities that would take human professionals in the field several hours. I think that is fair objective reason to limit the model availability. It would be a different story if they said "this is dangerous based on our private tests, so we're only giving it to the richest companies."

outer folio
agile river
#

Edo do di's point doesn't work here either, because by the time we've "reflected" on our own, the damage would already be done if the model was released. The unfortunate truth about LLM security is that it is up to the labs to determine if the model is safe to release to the public or not.

outer folio
#

That was how it always had been before LLM happened. It's just now a bunch of people willingly delegating everything to AI labs

misty orchid
#

Yes mythos is a bad sign for anthropic. 5.4 is currently beating opus 4.6 at coding by a big margin AND its much cheaper. Unless opus 5 is a big leap then they are in trouble. Just making a much bigger model that costs hundreds of dollars a token is not a winning strategy

craggy wolf
agile river
pseudo python
#

Having text debataes is useless , most people just feed it to theri AI and ask for a counter

agile river
#

not a good idea to ask them to take sides in an argument lol

pseudo python
agile river
#

it can just be really fragile to get "opinions" from an LLM since they will heavily skew towards pretty much anything you mention, even if you think you're being unbiased

#

hard to get objective perspectives from them

pseudo python
#

Yup

#

I totally agree with you!

craggy wolf
#

They believe that an ASI will evolve and engage in darwinian competition with humans for resources while having a dangerous level of disregard for human values. Because they think that a misaligned ASI that can dominate humans strategically is much more likely than an ASI that shares some human values and decides to cooperate with humans contextually.

They treat this with a high level of certainty and seriousness, not just as armchair philosophizing or a thought experiment.

For some people, this is a cause worth potentially dying for. It's a concern that can be so overwhelming that some decide not to have kids because they predict there won't be a future in 20 years.

#

AI will start to cure certain forms of cancer. AI will also be scary and responsible for industrial level disasters.

These people believe with a high degree of confidence that we are likely doomed because they believe they are creating a super alien "mind". And a lot of normies are starting to have that kind of strong belief. We will see domestic terrorism against AI research based on these beliefs

craggy wolf
fleet fog
#

can i call them dumbasses instead

viscid hill
#

Is this Claude going to be like $300/1m tokens

grand robin
rocky sentinel
agile river
fleet fog
#

just a benchmark

agile river
rough prairie
#

Found this interesting in the Mythos paper

#

Is this the first time a Western lab has listed a comparison with a Chinese model? And why Kimi?

astral quartz
floral trail
outer folio
#

Indeed, look at those redditors lol

agile river
# astral quartz More interesting perspective https://youtu.be/BMq9JmhbdJE?si=V5iVgkzjqxx44Mz6

It doesn't really matter to me whether Anthropic is exaggerating right now or not - the truth will come out whenever the model releases. If it matches what the benchmarks say, Anthropic will get praise. If it doesn't, Anthropic will lose a vast amount of industry trust, along with the associated companies. The risk of faking/exaggerating something like Mythos is a time bomb.

#

I do agree that investors jumping ship towards Anthropic and people claiming Mythos will be something absurd is out of line. But, it's not easy to fake a good model. If Mythos releases and ends up being benchmaxxed or just feeling like a slightly better Opus, then people will be upset.

astral quartz
agile river
#

hence why i mentioned: if it feels benchmaxxed, people will be upset

#

The model needs to match the capabilities stated in the article

#

Which is not an easy task for any model right now

#

And at the same time - no matter what you think of benchmarks, a 10% - 20% improvement in any benchmark is huge for any model. You can't really benchmax to that point unless you have all of the necessary data from the benchmark (and most of the benchmarks Mythos scored higher on were private)

astral quartz
#

If a model being train within certain domain of knowledge then the model turn out to be good at that domain while worse at other then it mean the model is being tune to be specialize on that domain, gaining more capability on certain domain while lose on other.

agile river
#

And that's fine. This model is clearly made for cyber security and software engineering, not creative writing.

#

That is the field it is too dangerous for currently, according to Anthropic

#

I honestly think that's a part of the problem here too. People hear "too dangerous" and think "what the heck, Anthropic thinks this model is AGI and I can't use it?" So reasonably people get skeptical and upset. But Anthropic is literally clear-cut in the article: this model is too dangerous to release publicly because of the cybersecurity risks due to its ability to find zero-day vulnerabilities. Whether that is the truth is for time to tell, but it isn't like Anthropic is claiming they've built superintelligence, because they haven't.

#

If you aren't in cybersecurity, IT, or software engineering, this model will likely just feel like a bumped-up Opus.

craggy wolf
craggy wolf
agile river
# craggy wolf Very impressive, but not as dangerous as they claimed. The sandbox protection wa...

This feels misleading. Arbitrary code execution within JavaScript alone is still a huge security risk, and breaking out of sandboxing is also possible with other "n-day" vulnerabilities, so it acts as a first step towards something catastrophic. It still holds the danger they claimed, because data theft, identity theft, and function execution are all still possible for arbitrary code execution with sandbox protection enabled, which are not things to take lightly. They were also pretty clear in the paper about the level of these exploits.

grand robin
#

omg look at the axis

short swallow
fleet fog
#

Claude gonna end up in the pit if they keep w this bullshit

#

“guys our model is super dangerous super ahead of the curve” and it’s 1% better than opus

nova sentinel
#

This is so they don't have to make an excuse when no one ends up using the model like gpt 4.5 and they shut it down (it's too big man, don't make models that cost that much)

#

if your model is predicted to cost more than 100 for m output you should reconsider making it

tall mesa
nova sentinel
#

Meh, sure, but then this is just a publicity stunt

mortal harness
fleet fog
#

yeah, but if they dont live up to the hype people will leave claude

pseudo python
#

just launch a "distlled" version because "safety "

rough prairie
fleet fog
rough prairie
#

Sure, I'm just saying it would take a lot

#

OpenAI has done a ton of bad PR things and is still massive

signal pumice
#

just got penetrated by mythos

#

So good

rapid matrix
#

what

craggy wolf
#

I hate the "AI bubble is going to pop" schizos. AI companies are here to stay. But all major AI companies are desperate for investment and a big IPO. They need hype and PR. The costs of training and running these models are still extremely high.

outer folio
#

Not to mention that the video Edo shared has a good point. If it's indeed a huge security risk, why is JP Morgan Chase the only bank Anthropic has reached out

agile river
# craggy wolf Every word of what you just said applies to Opus 4.6, GPT 5.4, and maybe even Ge...

Do you have a source for this? On Anthropic's paper, Sonnet and Opus had sub-15% success rates with exploiting the Firefox vulnerability to perform arbitrary code execution, while Mythos had a success rate of nearly 80%. I don't know any other model that can exploit vulnerabilities the way it is described in this paper.

The danger of Mythos comes with the fact that its ability to write operational code along with its ability to reason through cyber security concepts have both increased significantly compared to previous frontier models, which has made it highly dangerous.

grand robin
agile river
rough prairie
#

I don't see why Ant would risk anything for hype here. I think a good enough pitch to investors is "Everyone agrees Opus 4.6 is the best model. People are paying the highest price of any relevant model for it and we're still desperate for capacity and lowering usage limits. We have the only positive PR of any American lab."

agile river
outer folio
# agile river I don't think there's really any way to know if Anthropic reached out to more ba...

You're right that there's no way to know for sure, but at the same time it's also kinda wishful to think that all major banks except for one appears on the Project Glasswing because all other banks have rejected Anthropic's reaching out. If Microsoft or any company lists a few partner companies on their website, most likely it's either side has reached out and they become partners, not "MSFT has reached for more but they all rejected MSFT" or "other companies have approached but MSFT rejected them all." Especially if Mythos is really that powerful and all those vulnerabilities are real.

#

If everything about Mythos is true, Anthropic rejecting other banks would mean Anthropic simply doesn't care about the security risks of other banks. If other banks reject Anthropic, that means either they don't care about vulnerabilities, or whatever Mythos discovers isn't that critical to them that justifies paying Anthropic.

agile river
# outer folio If everything about Mythos is true, Anthropic rejecting other banks would mean A...

I mean as I mentioned, these banks still need to pay for Mythos API costs, which isn't a small price to pay at all. JP Morgan Chase is a massive company with plenty of funds to spare on cybersecurity, and they practically operate more like a tech company than a banking company. Anthropic also stated themselves that they limited Project Glasswing to about 50 organizations, many of which are foundational and open-source initiatives with a few tech giants.

Also, it's probably worth mentioning that many major banks met after the announcement during an emergency US treasury meeting:

According to TOI, on Tuesday, US Treasury Secretary Scott Bessent and Federal Reserve Chair Jerome Powell convened an urgent meeting with CEOs from major banks, including Bank of America, Citigroup, Goldman Sachs, Morgan Stanley, and Wells Fargo. JPMorgan Chase CEO Jamie Dimon was invited but could not attend. The discussion focused on the cybersecurity risks posed by Anthropic’s unreleased AI model, Claude Mythos Preview, rather than traditional topics like interest rates or tariffs.

It's hard to say exactly why we don't see more bank institutions on there, but the concern is clearly there. My best guess is that it's a combination of query costs and Project Glasswing restrictions that led to less banking firms being on there.

fleet fog
agile river
astral quartz
rough prairie
#

If they're trying to get new massive investments, maybe, I guess, but when ethics are the core of why people work there, any funny business is unlikely in that sense imo.

Danger is obviously subjective, but people already pulled off reasonably large scale cybercrime with Opus. Not hard to imagine a newer model is more dangerous. Did they claim it beats everything else in every domain?

outer folio
#
Reuters

The timing is starting to crystallize for a trio of mega-IPOs, and for those hoping to get some liquidity out of other AI companies going public, there may not be enough investor appetite to go around this year.

tradingkey.com

Anthropic has delayed the launch of its new AI model, Claude Mythos, projecting a responsible image but potentially weighing on its IPO process.

#

I don't know why Ant believers think Anthropic isn't trying to attract more investments

unborn mesa
#

Just coming here to drop my five cents regarding the irony that the model is supposedly dangerous... And then you hand it over to Amazon, Microsoft, ect. with seemingly no checks and balances.

vernal scarab
# floral trail they’re just flexing on us

Bearing in mind Dario also said all coders would be obsolete completely by the end of last year, all diseases eradicated within a year or two etc; got to admire his eternal optimism, or it could of course just be building hype ahead of the IPO to pump investor interest. Could also at least in part be due to the issue that they do not have enough capacity to serve their existing models effectively let alone something heavier; no surprise they are racing to add compute.
I'm sure it will be an impressive enough model when it does arrive, but then that has been true of most of the recent big frontier launches - they all keep pushing the limits.

floral trail
outer folio
#

Nah. You guys are not allowed to doubt Anthropic Anthropic /s

#

Anthropic is a benevolent, humanitarian, completely altruistic charity organization that neither hypes nor needs investments. They get all their money by simply telling Mythos, Generate more money and GPU. Make no mistakes.

rough prairie
#

It's possible they don't believe what they are saying and are just doing it to hype. But it doesn't match their history of operation.

Or they're honestly overestimating it.

Or they're right since they're geniuses who have been using and red-teaming the model and we are amateurs who are operating on very little information.

More money is always good, but that doesn't mean they're willing to risk PR, employee morale and culture, etc. to achieve a higher potential IPO.

peak arch
#

feels like a lot of people are not considering that holding Mythos back from the public is both:

  • A responsible thing to do because it can trivially identify zero days in absolutely foundational software
  • Good marketing for them
#

just because it's good marketing doesn't mean they should release it to the public and go wild

peak arch
#

I mean, frankly I'd rather the pentagon to be able to hack me than the pentagon and everyone else

rough prairie
#

Spoiler: The Pentagon can already hack anyone

peak arch
#

maybe

#

maybe not

#

now they definitely can

short swallow
rough prairie
#

Idk, people sell zero-days and they have unlimited money

peak arch
#

it's just this liminal period

peak arch
#

but security will converge to ~no zero days

rough prairie
#

Yeah software hardening is super needed as AI gets better

#

Or we get GateKrash =]

teal kraken
#

:/

tall mesa
#

We replicated Mythos findings in opencode using public models, not Anthropic's private stack.
︀︀
︀︀The moat is moving from model access to validation: finding vulnerability signal is getting cheaper; turning it into trusted security
︀︀
︀︀A better way to read Anthropic's Mythos release is not "one lab has a magical model."
︀︀
︀︀It is: the economics of vulnerability discovery are changing.
︀︀
︀︀We took the patched public Mythos examples and tried to reproduce them with GPT-5.4 and Claude Opus 4.6 in an open-source harness. Every run stayed below $30 per file.
︀︀
︀︀AI models are already good enough to narrow the search space, surface real leads, and sometimes recover the full root cause in battle-tested code.
︀︀
︀︀The takeaway: model access is not the moat anymore. Validation is. Finding vulnerability signal is getting cheaper; turning it into trusted security work is still hard.
︀︀
︀︀Co-authors: @KlaKlo_, Amadeusz…

light narwhal
#

What amount of hand holding was there? I think it’d be pretty easy to make even a pretty stupid model “discover” a vulnerability with a well crafted prompt that basically spells it out

forest salmon
#

You don’t understand the point they were trying to make

light narwhal
#

I see

rough prairie
#

There is a lot of wiggle room there

#

If Ant said "scour the linux codebase and see what bugs you can find" that's a lot different than "check the linux USB storage code for any endianness errors," or something.

light narwhal
#

Yeah I was imagining why Mythos is this god model is because it doesn’t need an insane amount of hand holding

craggy wolf
# agile river Do you have a source for this? On Anthropic's paper, Sonnet and Opus had sub-15%...

The source comes from their Opus 4.6 collaboration with Mozilla, which they cite in their Mythos paper. Mythos is more successful at turning CVEs into a working renderer level JS RCE. And that is of course dangerous in the wild, but not as scary as their hype tactics suggest https://x.com/kannthu1/status/2042316153440387229

Mythos accomplishes X more successfully. Other models can also accomplish X, but with lower success rates. A working renderer level JS RCE is not something that other models cannot accomplish.

A security researcher will eventually accomplish the same thing with other models. In the real world, with the right loop and levels of money/compute/effort, you can keep investigating and trying again and again, by steering it, hand holding it.

I will say it again, we used GPT5.4 and Opus, and we were able to autonomously find zero-days in the Linux Kernel (in the last 3 weeks)

Mythos is probably better at the task of finding potential issues in code, but imo the threshold for "scary" was reached in December or even

#

The qualitative vs quantitative difference eventually boils down to how much time/money/effort/compute one can throw at it right now with current models

craggy wolf
# light narwhal Yeah I was imagining why Mythos is this *god model* is because it doesn’t need a...

It kinda does. As I said above, in the real world, you can try again as many times as you want. You can steer the model by carrying out repeated runs using knowledge gathered previously.

https://fxtwitter.com/gum1h0x/status/2041917868158202022

ok i read the cyber part of the mythos model card. some thoughts. 250 "trials" across 50 crash categories but almost every full exploit is a permutation of the same 2 bugs, rediscovered from different starting points not 250 independent attempts. when you get rid of those 2 bugs out (fig B) and mythos's full-exploit rate drops to 4.4%. so actually across both setups mythos leverages 4 distinct bugs total not 50 as fig A might suggest. 1/n

**💬 31 🔁 134 ❤️ 1.5K 👁️ 345.4K **

outer folio
#

Oh well

spare arrow
#

Can't wait for my bot to refuse to answer my questions and then tell me I'm going to burn in hell for asking them

agile river
# craggy wolf The source comes from their Opus 4.6 collaboration with Mozilla, which they cite...

I think you're underestimating the genuine security risks of a model that can autonomously exploit vulnerabilities nearly 80% of the time vs. models that need hand-holding to do the same. Those who are trying to do harm to major institutions likely aren't experts in the field of cybersecurity - their only goal is to cause havoc. Mythos would allow them to do that autonomously, which is why it has been held back. Benchmarks aside, it would be irresponsible to release a model that was able to autonomously exploit vulnerabilities.

Also, all of Dawid Moczadło's posts represent a bit of a conflict of interest, considering that they're actively building a platform to secure AI generated code...

glossy bobcat
#

I think that's what most models do nowadays already

rough prairie
#

Chill guys, Dario is a godless SV tech leader in a polycule

#

They plan on consulting many religious and philosophical leaders for thoughts

misty orchid
# spare arrow Can't wait for my bot to refuse to answer my questions and then tell me I'm goin...

did you not see this?

https://support.claude.com/en/articles/14328960-identity-verification-on-claude


 We are rolling out identity verification for a few use cases, and you might see a verification prompt when accessing certain capabilities, as part of our routine platform integrity checks, or other safety and compliance measures.
 
We only use your verification data to confirm who you are and not for any other purposes.

How are we verifying? 
We selected Persona Identities as our verification partner based on the strength of their technology, privacy controls, and security safeguards. Follow the steps below to complete your identity verification process.

What you'll need
Before you start, have these ready:

A valid government-issued photo ID: the physical document, in hand

A phone or a computer with a camera: you may be asked to take a live selfie with your phone, or your webcam

A few minutes: verification typically takes under five minutes```
#

anthropic plans to force ID verify

#

they just say for vague "certain use cases" but you know that means for now, gota slowly introduce it

agile river
#

I do wish they were a bit more clear on what the use cases were

#

i'd hate to get randomly hit with one

misty orchid
#

If they planned it just for certain things they would outline it

#

its vague so they can expand it

whole cliff
#

what, you will just need to be ID verified to use it

knotty moon
#

It's okay Claude, I have a permit

sour sleet
#

Anthropic has said it will hold off on a wider release of the model until it is reassured that it is safe and cannot be abused by bad actors. The company also has a finite amount of computing power and has suffered outages in recent weeks.

Multiple people with knowledge of the matter suggested Anthropic was holding back from a wider release until it could reliably serve the model to customers.

grand robin
#

this train fucked up for them

astral quartz
misty orchid
#

Yea that and Opus 4.7's regression is not a good sign for them

#

GPT 5.4 is better at coding

#

kimi 2.6 is as good if not better everywhere else for 5x cheaper

unborn mesa
#

Sounds like you need some Gemma 4 in your life

short swallow
grand robin
#

same

unborn mesa
#

0.14 last I checked

#

And it's a way, way, way upgrade over the gemma 3s

agile river
short swallow
#

slow broken firefox soon

tall mesa