I still am struggling to understand why they informed the government about something that is known t...

Topfi • yesterday at 6:12 PM • 12 replies • view on HN

I still am struggling to understand why they informed the government about something that is known to be an issue in every LLM. There is no LLM that cannot be jailbroken, so unless this means that we have reached the absolute maximum publicly accessible US made LLMs are allowed to operate at with GPT 5.5, this is not grounded in any sane regulation attempt.

Does anyone know what limits Fable 5 has overstepped in the eyes of the government? Parameter count? Certain benchmark results? Training computer?

Cause if it’s just the ability to assist with cyberattacks and being jailbreakable, there is no model previously released that isn’t equally guilty.

Remember that for GPT 5.5 and 5.4, OpenAI also restricted the cybersecurity focused use under designated models, otherwise rerouting to 5.3-codex like Fable did with Opus 4.8. And both OpenAI models can also be jailbroken all the same.

Basically, what was the reason to tell the government now and not with Opus 4.5 or GPT 5.4? sama has been doing the rounds with apocalyptic predictions…

Replies

themgt • yesterday at 9:17 PM

I submitted separately, but this Axios report has some details that call a lot of the speculation in this thread into question, i.e. that this wasn't much of a "jailbreak" at all and that it's not Anthropic-specific - the White House intends to generally regulate Mythos-class models (whatever exactly that means):

Between the lines: The government's response "seems way out of line with what's actually in the research report," Luta Security CEO Katie Moussouris, who Anthropic shared the Amazon report with, told Axios.

Moussouris said the researchers were able to find security vulnerabilities by asking questions normal defenders would ask AI, which is exactly what the model was intended to do.

An administration official told Axios they do not view other models as national security threats because they do not surpass the bar that Mythos set.

Anything at Mythos level or above would need to go through the administration to ensure the government's national security apparatus is hardened enough, the official added.

https://www.axios.com/2026/06/13/anthropic-amazon-white-hous...

➕ show 2 replies

irthomasthomas • yesterday at 10:24 PM

They literally asked for it. Two days ago Amodei wrote an essay urging the government to regulate them. He explicitly cited Mythos, as proof that frontier AI has acquired autonomous hacking capabilities that threaten critical infrastructure and national security.

  "Mythos Preview scrambled the global cybersecurity landscape. But its broader significance is that it proves beyond doubt that AI models are now tools of global and national strategic consequence." 


  "The government should have the power to block or deter deployment of the model if it is determined, in light of third-party assessment, to present unacceptable risks. This power must be scoped to the above four specific risks and there must be protective measures against political favoritism or arbitrary decisions"

https://darioamodei.com/post/policy-on-the-ai-exponential

A third-party demonstrated that it was possible to jailbreak the safety measures of Fable to access the raw Mythos abilities. Abilities which Anthropic say are too dangerous for the public.

trinsic2 • yesterday at 10:52 PM

>I still am struggling to understand why they informed the government about something that is known to be an issue in every LLM. There is no LLM that cannot be jailbroken, so unless this means that we have reached the absolute maximum publicly accessible US made LLMs are allowed to operate at with GPT 5.5, this is not grounded in any sane regulation attempt.

I wondering where you are getting the idea that there is an sane regulation right now?

lebovic • yesterday at 6:38 PM

Claims of retribution aside, one strawman is that Mythos is likely the most capable model that's usable by folks like the NSA [1], and decision-makers across the USG and industry partners have seen a stream of reports of Mythos successfully finding serious vulnerabilities over the past couple months due to Glasswing.

So even if GPT 5.5 is just as capable in these scenarios (which, imo, it largely is), it is not known by the government apparatus as having the same capabilities.

Personally, I think we crossed the threshold of capabilities with Opus 4.6 [2], which translated to an even more capable open-weight GLM 5.1 (which it is rumored to have distilled Opus 4.6) [3][4]. But the USG and its partners aren't fully rational actors with perfect data, so it's possible they're only viscerally aware of these capabilities in the context of Mythos.

[1]: https://www.reuters.com/business/us-security-agency-is-using...

[2]: Opus 4.6 was used for https://www.noahlebovic.com/testing-an-autonomous-hacker/

[3]: See GLM 5.1 scoring in https://www.cybergym.io/cybergym/

[4]: https://dualuse.dev/posts/chinese-models-are-sometimes-bette...

➕ show 1 reply

thayne • yesterday at 9:18 PM

The only reason I can see is because Amazon wanted something like this to happen. But I'm not sure what Amazon would gain from that, since they don't have their own competing frontier models.

➕ show 3 replies

Jcampuzano2 • yesterday at 6:54 PM

The reason is pretty obvious. Anthropic tried to play hardball with the government and now they are under their thumb for scrutiny of any and every little thing they do.

That's what this admin is known for. If you do even what a normal person would think is sane but they don't like it, well now they need to make you bow down and break you so you "learn your lesson".

It doesn't help that they themselves marketed this model as being especially dangerous in the publics hands. If this was just another model drop and none of the fear mongering I don't doubt this probably wouldn't have had any issues.

➕ show 4 replies

m3kw9 • yesterday at 9:42 PM

Because based upon on what Anthropic has told the “AI people” and military, it is dangerous if an adversary gets its hands in the cyber capabilities. Knowing that if they ignored it and something did happen, heads will roll. Blame Anthropic for that, or wait if they are all for safety, they shouldnt complain.

nowittyusername • yesterday at 9:11 PM

The simple answer is that Trump has a stick up his ass against Anthropic and is also fond of stock market manipulation. No need to get too deep when it comes to dealing with that orange shmuck.

➕ show 1 reply

vrganj • yesterday at 6:15 PM

Its not Fable 5 that overstepped in the eyes of the US government.

It's Anthropic.

This is transparent revenge for them daring to try and push back a little on enabling war crimes.

➕ show 7 replies

giancarlostoro • yesterday at 8:30 PM

Reminds me of people freaking out about the Grok Bikini thing, but GPT and Googles image model they all do the same behavior. Clearly biased against Elon Musk despite it being a problem for every single image model out there.

alt Hacker News

Replies