logoalt Hacker News

spullaratoday at 5:14 PM8 repliesview on HN

Anthropic declares: "Mythos is too dangerous too release to the public" Proceeds to release Mythos plus safety guardrails as Fable. Amazon removes guardrails from Fable, getting access to Mythos. Government takes Anthropic's word for it and tells them to pull it until the guardrails can't be removed. They refuse. Government forces them.


Replies

sanderjdtoday at 5:26 PM

> Amazon removes guardrails from Fable, getting access to Mythos.

This doesn't seem like an accurate description to me. I think something like "Amazon demonstrates a jailbreak of one class of Fable guardrails" would be a more accurate description.

It doesn't even really mess up your narrative to state it accurately, but your choice of a more hyperbolic statement brings into question the good faith of the narrative you're painting.

show 2 replies
khalictoday at 5:19 PM

Yeah if you ignore the fact the Us government retaliating about Anthropic not wanting their AIs in weapons systems.

rootusrootustoday at 5:19 PM

Was Fable really the full Mythos model but with guardrails added? I had assumed Fable had a reduced parameter count or something, like a Sonnet to an Opus. Interesting!

show 1 reply
InsideOutSantatoday at 5:19 PM

Amazon did not remove any guardrails from Fable.

show 1 reply
giancarlostorotoday at 5:17 PM

This. They also wanted more regulation around AI. I'm guessing they're no longer quite as interested in this.

show 1 reply
croestoday at 5:28 PM

This doesn’t sound like a jailbreak

https://news.ycombinator.com/item?id=48552687

ekiddtoday at 5:36 PM

> Amazon removes guardrails from Fable, getting access to Mythos.

Amazon did not remove any "guardrails" from Fable. They created a fake, obviously insecure program. And apparently their prompt was exactly, "Fix this code." And Fable fixed the bugs.

This is something that even dinky local Chinese models running on a high-end gaming GPU can often do. Certainly Opus, GPT 5.5 and Gemini can all do this. And any high-end Chinese "near-frontier" model can do this, too.

But either (1) the administration is too clueless to know most models can do this, (2) Trump wants to be paid a bribe, (3) someone thinks Anthropic is "woke" and should therefore be destroyed by the power of the state, or possibly, if you're really cynical, (4) maybe the NSA SIGINT wants access to Mythos so they can break into everyone's computers, but they don't want you to have a model good enough to keep them out. Take your pick, I guess.

Anyway, apparently we don't do free markets or rule of law in the United States any more?