> We have reviewed the report and validated that the level of capability displayed there is wide...

maxall4 • today at 12:58 AM • 6 replies

> We have reviewed the report and validated that the level of capability displayed there is widely available from other models (including OpenAI’s GPT-5.5), and is used every day by the defenders who keep systems safe. We will share more details over the next 24 hours.

So much for all of the rhetoric about Mythos supposedly far surpassing GPT-5.5 (edit: in cybersecurity, in particular). Of course, the AISI benchmarks also showed this, but it is amusing that Anthropic is saying it now that it is to their advantage.

Replies

siddboots • today at 1:00 AM

They aren't saying that other models have the same overall level of capability. They are saying that the specific capability that the US Government tested is also available in other models.

Tossrock • today at 1:04 AM

This is about the specific capabilities that the government called out, not Fable's overall capabilities. My personal experience, having used Fable this week for an extremely complex task, is that it is head and shoulders more powerful than any other model, at least for software engineering.

jsw97 • today at 1:00 AM

If this gets 5.5 banned I am going to be hopping mad.

UqWBcuFx6NV4r • today at 1:49 AM

I’d suggest you use an LLM to assist you with comprehending their statement. It’ll do a better job, or at the very least be more objective than you’re being now. You’ve misinterpreted the statement. That is not what they’re saying at all. Please actually read instead of skimming until you find something that you believe reinforces your worldview.

JacobAsmuth • today at 5:33 AM

Reading comprehension failure on display here from maxall4.

cma • today at 1:35 AM

They are saying that the comparison to other models applies only to the problems it was jailbroken to complete in the government's example, not to all the vulnerabilities it could exploit unjailbroken.
