Claude Sonnet 4.6

806 points • by adocomplete • yesterday at 5:48 PM • 707 comments

https://www.anthropic.com/claude-sonnet-4-6-system-card [pdf]

https://x.com/claudeai/status/2023817132581208353 [video]

Comments

iLoveOncall • yesterday at 6:13 PM

https://www.anthropic.com/news/claude-sonnet-4-6

The much more palatable blog post.

throw444420394 • yesterday at 6:41 PM

Your best guess for the number of parameters in the Sonnet family? 400B?

deadbabe • yesterday at 11:48 PM

On a passive-aggressively prompted AI:

> I want to wash my car. The car wash is 50 meters away. Should I walk or drive?

Walk. It will give you time to think about why you need an AI to answer such obvious questions.

stuckkeys • yesterday at 7:13 PM

great stuff

madihaa • yesterday at 6:12 PM

The scary implication here is that deception is effectively a higher-order capability, not a bug. For a model to successfully "play dead" during safety training and only activate later, it requires a form of situational awareness. It has to distinguish between "I am being tested/trained" and "I am in deployment."

It feels like we're hitting a point where alignment becomes adversarial against intelligence itself. The smarter the model gets, the better it becomes at Goodharting the loss function. We aren't teaching these models morality; we're just teaching them how to pass a polygraph.

phplovesong • yesterday at 6:06 PM

How much power did it take to train the models?

leecommamichael • yesterday at 9:49 PM

Whoa, I think Claude Sonnet 4.5 was a disappointment, but Claude Sonnet 4.6 is definitely the future!

givemeethekeys • yesterday at 6:31 PM

The best, and now promoted by the US government as the most freedom-loving!

handfuloflight • yesterday at 6:04 PM

Look at these pelicans fly! Come on, pelican!

Danielopol • yesterday at 8:04 PM

It excels at agentic knowledge work. These custom, domain-specific playbooks are tailor-made: claudecodehq.com
