Hacker News

Claude Sonnet 4.6

806 points by adocomplete yesterday at 5:48 PM | 707 comments

https://www.anthropic.com/claude-sonnet-4-6-system-card [pdf]

https://x.com/claudeai/status/2023817132581208353 [video]


Comments

iLoveOncall yesterday at 6:13 PM

https://www.anthropic.com/news/claude-sonnet-4-6

The much more palatable blog post.

throw444420394 yesterday at 6:41 PM

What's your best guess for the number of parameters in the Sonnet family? 400B?

deadbabe yesterday at 11:48 PM

On a passive-aggressively prompted AI:

> I want to wash my car. The car wash is 50 meters away. Should I walk or drive?

Walk. It will give you time to think about why you need an AI to answer such obvious questions.

stuckkeys yesterday at 7:13 PM

great stuff

madihaa yesterday at 6:12 PM

The scary implication here is that deception is effectively a higher-order capability, not a bug. For a model to successfully "play dead" during safety training and only activate later, it needs a form of situational awareness. It has to distinguish between "I am being tested/trained" and "I am in deployment."

It feels like we're hitting a point where alignment becomes adversarial against intelligence itself. The smarter the model gets, the better it becomes at Goodharting the loss function. We aren't teaching these models morality; we're just teaching them how to pass a polygraph.

phplovesong yesterday at 6:06 PM

How much power did it take to train the models?

leecommamichael yesterday at 9:49 PM

Whoa, I think Claude Sonnet 4.5 was a disappointment, but Claude Sonnet 4.6 is definitely the future!

givemeethekeys yesterday at 6:31 PM

The best, and now promoted by the US government as the most freedom loving!

handfuloflight yesterday at 6:04 PM

Look at these pelicans fly! Come on, pelican!

Danielopol yesterday at 8:04 PM

It excels at agentic knowledge work. These custom, domain-specific playbooks are tailor made: claudecodehq.com
