An Anthropic safety researcher just recently quit with very cryptic messages, saying "the worl...

the_duke • yesterday at 7:07 PM • 14 replies

An Anthropic safety researcher recently quit with very cryptic messages, saying "the world is in peril"... [1] (which may mean something, or nothing at all)

Codex quite often refuses to do "unsafe/unethical" things that Anthropic models will happily do without question.

Anthropic just raised 30bn... OpenAI wants to raise 100bn+.

Thinking any of them will actually be restrained by ethics is foolish.

[1] https://news.ycombinator.com/item?id=46972496

Replies

mobattah • yesterday at 7:37 PM

“Cryptic” exit posts are basically noise. If we are going to evaluate vendors, it should be on observable behavior and track record: model capability on your workloads, reliability, security posture, pricing, and support. Any major lab will have employees with strong opinions on the way out. That is not evidence by itself.

spondyl • yesterday at 7:17 PM

If you read the resignation letter, it appears so cryptic as to not be a real warning at all, and perhaps instead the writing of someone exercising their options to go and write poems.

skybrian • yesterday at 7:41 PM

The letter is here:

https://x.com/MrinankSharma/status/2020881722003583421

A slightly longer quote:

> The world is in peril. And not just from AI, or from bioweapons, but from a whole series of interconnected crises unfolding at this very moment.

In a footnote he refers to the "poly-crisis."

There are all sorts of things one might decide to do in response, including getting more involved in US politics, working more on climate change, or working on other existential risks.

zamalek • yesterday at 7:56 PM

I think we're fine: https://youtube.com/shorts/3fYiLXVfPa4?si=0y3cgdMHO2L5FgXW

Claude invented something completely nonsensical:

> This is a classic upside-down cup trick! The cup is designed to be flipped — you drink from it by turning it upside down, which makes the sealed end the bottom and the open end the top. Once flipped, it functions just like a normal cup. *The sealed "top" prevents it from spilling while it's in its resting position, but the moment you flip it, you can drink normally from the open end.*

Emphasis mine.

stronglikedan • yesterday at 7:58 PM

Not to diminish what he said, but it sounds like it didn't have much to do with Anthropic (although it did a little bit) and more to do with burning out and dealing with doomscroll-induced anxiety.

vunderba • yesterday at 8:32 PM

> Codex quite often refuses to do "unsafe/unethical" things that Anthropic models will happily do without question.

I can't really take this very seriously without seeing the list of these ostensible "unethical" things that Anthropic models will allow over other providers.

ljm • yesterday at 7:30 PM

I'm building a new hardware drum machine that is powered by voltage based on fluctuations in the stock market, and I'm getting a clean triangle wave from the predictive markets.

Bring on the cryptocore.

WesolyKubeczek • yesterday at 7:13 PM

> Codex quite often refuses to do "unsafe/unethical" things that Anthropic models will happily do without question.

That's why I have a functioning brain, to discern between ethical and unethical, among other things.

groundzeros2015 • yesterday at 7:44 PM

Marketing

tsss • yesterday at 7:37 PM

Good. One thing we definitely don't need any more of is governments and corporations deciding for us what is moral to do and what isn't.

bflesch • yesterday at 7:27 PM

Wasn't that most likely related to the US government using Claude for large-scale screening of citizens and their communications?

ReptileMan • yesterday at 7:23 PM

> Codex quite often refuses to do "unsafe/unethical" things that Anthropic models will happily do without question.

Thanks for the successful pitch. I am seriously considering them now.

idiotsecant • yesterday at 8:51 PM

That guy's blog makes him seem insufferable. All signs point to drama and nothing of particular significance.

manmal • yesterday at 7:36 PM

Codex warns me to renew API tokens if it ingests them (accidentally?). Opus starts the decompiler as soon as I ask it how this and that works in a closed binary.
