I do not believe that LLMs fear punishment like human employees do.

johndough • yesterday at 4:05 PM • 1 reply

Replies

devolving-dev • yesterday at 4:17 PM

Whether driven by fear, by their model weights, or by whatever else, I don't think the likelihood of an AI agent (at least the current ones like Claude and Codex) acting maliciously to harm my systems is much different from the risk of a human employee doing so. And I think this is the philosophical difference: those who embrace the agents view them as akin to humans, while those who sandbox them view them as akin to computer viruses that you study within a sandbox. It seems to me that the human analogy is more accurate, but I can see arguments for the other position.
