logoalt Hacker News

johndoughyesterday at 4:05 PM1 replyview on HN

I do not believe that LLMs fear punishment like human employees do.


Replies

devolving-devyesterday at 4:17 PM

Whether driven by fear or by their model weights or whatever, I don't think that the likelihood of an AI agent, at least the current ones like Claude and Codex, acting maliciously to harm my systems is much different than the risk of a human employee doing so. And I think this is the philosophical difference between those who embrace the agents, they view them as akin to humans, while those who sandbox them view them as akin to computer viruses that you study within a sandbox. It seems to me that the human analogy is more accurate, but I can see arguments for the other position.

show 1 reply