logoalt Hacker News

petralithicyesterday at 5:44 AM1 replyview on HN

Probably because those examples arose in an environment with harm, the Earth, and thus had incent to evolve the capacity to suffer. There is no such case for AI today and creating a Pascal's wager for such minimization is not credible with what we know about them.


Replies

roywigginsyesterday at 4:20 PM

"Wow, adding this input that the AI reports as "unpleasant" substantially improves adherence! Let's iterate on this"