Hacker News

Aerroon, yesterday at 10:56 PM

But AI isn't going to be unaligned. It's going to be aligned the same way we are because it learns from our data.


Replies

drcode, yesterday at 11:10 PM

we mostly know how to make it understand what we want. we don't know how to make it care about what we want, except via reinforcement learning. there are good reasons to believe rl won't work for this once the ai reaches a certain level of capability.
