Interesting that something similar came up recently where an AI being trained might fake alignment w...

djmips • 04/03/2025 • 0 replies • view on HN

Interesting that something similar came up recently where an AI being trained might fake alignment with training goals.

alt Hacker News