If the threat model was weighted by the stakes, then I wonder how the author would reassess their co...

whacked_new • today at 5:30 AM • 0 replies • view on HN

If the threat model was weighted by the stakes, then I wonder how the author would reassess their comfort level. Put to the extreme, the experiment could be whether the AI assistant could be trusted to keep a dangerous AI in a box a la https://rationalwiki.org/wiki/AI-box_experiment where the stakes are assumed much higher

alt Hacker News