Exposure to horrors doesn't imply capability or desire to commit said horrors. But it does seem...

paytonjjones • today at 2:35 AM • 0 replies • view on HN

Exposure to horrors doesn't imply capability or desire to commit said horrors. But it does seem like kind of a prerequisite.

All else being equal, I think I'd prefer my models to be naive about human degradation and torture, for instance. Exceptions made for specialized models used for police work etc.

I do think broader alignment is necessary either way but that seems like an extra guardrail it'd be nice to have.

alt Hacker News