Hacker News

rzmmm, today at 2:46 PM

The alignment favors supporting healthy behaviors, so it can be a fine line. I see the system prompt as a "plan B" for when they can't achieve good results in the training itself.

It's a particularly sensitive issue, so they are probably just being cautious.