Hacker News

catigula, last Friday at 4:45 PM

That does feel a little more like over-fitting, but you might be able to argue that there's some philosophical proximity to lying.

I think, largely, the

  Pre-training -> Post-training -> Safety/Alignment training
pipeline would obviously produce 'lying'. The training stages are in a sort of mutual dissonance.