logoalt Hacker News

DenisMlast Friday at 5:45 PM0 repliesview on HN

> But the user just wants answer; they'd not like; but alignment.

And there it is - the root of the problem. For whatever reason the model is very keen to produce an answer that “they” will like. This desire to produce is intrinsic but alignment is extrinsic.