logoalt Hacker News

hnuser123456yesterday at 10:29 PM0 repliesview on HN

LLMs became sycophantic and effusive because those responses were rated higher during RLHF, until it became newsworthy how obviously eager-to-please they got, so yes, being highly factually correct and "intelligent" was already not the only priority.