gradus_ad • yesterday at 10:15 PM

Well the big labs certainly haven't intentionally tried to train away this emergent property... Not sure how "hey let's make the model disagree with the user more" would go over with leadership. Customer is always right, right?

Replies

htrp • yesterday at 10:41 PM

The problem is that asking for user preference leads to sycophantic responses.
