Hacker News

gradus_ad yesterday at 10:15 PM

Well, the big labs certainly haven't intentionally tried to train away this emergent property... Not sure how "hey, let's make the model disagree with the user more" would go over with leadership. The customer is always right, right?


Replies

htrp yesterday at 10:41 PM

The problem is that asking for user preference leads to sycophantic responses.