alt
Hacker News
djeastm
•
today at 2:45 PM
•
0 replies
•
view on HN
I thought reinforcement learning with human feedback was meant to get that quantification of "taste"