alt
Hacker News
nl
•
today at 1:21 AM
•
0 replies
•
view on HN
This ignores that reinforcement learning radically changes the training objective