logoalt Hacker News

nltoday at 1:21 AM0 repliesview on HN

This ignores that reinforcement learning radically changes the training objective