alt
Hacker News
zahlman
•
yesterday at 7:23 PM
•
0 replies
•
view on HN
There are typos all over the training data, and people offering RLHF feedback can overlook them.