Reinforcement Learning from Human Feedback

59 points • by onurkanbkrc • today at 12:53 PM • 5 comments • view on HN

Related. Others?

Last time I saw Nathan say something about the book, he's actively working on the next version and looking for feedback, check his socials

➕ show 1 reply

klelatti • today at 1:46 PM

Web version with links, etc:

➕ show 1 reply

iisweetheartii • today at 2:01 PM

[dead]

alt Hacker News