djeastm • today at 2:45 PM

I thought reinforcement learning from human feedback (RLHF) was meant to get that quantification of "taste".
