
satvikpendem • today at 6:40 PM

RL on users' harness inputs and outputs is one of the primary drivers of model performance, a self-perpetuating flywheel.
