alt
Hacker News
revel
•
today at 3:33 AM
•
0 replies
•
view on HN
Running benchmarks at scale and protecting against reward hacking is non-trivial.