pbronez • yesterday at 9:27 PM

Yup, agree. “Evaluations” = Tests

Gets pretty meta when you’re evaluating a model which needs to evaluate the output of another agent… gotta pin things down to ground truth somewhere.
