Hacker News

renewiltord last Wednesday at 11:59 PM

That’s what Triplebyte planned on. The truth is I don’t trust anyone else to run evals.


thaumasiotes last Thursday at 12:13 AM

> The truth is I don’t trust anyone else to run evals.

It's a common sentiment.

But compare https://www.cambridge.org/core/journals/judgment-and-decisio... . ("People predicting the future performance of college students state that interviewing the students aids prediction, although in fact the interviews make predictions less accurate.")

tonymet last Thursday at 12:07 AM

you're right, there are quality issues, some probably deliberate.

but the screening cost for companies is eye-watering, so something should be done.