logoalt Hacker News

verdvermyesterday at 6:51 PM1 replyview on HN

Internal evals, Big AI certainly has good, proprietary training and eval data, it's one reason why their models are better


Replies

aydynyesterday at 6:58 PM

Then publish the results of those internal evals. Public benchmark saturation isn't an excuse to be un-quantitative.

show 1 reply