How would published numbers be useful without knowing what the underlying data being used to test an...

verdverm • yesterday at 7:03 PM • 1 reply • view on HN

How would published numbers be useful without knowing what the underlying data being used to test and evaluate them are? They are proprietary for a reason

To think that Anthropic is not being intentional and quantitative in their model building, because they care less for the saturated benchmaxxing, is to miss the forest for the trees

Replies

aydyn • yesterday at 8:17 PM

Do you know everything that exists in public benchmarks?

They can give a description of what their metrics are without giving away anything proprietary.

➕ show 1 reply

alt Hacker News

Replies