Hacker News

neversupervised, yesterday at 3:04 PM

But this is the good kind of goalpost moving


Replies

iLoveOncall, yesterday at 3:13 PM

Only if you didn't read the article.

They're saying they need to move on from it because the benchmark is flawed (without bringing in proof) and that's why they can't hit 100%.

It's not an "our models are so good that the benchmark is too easy" thing.
