logoalt Hacker News

Upvoter33yesterday at 6:03 PM0 repliesview on HN

I read this differently: they are actually seeing that it's hard to keep advancing frontier models, and now are moving the goal posts so that when they start getting evaluated more harshly, they can point to something like this.