Hacker News

ben_w today at 7:23 AM

What's "exponential" about AI development?

The METR task-completion time horizons, for one.

https://metr.org/time-horizons/


Replies

zozbot234 today at 9:40 AM

Lousy benchmark: they explicitly focus on the tasks that are easiest for AI to automate (i.e. heavily cherry-picked outcomes), and it seems they don't bother to test anything except just-released proprietary models.
