logoalt Hacker News

balefulboyyesterday at 9:25 PM0 repliesview on HN

METR's time horizon is not a reliable metric of LLM capability growth: https://www.transformernews.ai/p/against-the-metr-graph-codi...