logoalt Hacker News

chaiseyesterday at 8:11 PM1 replyview on HN

The official leaderboard for ARC-AGI-3 for current LLMs : https://arcprize.org/leaderboard (yous should select the 3th leaderboard)

CRAZY 0.1% in average lmao


Replies

Corenceyesterday at 8:17 PM

Note the scoring function is significantly different for ARC-AGI-3. It isn't the percentage of tests passed like previous versions, it's the square of the efficiency ratio -- how many steps the model needed vs the second best human.

So if a model can solve every question but takes 10x as many steps as the second best human it will get a score of 1%.