logoalt Hacker News

gpt5today at 7:59 AM0 repliesview on HN

ARC-AGI isn't perfect, but it helps demonstrates the gap. I'm sure all companies optimize their models for this benchmark given its dominance.