logoalt Hacker News

YetAnotherNicktoday at 2:16 PM1 replyview on HN

Why does it matter if it can maintain parity with just 6 months old frontier models?


Replies

hmmmmmmmmmmmmmmtoday at 2:22 PM

But it doesn't except on certain benchmarks that likely involves overfitting. Open source models are nowhere to be seen on ARC-AGI. Nothing above 11% on ARC-AGI 1. https://x.com/GregKamradt/status/1948454001886003328

show 3 replies