Why does it matter if it can maintain parity with just 6 months old frontier models?

YetAnotherNick • today at 2:16 PM • 1 reply • view on HN

Replies

hmmmmmmmmmmmmmm • today at 2:22 PM

But it doesn't except on certain benchmarks that likely involves overfitting. Open source models are nowhere to be seen on ARC-AGI. Nothing above 11% on ARC-AGI 1. https://x.com/GregKamradt/status/1948454001886003328

➕ show 3 replies

alt Hacker News

Replies