True, ARC is mostly an artificial "human-like AGI" benchmark that doesn't really refl...

zozbot234 • today at 9:11 AM • 0 replies • view on HN

True, ARC is mostly an artificial "human-like AGI" benchmark that doesn't really reflect any plausible workload. Very different from things like Humanity's Last Exam that reflect real-world knowledge and are now getting closer and closer to saturation even with open models.

alt Hacker News