convexly • yesterday at 11:38 PM

My issue with AGI benchmarks is you can never tell if you're measuring actual capability or just how much the training data overlapped with the test.
