If anything it's a testament to human intelligence that benchmarks haven't really been a g...

alfalfasprout • today at 5:01 PM • 0 replies • view on HN

If anything it's a testament to human intelligence that benchmarks haven't really been a good measure of a model's competence for some time now. They provide a relative sorting to some degree, within model families, but it feels like we've hit an AI winter.

alt Hacker News