Hacker News

mbesto, last Friday at 12:05 PM

> AI benchmarks suck.

Not only do they suck, but building a good one is essentially an impossible task, since there is no frame of reference for what "good code" looks like.