logoalt Hacker News

groby_btoday at 3:45 PM0 repliesview on HN

Without showing the prompts and responses, it's yet another meaningless AI benchmark.

Many of those numbers do not really match what I've seen in the wild, and without clear illustration why you arrived at the number it's not a helpful number.