logoalt Hacker News

LargoLasskhyfvlast Sunday at 10:37 AM1 replyview on HN

Errm, No :-) I meant bars as in benchmarks, often rather meaningless, because within the range of statistic noise.

For instance, something having 100.200 points in one config, in another 100.220, with the bars/scales distorted to make that difference seem much larger.

Gaming the bar-game, so to speak.


Replies

alpaca128last Sunday at 1:09 PM

OpenAI recently played a bit too hard with their GPT-5 announcement. Two bars with the same height but wildly different values, things like that. Such a lack of subtlety that their claim it was accidental is actually almost believable.