Why do they always cut off 70% of the y-axis? Sure it exaggerates the differences, but... it exaggerates the differences.
And they left Haiku out of most of the comparisons! That's the most interesting model for me. Because for some tasks it's fine. And it's still not clear to me which ones those are.
Because in my experience, Haiku sits at this weird middle point where, if you have a well-defined task, you can use a smaller/faster/cheaper model than Haiku, and if you don't, then you need to reach for a bigger/slower/costlier model than Haiku.
Marketing.
It’s a pretty arbitrary y-axis; arguably the only thing that matters is the differences.