solumunus • today at 5:25 AM

You’re completely overrating these benchmarks and it’s landing you at a nonsense opinion. Just actually use the models and you will see that the gap is significant.

Replies

irthomasthomas • today at 9:42 AM

It should be easy for a company like Anthropic to prove this beyond a doubt. Why don't they? Why don't they have a collection of prompts and side-by-side comparisons with other models showing how far ahead they are?

