logoalt Hacker News

johnfntoday at 6:38 PM1 replyview on HN

That's just one benchmark, though. Tab to the next one and Sonnet 5 performs better as effort goes up just as you'd expect. I imagine the suggestion is that performance vs effort tradeoff is task dependent.


Replies

energy123today at 6:41 PM

No it doesn't? It's worse than Opus across the whole shared frontier on both plots.

show 1 reply