Hacker News

benjiro29 today at 6:21 PM

Anybody notice that they did not include Sonnet 5 Max in the "Agentic Search results", when comparing to Opus 4.8 ...

Based upon the "Agentic Computer usage", Sonnet 5 Max was going to be off the "Agentic Search results" chart. lol ...

In short, Sonnet 5 Low/Medium is more cost-efficient if it's a task below Opus 4.8 Medium. For the rest, it's expensive and you're better off using Opus 4.8.

Why even release this model?


Replies

ricardobeat today at 6:30 PM

Because it’s a massive improvement over the previous model, and cheaper?

You are reading too much into the graph and ignoring the threshold of usefulness for real-world tasks. By that logic, Sonnet 4.5 would never have been worth using.

bredren today at 6:26 PM

I'd narrow that to: why even allow the harness to run `high` on this model?