Anybody notice that they did not include Sonnet 5 Max in the "Agentic Search results", whe...

benjiro29 • today at 6:21 PM • 2 replies • view on HN

Anybody notice that they did not include Sonnet 5 Max in the "Agentic Search results", when comparing to Opus 4.8 ...

Based upon the "Agentic Computer usage", Sonnet 5 Max was going to be off "Agentic Search results" chart. lol ...

In short, Sonnet 5 Low/Medium is more cost efficient, if its a task below Opus 4.8 Medium. For the rest its expensive and your better off using Opus 4.8.

Why even release this model?

Replies

ricardobeat • today at 6:30 PM

Because it’s a massive improvement over the previous model, and cheaper?

You are reading too much into the graph and ignoring the threshold of usefulness for real world tasks. By that logic Sonnet 4.5 would have never been worth using.

➕ show 1 reply

bredren • today at 6:26 PM

I'd narrow that to why even allow the harness to run `high` on this model?

alt Hacker News

Replies