It's a bit odd that they're not comparing it against Sonnet
I don't think so. They're comparing it to the highest tier available models from Anthropic and OpenAI. Generally speaking, Opus is better than Sonnet in almost every way, so why have the redundancy?
The tweet specifies that the new model is geared towards long-running tasks, which is what you'd use a model like Opus for anyway.
I don't think so. They're comparing it to the highest tier available models from Anthropic and OpenAI. Generally speaking, Opus is better than Sonnet in almost every way, so why have the redundancy?