Notes and two pelicans: https://simonwillison.net/2025/Nov/24/claude-opus/
Did you write the terminal -> html converter (how you display the claude code transcripts), or is that a library?
I wonder if at this point they read what people use to benchmark with and specifically train it to do well at this task.
i think you have an error there about haiku pricing
> For comparison, Sonnet 4.5 is $3/$15 and Haiku 4.5 is $4/$20.
i think haiku should be $1/$5
:%s/There model/Their model/g
I agree with your sentiment, this incremental evolution is getting difficult to feel when working with code, especially with large enterprise codebases. I would say that for the vast majority of tasks there is a much bigger gap on tooling than on foundational model capability.