Hacker News

conception · today at 6:01 PM

Probably explains why Opus was trash for the last week: https://marginlab.ai/trackers/claude-code/. Curious whether the new baseline will now rise in line with the new benchmarks.


Replies

hedora · today at 6:05 PM

Nice. Can you release that for older models too? I've been using a mixture of releases recently, and cannot tell the difference between any of them.
