logoalt Hacker News

vessenestoday at 6:27 PM2 repliesview on HN

This is an interesting document, in that it reads like a Claude Mythos model card that was hastily edited to be an Opus 4.7 model card.

I surmise that someone at the top put the Mythos release on hold, and the product team was told "ship this other interim step model instead. quickly."

I wonder if 4.7 will be seen as a net step-up in quality; there are some regressions noted in the document, and it's clearly substantially worse than Mythos, at least according to its own model card. Should be an interesting few months -- if I were at oAI I'd be rushing to get something out that's clearly better, and pressing for weakness here.


Replies

barneyboorootoday at 8:23 PM

Yeah, the section expanding on how they evaluated Mythos internally is a bit baffling considering how irrelevant it is.

the13today at 6:32 PM

What makes you think that? "it reads like a Claude Mythos model card that was hastily edited to be an Opus 4.7 model card"

show 1 reply