logoalt Hacker News

lukevyesterday at 8:50 PM1 replyview on HN

I think we should all consider the possibility that part of the reason Anthropic hasn't immediately released Mythos is that it would be slightly disappointing relative to the benchmark scores.


Replies

eiensyesterday at 9:02 PM

The models don’t get better on every dimension as they scale up - there’s trade offs.

I’m convinced specialised models are the way but this means writing off the investment in existing assets which they won’t do for obvious reasons.

show 2 replies