Hacker News

esafak, today at 3:12 PM

They are, in benchmarks. In practice, Anthropic's models are ahead of where their benchmarks suggest.


Replies

HNisCIS, today at 4:13 PM

Bear in mind that lead may come, in large part, from the tooling rather than the model.