logoalt Hacker News

thegeomaster01/20/20251 replyview on HN

The parent comment was talking about coding specifically, not the average score. I see o1 at 69.69, and Claude 3.5 Sonnet at 67.13.


Replies

sebastiennight01/21/2025

o1's score looks like exactly what I would expect Elon Musk to aim for with Grok's benchmarks