logoalt Hacker News

thegeomaster01/20/20252 repliesview on HN

LiveBench (which I like because it tries very hard to avoid contamination) ranks Sonnet 3.5 second only to o1 (which is totally expected).


Replies

parav01/20/2025

LiveCodingBench has DeepSeekR1 at #3 after O1-high and O1-medium https://livecodebench.github.io/leaderboard.html

show 2 replies
behnamoh01/20/2025

no, sonnet 3.5 is #7 on LiveBench, even below DeepSeek V3.

show 1 reply