LiveBench (which I like because it tries very hard to avoid contamination) ranks Sonnet 3.5 second o...

thegeomaster • 01/20/2025 • 2 replies • view on HN

LiveBench (which I like because it tries very hard to avoid contamination) ranks Sonnet 3.5 second only to o1 (which is totally expected).

parav • 01/20/2025

LiveCodingBench has DeepSeekR1 at #3 after O1-high and O1-medium https://livecodebench.github.io/leaderboard.html

➕ show 2 replies

behnamoh • 01/20/2025

no, sonnet 3.5 is #7 on LiveBench, even below DeepSeek V3.

➕ show 1 reply

alt Hacker News