logoalt Hacker News

Youdentoday at 3:58 PM1 replyview on HN

They mentioned LMArena, you can get the results for that here: https://lmarena.ai/leaderboard/text

Mistral Large 3 is ranked 28, behind all the other major SOTA models. The delta between Mistral and the leader is only 1418 vs. 1491 though. I *think* that means the difference is relatively small.


Replies

jampekkatoday at 4:42 PM

1491 vs 1418 ELO means the stronger model wins about 60% of the time.

show 1 reply