logoalt Hacker News

cesarvarelayesterday at 9:23 PM4 repliesview on HN

LLMs have a large quantity of chess data and still can't play for shit.


Replies

dwohnitmokyesterday at 9:43 PM

Not anymore. This benchmark is for LLM chess ability: https://github.com/lightnesscaster/Chess-LLM-Benchmark?tab=r.... LLMs are graded according to FIDE rules so e.g. two illegal moves in a game leads to an immediate loss.

This benchmark doesn't have the latest models from the last two months, but Gemini 3 (with no tools) is already at 1750 - 1800 FIDE, which is approximately probably around 1900 - 2000 USCF (about USCF expert level). This is enough to beat almost everyone at your local chess club.

show 4 replies
iugtmkbdfil834yesterday at 9:31 PM

Hm.. but do they need it.. at this point, we do have custom tools that beat humans. In a sense, all LLM need is a way to connect to that tool ( and the same is true is for counting and many other aspects ).

show 1 reply
BeetleByesterday at 10:02 PM

Are you saying an LLM can't produce a chess engine that will easily beat you?

show 1 reply
menaerusyesterday at 10:05 PM

Did you already forget about the AlphaZero?