LLMs have a large quantity of chess data and still can't play for shit.

cesarvarela • yesterday at 9:23 PM • 4 replies • view on HN

Replies

Not anymore. This benchmark is for LLM chess ability: https://github.com/lightnesscaster/Chess-LLM-Benchmark?tab=r.... LLMs are graded according to FIDE rules so e.g. two illegal moves in a game leads to an immediate loss.

This benchmark doesn't have the latest models from the last two months, but Gemini 3 (with no tools) is already at 1750 - 1800 FIDE, which is approximately probably around 1900 - 2000 USCF (about USCF expert level). This is enough to beat almost everyone at your local chess club.

➕ show 4 replies

iugtmkbdfil834 • yesterday at 9:31 PM

Hm.. but do they need it.. at this point, we do have custom tools that beat humans. In a sense, all LLM need is a way to connect to that tool ( and the same is true is for counting and many other aspects ).

➕ show 1 reply

BeetleB • yesterday at 10:02 PM

Are you saying an LLM can't produce a chess engine that will easily beat you?

➕ show 1 reply

menaerus • yesterday at 10:05 PM

Did you already forget about the AlphaZero?

alt Hacker News

Replies