Hacker News

t0md4n last Sunday at 6:58 PM

https://arxiv.org/abs/2501.17186


Replies

yosefk last Sunday at 7:16 PM

This is interesting. The "professional level" rating of <1800 isn't really professional level, but still.

However:

"A significant Elo rating jump occurs when the model’s Legal Move accuracy reaches 99.8%. This increase is due to the reduction in errors after the model learns to generate legal moves, reinforcing that continuous error correction and learning the correct moves significantly improve ELO"

You should be able to reach move legality of around 100% with few resources spent on it. Failing to do so means the model has not learned, at some basic level, what chess is. There is virtually no challenge in making legal moves.
