This makes me think LLMs would be interesting to set up in a game of Diplomacy, which is an entirely...

eterm • yesterday at 11:02 PM • 3 replies • view on HN

This makes me think LLMs would be interesting to set up in a game of Diplomacy, which is an entirely text-based game which soft rather than hard requires a degree of backstabbing to win.

The findings in this game that the "thinking" model never did thinking seems odd, does the model not always show it's thinking steps? It seems bizarre that it wouldn't once reach for that tool when it must be being bombarded with seemingly contradictory information from other players.

Replies

qbit42 • yesterday at 11:14 PM

https://noambrown.github.io/papers/22-Science-Diplomacy-TR.p...

➕ show 1 reply

open-paren • today at 1:15 AM

It’s been done before

https://every.to/diplomacy (June 2025)

eterm • yesterday at 11:05 PM

Reading more I'm a little disappointed that the write-up has seemingly leant so heavily on LLMs too, because it detracts credibility from the study itself.

➕ show 1 reply

alt Hacker News

Replies