Show HN: 1v1 coding game that LLMs struggle with

21 points • by levmiseri • yesterday at 6:47 AM • 6 comments • view on HN

This is a game I wish I had as a kid learning programming. The concept of it is fairly similar to other coding games like Screeps, but instead of a complex world with intricate mechanics, Yare is a lot more minimal and approachable with quick 1v1 <3 min matches.

It's purely a passion project with no monetization aspirations. And it's open source: https://github.com/riesvile/yare

The first version 'launched' several years ago and I got some good feedback here: https://news.ycombinator.com/item?id=27365961 that I iterated on.

The latest overhaul is a result of simplifying everything while still keeping the skill ceiling high. And at least the LLMs seem to struggle with this challenge for now (I run a small tournament between major models - results and details here: https://yare.io/ai-arena

I'd love to hear your thoughts

Comments

dang • today at 1:39 AM

Macroexpanding the previous threads:

Show HN: Yare 2 – Programmable RTS game - https://news.ycombinator.com/item?id=32394902 - Aug 2022 (26 comments)

Show HN: Yare.io – game where you control units with JavaScript - https://news.ycombinator.com/item?id=27365961 - June 2021 (64 comments)

(Btw, reposts are fine after a year or so; links to past threads are just to satisfy extra-curious readers!)

vessenes • today at 2:48 AM

Cool!

From the prompt it looks like you don’t give the llms a harness to step through games or simulate - is that correct? If so I’d suggest it’s not a level playing field vs human written bots - if the humans are allowed to watch some games that is.

➕ show 1 reply

javadhu • yesterday at 10:18 AM

Cool project, this is my first time seeing such project using LLMs. Took me a while to understand what's happening on the home page.

A question though, why such powerful bots like Gemini 3.1 failed against Clowder bot? Is it because of inefficient code or the LLMs did not handle edge cases? Or they are not as good as humans when it comes to strategy.

➕ show 1 reply

DeathArrow • today at 8:44 AM

LLMs need to have feedback of the outcomes. Just like a human does.

alt Hacker News

Show HN: 1v1 coding game that LLMs struggle with

Comments