logoalt Hacker News

We Put Claude Code in Rollercoaster Tycoon

119 pointsby iamwillast Monday at 2:28 PM46 commentsview on HN

Comments

hk__2today at 4:59 PM

> The only other notable setback was an accidental use of the word "revert" which Codex took literally, and ran git revert on a file where 1-2 hours of progress had been accumulating.

show 2 replies
pocketarctoday at 5:15 PM

I love the interview at the end of the video. The kubectl-inspired CLI, and the feedback for improvements from Claude, as well as the alerts/segmentation feedback.

You could take those, make the tools better, and repeat the experience, and I'd love to see how much better the run would go.

I keep thinking about that when it comes to things like this - the Pokemon thing as well. The quality of the tooling around the AI is only going to become more and more impactful as time goes on. The more you can deterministically figure out on behalf of the AI to provide it with accurate ways of seeing and doing things, the better.

Ditto for humans, of course, that's the great thing about optimizing for AI. It's really just "if a human was using this, what would they need"? Think about it: The whole thing with the paths not being properly connected, a human would have to sit down and really think about it, draw/sketch the layout to visualize and understand what coordinates to do things in. And if you couldn't do that, you too would probably struggle for a while. But if the tool provided you with enough context to understand that a path wasn't connected properly and why, you'd be fine.

lukebechteltoday at 4:36 PM

> We don't know any C++ at all, and we vibe-coded the entire project over a few weeks. The core pieces of the build are…

what a world!

show 2 replies
fnordpiglettoday at 5:01 PM

Interesting article but it doesn’t actually discuss how well it performs at playing the game. There is in fact a 1.5 hour YouTube video but it woulda been nice for a bit of an outcome postmortem. It’s like “here’s the methods and set up section of a research paper but for the conclusion you need to watch this movie and make your own judgements!”

show 2 replies
nipponesetoday at 5:05 PM

> kept the context above the ~60% remaining level where coding models perform at their absolute best

Maybe this is obvious to Claude users but how do you know your remaining context level? There is UI for this?

show 3 replies
hauntertoday at 5:13 PM

This is what I want but for PoE/PoE2 builds. I always get a headache just looking at the passive tree https://poe.ninja/poe2/passive-skill-tree

equinumeroustoday at 5:08 PM

This is a cool idea. I wanted to do something like this by adding a Lua API to OpenRCT2 that allows you to manipulate and inspect the game world. Then, you could either provide an LLM agent the ability to write and run scripts in the game, or program a more classic AI using the Lua API. This AI would probably perform much better than an LLM - but an interesting experiment nonetheless to see how a language model can fare in a task it was not trained to do.

show 1 reply
rnmmrnmtoday at 5:35 PM

this is cute but i imagined prompting the ai for a loop-di-loop roller coaster. If this could build complex ride it would be a game changer.

mentostoday at 4:42 PM

The opening paragraph I thought was the agent prompt haha

> The park rating is climbing. Your flagship coaster is printing money. Guests are happy, for now. But you know what's coming: the inevitable cascade of breakdowns, the trash piling up by the exits, the queue times spiraling out of control.

khourytoday at 4:05 PM

Can't wait for someone to let Claude control a runescape character from scratch

show 3 replies
neomtoday at 5:10 PM

Wonder how it would do with Myst.

skybriantoday at 4:12 PM

Would a way to take screenshots help? It seems to work for browser testing.

show 1 reply
HelloUsernametoday at 4:06 PM

*OpenRCT2

nacozarinalast Monday at 2:34 PM

next up: Crusader Kings III

show 2 replies
huflungdungtoday at 4:44 PM

[dead]

azhenleytoday at 4:43 PM

Edit: HN's auto-resubmit in action, ignore.

show 1 reply