For a game like anchorhead, which is famous in its niche, shouldn’t Claude already know it sufficien...

pflenker • yesterday at 9:23 PM • 5 replies • view on HN

For a game like anchorhead, which is famous in its niche, shouldn’t Claude already know it sufficiently to just solve it right away? I would expect that its data source contained multiple discussions and walkthroughs of the game.

Replies

zetalyrae • today at 7:31 AM

I expect it's somewhere in the training data, but it's very unlikely to be salient. A few textfiles here and there in the ocean of the Internet is nothing. If Claude had memorized the walkthrough, it would have performed better.

vunderba • today at 2:58 AM

I would think so. I'd be far more interested in a comparison of LLMs (no internet search allowed) playing against IF games released in the past month.

Jweb_Guru • today at 3:38 AM

Yeah, I do not find performances like this very impressive.

IgorPartola • today at 5:51 AM

Honestly I am curious how it would do if it did have a walkthrough.

ratg13 • yesterday at 9:32 PM

It's very likely the model didn't stop to question if the game they were playing was something they knew already, and just assumed it was a puzzle created for it.

➕ show 1 reply

alt Hacker News

Replies