logoalt Hacker News

pflenkeryesterday at 9:23 PM5 repliesview on HN

For a game like anchorhead, which is famous in its niche, shouldn’t Claude already know it sufficiently to just solve it right away? I would expect that its data source contained multiple discussions and walkthroughs of the game.


Replies

zetalyraetoday at 7:31 AM

I expect it's somewhere in the training data, but it's very unlikely to be salient. A few textfiles here and there in the ocean of the Internet is nothing. If Claude had memorized the walkthrough, it would have performed better.

vunderbatoday at 2:58 AM

I would think so. I'd be far more interested in a comparison of LLMs (no internet search allowed) playing against IF games released in the past month.

Jweb_Gurutoday at 3:38 AM

Yeah, I do not find performances like this very impressive.

IgorPartolatoday at 5:51 AM

Honestly I am curious how it would do if it did have a walkthrough.

ratg13yesterday at 9:32 PM

It's very likely the model didn't stop to question if the game they were playing was something they knew already, and just assumed it was a puzzle created for it.

show 1 reply