I wonder how much of it is due to the model being familiar with the game or parts of it, be it due t...

wild_pointer • yesterday at 3:18 PM • 1 reply • view on HN

I wonder how much of it is due to the model being familiar with the game or parts of it, be it due to training of the game itself, or reading/watching walkthroughs online.

Replies

andrepd • yesterday at 3:32 PM

There was a well-publicised "Claude plays Pokémon" stream where Claude failed to complete Pokemon Blue in spectacular fashion, despite weeks of trying. I think only a very gullible person would assume that future LLMs didn't specifically bake this into their training, as they do for popular benchmarks or for penguins riding a bike.

➕ show 3 replies

alt Hacker News

Replies