Both Opus 4.6 and GPT-5.3 one-shot a Game Boy emulator for me. Guess I need a better benchmark.

gallerdude • yesterday at 8:15 PM • 3 replies

Replies

well_ackshually • yesterday at 10:01 PM

There are hundreds of Game Boy emulators on GitHub that they've been trained on. It's quite literally the simplest piece of emulation you could do. The fact that they couldn't do it before is an indictment of how shit they were, but a Game Boy emulator should be a weekend project for anyone even slightly qualified. Your benchmark was awful to begin with.

gf000 • yesterday at 10:10 PM

Is such an emulator not part of their training data sets?

paxys • yesterday at 8:25 PM

As coding agents get "good enough", the next differentiator will be which one can complete a task in fewer tokens.

