logoalt Hacker News

fc417fc802yesterday at 7:43 AM1 replyview on HN

Huh. I thought it wasn't supposed to receive any instructions tailored to the task but I didn't understand it to be restricted from accessing truly general tools such as programming languages. To do otherwise is to require pointless hoop jumping as frontier models inevitably get retrained to play games using a json (or other arbitrary) representation at which point it will be natural for them and the real test will begin.


Replies

frotauryesterday at 1:33 PM

This is my understanding as well, I thought tools where allowed.