Show the training set, and PROVE that the tasks and answers aren't in there. I don't understand why this is not a default first step for proving that this is creating new knowledge.
Are you claiming that for the open problems they give record-breaking solutions for, there were just answers on the web waiting to be found?
It's Google. Assume the training set contains, as a subset, the entirety of all public digitized information. How would you like to them to share it?
How can you actually verify it, even if they provide something?
Well that's harder than maybe solving well-known open problems (whose soln's are presumably not in training set lol) but it seems that their examples are not clearly breaking sota, especially on matmul