If a benchmark is affected the model owner will almost certainly tune it, so there will be a game of cat and mouse...
Honestly, wouldn't surprise me if the AI companies try to detect benchmarking. Most hardware companies do...