Results are real, but the setup is doing a lot of work. Every win here (scheduling, kernels, chip design) is in a domain with well-defined automated metrics and years of prior optimization. That's the ideal case for evolutionary search. The question isn't whether it works at Google, it's how much of the gain comes from the agent vs. the evaluation infrastructure wrapped around it.