And how do you prove that the proof of correctness is not just a proof that 1=1? LLMs "cheating" on things is rather common.