What do you mean that cpu does something different? Isnt cpu doing what is being asked, that being xor with consequence of zeroing when given two same values.
I think OP means that it has come a long way from the simple mental model of µops being a direct execution of operations and with all the register renamings and so on
FTA:
> And, having done that it removes the operation from the execution queue - that is the xor takes zero execution cycles!1 It’s essentially optimised out by the CPU