Remat can produce a performance boost even when everything has a register.
Admittedly, this comes up more often in non-CPU backends.
> Remat can produce a performance boost even when everything has a register.
Can you give an example?
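A rough sketch of one case, assuming the non-CPU backend in question is a GPU, where registers per thread limit how many warps can be resident at once. Nothing spills in either version, but recomputing a few cheap values lowers the per-thread register count and lets more warps hide latency. The hardware constants and the 40 vs. 32 register counts are illustrative assumptions, not measurements, and the model ignores allocation granularity and other occupancy limits.

```python
# Toy occupancy model; all hardware numbers below are illustrative assumptions.
REGS_PER_SM = 65536       # assumed register file size per multiprocessor
THREADS_PER_WARP = 32     # assumed warp width
MAX_WARPS_PER_SM = 64     # assumed scheduler limit on resident warps


def resident_warps(regs_per_thread: int) -> int:
    """Warps that fit in the register file, capped by the scheduler limit."""
    by_registers = REGS_PER_SM // (regs_per_thread * THREADS_PER_WARP)
    return min(by_registers, MAX_WARPS_PER_SM)


# Hypothetical kernel: every value gets its own register, no spills.
no_remat = resident_warps(40)
# Same kernel, but 8 cheap values are recomputed instead of kept live.
with_remat = resident_warps(32)

print(no_remat, with_remat)
```

Whether the extra resident warps pay for the recomputed instructions depends on the kernel, so this only shows why the trade can be worth making, not that it always is.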