I'm not really sure why your Neo is performing better than the M4.
The M4 and the Neo share the same CPU architecture, but the M4 has 4 performance cores at 4.4 GHz, while the Neo has 2 performance cores at 4 GHz.
The Neo also has no CPU heatsink, so it thermal-throttles after only a few seconds:
https://cdn.arstechnica.net/wp-content/uploads/2026/03/MacBo...
Yes, all the M-series chips have more cores, they often have better thermal management, and they have more memory bandwidth. (The Neo still has crazy high bandwidth.) But for a single-threaded, strictly compute-bound task that runs in 10 seconds, it outperforms the M4 cores. I don't know why; I'm just sharing my experience.
The actual code I am using for this is: