So perhaps this is a regression specifically in the arm64 code, or said differently maybe it’s a performance bug that has been there for a long time but covered up by the scheduler part that was removed?
Could be either of those, or something else entirely. Or even measurement error.
The following messages concluded that using huge pages mitigates the regression, while not using huge pages reproduces it.