I found this a weird article.
If you wish to see some speedups using AVX512, without limiting yourself to C or C++, you might want to try ISPC (https://ispc.github.io/index.html).
You'll get sane aliasing rules from the perspective of performance, multi-target binaries with dynamic dispatching and a lot more control over the code generated.
Ispc looks interesting. Does it work with amd? They hint on gpu’s , i guess mostly intel ones?
ispc is something that deserves to be much more widely known about- it does an excellent job of bringing the cuda programming model to cpus