For the simplest cases it will be about the same speed as AVX2, but if you're trying to do anything fancy, the extra registers and instructions are a godsend.
Well, try it out for a realistic program.
It makes for nice-looking code, yes. But it is often slower (for various reasons that are well understood by now).