
Sesse__ | yesterday at 8:35 PM

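Useful, then, that you can start several vectorized floating-point multiplies each cycle. (E.g., on most modern x86 cores, vmulps is 3/0.5 cycles, i.e. a latency of 3 cycles and a reciprocal throughput of 0.5 cycles, so two can issue per cycle. No 20 cycles in sight.)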
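
To put rough numbers on those figures, here is a back-of-the-envelope sketch. It reads "3/0.5" as 3 cycles of latency and 0.5 cycles of reciprocal throughput, and assumes 8 single-precision lanes per vmulps (256-bit registers). The lane count and the helper name `mul_pipeline_stats` are illustrative assumptions, not something stated in the comment.

```python
def mul_pipeline_stats(latency_cycles, recip_throughput_cycles, lanes):
    """Rough pipeline arithmetic for a fully pipelined multiply instruction.

    latency_cycles: cycles from issue until the result is available
    recip_throughput_cycles: average cycles between issuing independent instructions
    lanes: float elements per vector instruction (assumed, e.g. 8 for 256-bit)
    """
    # Independent instructions that can start per cycle.
    issues_per_cycle = 1 / recip_throughput_cycles
    # Independent multiplies that must be in flight to keep the units busy.
    in_flight_needed = latency_cycles * issues_per_cycle
    # Peak individual float multiplies started per cycle.
    float_muls_per_cycle = issues_per_cycle * lanes
    return issues_per_cycle, in_flight_needed, float_muls_per_cycle


issues, in_flight, float_muls = mul_pipeline_stats(3, 0.5, 8)
print(f"{issues:g} vmulps/cycle, {in_flight:g} in flight, {float_muls:g} float muls/cycle")
```

Under those assumptions, the peak only holds when there are enough independent multiplies to overlap; a chain where each multiply depends on the previous result is limited by the 3-cycle latency instead.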