Hacker News

Cieric, yesterday at 7:52 PM

I'm not even sure a 32-wide array would be good either, since on AMD warps are 64 wide. I wouldn't go fully towards auto-vectorization, though.


Replies

zozbot234, yesterday at 7:55 PM

Warp SIMD-width should be a build-time constant. You'd be using a variable-length vector-like interface that gets compiled down to a specified length as part of building the code.
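A rough sketch of what that could look like, in plain CPU-side C++ rather than any real GPU API. Everything here is hypothetical and only for illustration: the `WARP_WIDTH` build flag, the `Lanes` type, and the `lane_ids` and `reduce_add` helpers. The point is only that code is written against a vector-like interface, and the width is pinned once at build time (e.g. `-DWARP_WIDTH=64`).

```cpp
#include <array>
#include <cassert>
#include <cstddef>

// Hypothetical build-time knob: compile with -DWARP_WIDTH=64 for a 64-wide target.
#ifndef WARP_WIDTH
#define WARP_WIDTH 32
#endif

constexpr std::size_t kWarpWidth = WARP_WIDTH;
static_assert(kWarpWidth > 0 && (kWarpWidth & (kWarpWidth - 1)) == 0,
              "warp width is assumed to be a power of two");

// Vector-like value whose length is fixed at compile time.
// User code never hard-codes 32 or 64; it only talks to Lanes<T>.
template <typename T, std::size_t N = kWarpWidth>
struct Lanes {
    std::array<T, N> v{};

    static constexpr std::size_t size() { return N; }

    T& operator[](std::size_t i) { assert(i < N); return v[i]; }
    const T& operator[](std::size_t i) const { assert(i < N); return v[i]; }
};

// Elementwise add; the loop bound is a constant, so the compiler
// is free to unroll or vectorize it for the chosen width.
template <typename T, std::size_t N>
Lanes<T, N> operator+(const Lanes<T, N>& a, const Lanes<T, N>& b) {
    Lanes<T, N> out;
    for (std::size_t i = 0; i < N; ++i) out.v[i] = a.v[i] + b.v[i];
    return out;
}

// Fill each lane with its own index (0, 1, ..., N-1).
template <typename T, std::size_t N = kWarpWidth>
Lanes<T, N> lane_ids() {
    Lanes<T, N> out;
    for (std::size_t i = 0; i < N; ++i) out.v[i] = static_cast<T>(i);
    return out;
}

// Horizontal sum across all lanes.
template <typename T, std::size_t N>
T reduce_add(const Lanes<T, N>& a) {
    T acc{};
    for (std::size_t i = 0; i < N; ++i) acc += a.v[i];
    return acc;
}
```

Calling `lane_ids<int>()` picks up whatever width the build selected, while `lane_ids<int, 64>()` forces a width explicitly, which is handy for testing both configurations from one binary.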
