Hacker News

dzaima, last Saturday at 4:26 PM

It has single-threaded semantics per element. Which is fine for anything that does completely independent computation for each element, but is quite annoying for everything else, requiring major algorithmic changes. And CPU SIMD is used for a lot of such things.
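
To make the distinction concrete, here is a minimal sketch in plain C++ (not CUDA, and not code from either commenter). The function names `scale_add` and `inclusive_scan` are hypothetical, chosen only for illustration. The first loop body touches only index `i`, so it maps directly onto a one-thread-per-element model. The second carries a running total across elements, which is the kind of computation that would likely need restructuring (for example into a parallel scan) to fit that model.

```cpp
#include <cstddef>
#include <vector>

// Independent per element: out[i] depends only on x[i] and y[i].
// Each iteration could run as its own thread or SIMD lane unchanged.
// Assumes y has at least as many elements as x.
std::vector<float> scale_add(float a, const std::vector<float>& x,
                             const std::vector<float>& y) {
    std::vector<float> out(x.size());
    for (std::size_t i = 0; i < x.size(); ++i) {
        out[i] = a * x[i] + y[i];
    }
    return out;
}

// Dependent across elements: out[i] is the sum of x[0..i].
// The loop-carried variable `running` means iteration i cannot be
// computed in isolation from the iterations before it.
std::vector<int> inclusive_scan(const std::vector<int>& x) {
    std::vector<int> out(x.size());
    int running = 0;
    for (std::size_t i = 0; i < x.size(); ++i) {
        running += x[i];
        out[i] = running;
    }
    return out;
}
```

Both functions are written sequentially here. The point is only which one has a dependency between elements, not how either would be tuned on a particular GPU or CPU.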


Replies

smallmancontrov, last Saturday at 4:33 PM

"Completely independent" except for anything that can be expressed using branches, queues, and locks. Which is everything. Again, are you sure you've tried CUDA? Past, like, the first tutorial?
