logoalt Hacker News

nathanielsimardlast Friday at 10:48 PM2 repliesview on HN

Well we can agree to disagree, CubeCL also has the concept of instruction parallelism, which would be used to target simd instructions on CPU. Our algorithms are normally flexible on both the plane size and the line size, adapting to the hardware with comptime logique. You are free to dislike the naming, but imo a mix of multiple APIs is worse than something new.


Replies

gyrovagueGeistlast Friday at 11:18 PM

For people who are interested Kokkos (a C++ library for writing portable kernels) also has a naming scheme for hierarchical parallelism. They use ThreadTeam, Thread (for individual threads within a group), and ThreadVector (for per thread SIMD).

Just commenting to share, personally I have no naming preference but the hierarchal abstractions in general are incredibly useful.

almostgotcaughtlast Friday at 10:53 PM

> Our algorithms are normally flexible on both the plane size and the line size

Congrats - I have no idea what this means lol.

show 1 reply