Well we can agree to disagree, CubeCL also has the concept of instruction parallelism, which would b...

nathanielsimard • last Friday at 10:48 PM • 2 replies • view on HN

Well we can agree to disagree, CubeCL also has the concept of instruction parallelism, which would be used to target simd instructions on CPU. Our algorithms are normally flexible on both the plane size and the line size, adapting to the hardware with comptime logique. You are free to dislike the naming, but imo a mix of multiple APIs is worse than something new.

Replies

gyrovagueGeist • last Friday at 11:18 PM

For people who are interested Kokkos (a C++ library for writing portable kernels) also has a naming scheme for hierarchical parallelism. They use ThreadTeam, Thread (for individual threads within a group), and ThreadVector (for per thread SIMD).

Just commenting to share, personally I have no naming preference but the hierarchal abstractions in general are incredibly useful.

almostgotcaught • last Friday at 10:53 PM

> Our algorithms are normally flexible on both the plane size and the line size

Congrats - I have no idea what this means lol.

➕ show 1 reply

alt Hacker News

Replies