logoalt Hacker News

Batmobile: 10-20x Faster CUDA Kernels for Equivariant Graph Neural Networks

43 pointsby ipnonlast Sunday at 1:29 AM4 commentsview on HN

Comments

shihabtoday at 1:00 PM

Hi, I just wanted to note that e3nn is more of an academic software that's a bit high-level by design. A better baseline for comparison would be Nvidia's cuEquivariance, which does pretty much the same thing as you did- take e3nn and optimize it for GPU.

As a HPC developer, it breaks my heart how worse academic software performance is compared to vendor libraries (from Intel or Nvidia). We need to start aiming much higher.

show 5 replies