Yes, when you use the PTX backend it supports Tensor Cores.It has also implementation for flash atte... | alt Hacker News

alt Hacker News

mikepapadim • last Friday at 8:32 AM • 1 reply • view on HN

Yes, when you use the PTX backend it supports Tensor Cores.It has also implementation for flash attention. You can also write your own kernels, have a look here: https://github.com/beehive-lab/GPULlama3.java/blob/main/src/... https://github.com/beehive-lab/GPULlama3.java/blob/main/src/...

Replies

lostmsu • last Friday at 10:12 AM

TornadoVM GitHub has no mentions of tensor cores or WMMA instructions. The only mention of tensor cores is in 2024 and states they are not used: https://github.com/beehive-lab/TornadoVM/discussions/393

➕ show 1 reply