Hacker News
wolfgangK
•
last Saturday at 9:24 PM
Indeed, recent versions of Flash Attention are a pain point on non-CUDA platforms.