logoalt Hacker News

wolfgangKlast Saturday at 9:24 PM0 repliesview on HN

Indeed, recent Flash Attention is a pain point for non CUDA.