Hacker News
solarkraft
•
yesterday at 4:36 PM
My bad, I took this to be something related to Multi-head Latent Attention (MLA).