Hacker News
esafak • today at 1:36 PM
No, it is about compressing the KV cache; see How TurboQuant works.