Flash-KMeans: Fast and Memory-Efficient Exact K-Means

131 points • by matt_d • last Tuesday at 5:38 AM • 10 comments

Comments

They created this in service of their video generation model, which "clusters and reorders tokens based on semantic similarity using k-means":

http://arxiv.org/pdf/2505.18875
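
For anyone wondering what "clusters and reorders tokens" could look like mechanically, here is a minimal Python sketch. It is only an assumption about the general idea, not code from the linked paper: given one cluster id per token (the `labels` list below is made up, as are the helper names `cluster_order` and `invert`), a stable sort by cluster id yields a permutation that makes same-cluster tokens contiguous, and the inverse permutation restores the original order afterwards.

```python
def cluster_order(labels):
    """Permutation that groups indices by cluster id (stable within a cluster)."""
    return sorted(range(len(labels)), key=lambda i: labels[i])


def invert(perm):
    """Inverse permutation: inv[old_position] = new_position."""
    inv = [0] * len(perm)
    for new_pos, old_pos in enumerate(perm):
        inv[old_pos] = new_pos
    return inv


# Hypothetical data: six tokens and the cluster id k-means assigned to each.
tokens = ["t0", "t1", "t2", "t3", "t4", "t5"]
labels = [2, 0, 2, 1, 0, 1]

perm = cluster_order(labels)
reordered = [tokens[i] for i in perm]            # same-cluster tokens are adjacent
restored = [reordered[j] for j in invert(perm)]  # back to the original order

print(perm)       # [1, 4, 3, 5, 0, 2]
print(reordered)  # ['t1', 't4', 't3', 't5', 't0', 't2']
print(restored == tokens)  # True
```

The stable sort keeps tokens within a cluster in their original relative order, which is a design choice of this sketch rather than anything the quote specifies.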

jacquesm • today at 3:29 PM

Nice one. K-Means is one of those neat little power tools that, once you get the hang of it, you find more and more applications for, but it can be a bit slow for larger data sets. So this is very nice to have. Thank you, matt_d, for posting.

leecarraher • today at 3:06 PM

Do they mean deterministic k-means, k-means++ ... ? Globally optimal k-means is NP-hard, so linear speedups aren't terribly helpful: they're nice until you add more input. Standard k-means would be nice, or the k-means++ seeding algorithm.
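
To make the distinction concrete, here is a minimal pure-Python sketch of the two pieces named above: k-means++ seeding, followed by standard (Lloyd's) k-means iterations. It is an illustration under my own assumptions, not the Flash-KMeans implementation; the function names (`kmeanspp_seed`, `lloyd`) and the toy data are hypothetical. Lloyd's algorithm only finds a local optimum, which is a different problem from the NP-hard global one.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length tuples."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def assign(points, centers):
    """Index of the nearest center for every point."""
    return [min(range(len(centers)), key=lambda j: sq_dist(p, centers[j]))
            for p in points]


def kmeanspp_seed(points, k, rng):
    """k-means++ seeding: each new center is drawn with probability
    proportional to its squared distance from the nearest chosen center."""
    centers = [rng.choice(points)]
    d2 = [sq_dist(p, centers[0]) for p in points]
    while len(centers) < k:
        if sum(d2) == 0:  # every point coincides with an existing center
            centers.append(rng.choice(points))
        else:
            idx = rng.choices(range(len(points)), weights=d2, k=1)[0]
            centers.append(points[idx])
        d2 = [min(old, sq_dist(p, centers[-1])) for old, p in zip(d2, points)]
    return centers


def lloyd(points, centers, max_iter=100):
    """Standard k-means: alternate assignment and mean update until stable."""
    for _ in range(max_iter):
        labels = assign(points, centers)
        new_centers = []
        for j, c in enumerate(centers):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                new_centers.append(tuple(
                    sum(m[d] for m in members) / len(members)
                    for d in range(len(c))))
            else:
                new_centers.append(c)  # keep the center of an empty cluster
        if new_centers == centers:
            break
        centers = new_centers
    return centers, assign(points, centers)


# Toy data: two well-separated groups of 2-D points.
points = [(0.0, 0.0), (0.0, 1.0), (1.0, 0.0),
          (10.0, 10.0), (10.0, 11.0), (11.0, 10.0)]
rng = random.Random(0)
centers, labels = lloyd(points, kmeanspp_seed(points, 2, rng))
print(centers, labels)
```

Every assignment step here scans all points against all centers, which is the part a GPU kernel would want to speed up; the seeding only changes where the iterations start.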

wood_spirit • today at 11:28 AM

Does this have corresponding speedups or memory gains for normal CPUs too? Just thinking about all the cups of coffee that have been made and drunk while scikit-learn k-means chugs through a notebook :)

matrix2596 • today at 11:25 AM

looks like flash attention concepts applied to kmeans, nice speedup results
