Hacker News

pmaze, last Monday at 4:46 PM, 2 replies

https://hnbooks.pieterma.es

I scraped HN's 1000 most-mentioned books and visualised them. This month I used a new embedding model (Nomic), switched out UMAP for PaCMAP, and added automatic cluster labelling.

The clustering and dimensionality reduction aren't quite as stable as I'd like, but most seeds give decent results now.
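One way to put a number on "most seeds give decent results" is to rerun the clustering with several seeds and check how often pairs of books land in the same cluster each time. The sketch below is only an illustration under assumptions: it is not the site's pipeline, it uses a tiny hand-rolled k-means as a stand-in because the actual clustering method isn't stated here, and the toy blobs, `seed_stability`, and `rand_index` are hypothetical names and data chosen for the example.

```python
import random
from itertools import combinations


def kmeans(points, k, seed, iters=20):
    """Tiny k-means on 2-D points; returns one cluster label per point."""
    rng = random.Random(seed)
    centers = rng.sample(points, k)  # seed-dependent initialisation
    labels = [0] * len(points)
    for _ in range(iters):
        # Assign each point to its nearest center (squared Euclidean distance).
        labels = [
            min(
                range(k),
                key=lambda c: (p[0] - centers[c][0]) ** 2 + (p[1] - centers[c][1]) ** 2,
            )
            for p in points
        ]
        # Move each center to the mean of its members (leave it if empty).
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = (
                    sum(m[0] for m in members) / len(members),
                    sum(m[1] for m in members) / len(members),
                )
    return labels


def rand_index(a, b):
    """Fraction of point pairs that two labelings treat the same way."""
    pairs = list(combinations(range(len(a)), 2))
    agree = sum((a[i] == a[j]) == (b[i] == b[j]) for i, j in pairs)
    return agree / len(pairs)


def seed_stability(points, k, seeds):
    """Mean pairwise Rand index across runs that differ only in seed."""
    runs = [kmeans(points, k, s) for s in seeds]
    scores = [rand_index(x, y) for x, y in combinations(runs, 2)]
    return sum(scores) / len(scores)


# Toy data (assumption): three blobs standing in for 2-D book embeddings.
_rng = random.Random(0)
points = [
    (cx + _rng.gauss(0, 0.3), cy + _rng.gauss(0, 0.3))
    for cx, cy in [(0, 0), (5, 5), (0, 5)]
    for _ in range(30)
]

stability = seed_stability(points, k=3, seeds=range(5))
print(f"mean pairwise Rand index: {stability:.3f}")
```

A score near 1 means the seeds agree on which books belong together; the Rand index ignores how clusters are numbered, so relabelled but otherwise identical runs still count as stable.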


Replies

agcat, last Tuesday at 5:09 AM

Love it! Thanks for building this, was looking for book recos.

iforaa, last Tuesday at 12:57 PM

This is awesome! Thank you for this project.