logoalt Hacker News

nathan_comptonyesterday at 8:14 PM1 replyview on HN

Natural to use LM embeddings for this.


Replies

jamiltonyesterday at 9:55 PM

Yeah, convert to embedding, check if it's within a certain distance to an existing embedding and if so store it with that cluster and increment? Then check check further entries against against an average so clusters don't increase their "reach" indefinitely.