logoalt Hacker News

emschwartzyesterday at 12:15 PM2 repliesview on HN

Most of the commercial and open source offerings for hybrid search seem to be using BM25 + vector similarity search based on embeddings. The results are combined using Reciprocal Rank Fusion (RRF).

The RRF paper is impressive in how incredibly simple it is (the paper is only 2 pages): https://plg.uwaterloo.ca/~gvcormac/cormacksigir09-rrf.pdf


Replies

softwaredougyesterday at 4:51 PM

A warning that RRF is often not Enough, as it can just drag a good solution down towards the worse solution :)

https://softwaredoug.com/blog/2024/11/03/rrf-is-not-enough

show 1 reply
TeenGirlza17yesterday at 1:10 PM

[flagged]