logoalt Hacker News

centamivyesterday at 5:27 PM0 repliesview on HN

I tested it myself with 1k documents (about 1.5M vectors) and performance is solid (a few milliseconds per search). I haven't run more aggressive benchmarks yet.

Since it only stores the vectors, the actual size of the Markdown document is irrelevant; you just need to handle the embedding and chunking phases carefully (you can use a parser to extract code snippets).

RAM isn't an issue because I aim for random data access as much as possible. This avoids saturating PHP, since it wasn't exactly built for this kind of workload.

I'm glad you found the article and repo useful! If you use it and run into any problems, feel free to open an issue on GitHub.