You are glossing over the fact that for RAG you need to search over those 500GB+ which will b...

ricardobeat • last Saturday at 10:03 AM • 2 replies • view on HN

You are glossing over the fact that for RAG you need to search over those 500GB+ which will be painfully slow and CPU-intensive. The goal is fast retrieval to add data to the LLM context. Storage space is not the sole reason to minimize the DB size.

Replies

brookst • last Saturday at 1:45 PM

You’re not searching over 500GB, you’re searching an index of the vectors. That’s the magic of embeddings and vector databases.

Same way you might have a 50TB relational database but “select id, name from people where country=‘uk’ and name like ‘benj%’ might only touch a few MB of storage at most.

➕ show 1 reply

alt Hacker News

Replies