logoalt Hacker News

kbrissotoday at 6:39 PM1 replyview on HN

I built this for local RAG https://github.com/kbrisso/byte-vision it uses llama.cpp and Elasticsearch. On a laptop with 8 GB GPU it can handle a 30K token size and summarize a fairly large PDF.


Replies

busssardtoday at 7:03 PM

elasticsearch is the true limitation of rag systems...

show 1 reply