logoalt Hacker News

PlatoIsADiseasetoday at 1:43 PM2 repliesview on HN

Interesting.

I guess RAG is faster? But I'm realizing I'm outdated now.


Replies

lxgrtoday at 1:48 PM

No, RAG is definitely preferable once your memory size grows above a few hundred lines of text (which you can just dump into the context for most current models), since you're no longer fighting context limits and needle-in-a-haystack LLM retrieval performance problems.

show 1 reply
rdedevtoday at 4:42 PM

I think it still has a place of your agent is part of a bigger application that you are running and you want to quickly get something in your models context for a quick turnaround