Interesting.
I guess RAG is faster? But I'm realizing I'm outdated now.
I think it still has a place of your agent is part of a bigger application that you are running and you want to quickly get something in your models context for a quick turnaround
No, RAG is definitely preferable once your memory size grows above a few hundred lines of text (which you can just dump into the context for most current models), since you're no longer fighting context limits and needle-in-a-haystack LLM retrieval performance problems.