More of a proof of concept to test out ideas, but here's my approach for local RAG, https://github.com/amscotti/local-LLM-with-RAG
Using Ollama for the embeddings with “nomic-embed-text”, with LanceDB for the vector database. Recently updated it to use “agentic” RAG, but probably not fully needed for a small project.
Thank you for being the kind of person who explains what the abbreviation RAG stands for. I have been very confused reading this thread.
Woah. I am doing something very similar also using lancedb https://github.com/nicholaspsmith/lance-context
Mine is much more basic than yours and I just started it a couple of weeks ago.