
naasking, yesterday at 4:39 PM

I think the idea is to train a small LLM thinking model that can run on edge devices but has very little knowledge embedded in its weights, so it performs a sort of RAG over Encyclopedia Britannica to ground its answers to user queries.
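
To make the "sort of RAG" part concrete, here is a rough sketch of the retrieval step as I imagine it. Everything in it is an assumption for illustration: the three-entry `ARTICLES` dict stands in for an encyclopedia index, the word-overlap scoring stands in for whatever retriever the real system would use, and `retrieve` and `build_prompt` are made-up names. The small model would then answer from the prompt text rather than from its own weights.

```python
import re
from collections import Counter

# Hypothetical stand-in for an encyclopedia index: title -> short passage.
ARTICLES = {
    "Photosynthesis": "Photosynthesis is the process by which green plants "
                      "use sunlight to make food from carbon dioxide and water.",
    "Volcano": "A volcano is a vent in the crust of a planet from which "
               "molten rock and gas erupt.",
    "Honeybee": "The honeybee is a social insect that lives in colonies "
                "and produces honey.",
}


def tokenize(text):
    """Lowercase the text and split it into alphabetic words."""
    return re.findall(r"[a-z]+", text.lower())


def retrieve(query, articles=ARTICLES, k=1):
    """Return up to k article titles ranked by word overlap with the query."""
    query_counts = Counter(tokenize(query))
    scored = []
    for title, body in articles.items():
        doc_counts = Counter(tokenize(title + " " + body))
        # Count how many query words also appear in the article.
        score = sum(min(query_counts[t], doc_counts[t]) for t in query_counts)
        scored.append((score, title))
    # Highest score first; ties broken alphabetically by title.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [title for score, title in scored[:k] if score > 0]


def build_prompt(query, articles=ARTICLES, k=1):
    """Assemble a grounded prompt: retrieved passages first, then the question."""
    titles = retrieve(query, articles, k)
    context = "\n".join(f"[{t}] {articles[t]}" for t in titles)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {query}"
    )


if __name__ == "__main__":
    print(build_prompt("How do plants make food from sunlight?"))
```

A real system would presumably swap the overlap score for embeddings or BM25, but the shape is the same: the knowledge lives in the corpus, and the model only has to read and reason over what was retrieved.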