Maybe the best "index" will just be markdown files fed directly into a tiny LLM.
Is anyone using small, low-latency LLMs to implement features like search as an alternative to RAG? It could be the perfect use case for that Llama 3 8B ASIC some company showed off a few months ago.
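For what it's worth, here's a minimal sketch of the "no index" idea: instead of chunking and embedding documents for retrieval, just stuff the raw markdown into a small model's context and ask it to answer directly. Everything here is hypothetical illustration; `query_llm` is a stand-in for whatever runtime or endpoint serves the model, not a real API.

```python
from pathlib import Path

# Rough character budget standing in for a small model's context window.
CONTEXT_BUDGET_CHARS = 24_000

def build_prompt(doc_dir: str, query: str) -> str:
    """Concatenate markdown files (up to a rough context budget) plus the query."""
    parts, used = [], 0
    for path in sorted(Path(doc_dir).glob("**/*.md")):
        text = path.read_text(encoding="utf-8")
        if used + len(text) > CONTEXT_BUDGET_CHARS:
            break  # a real system would rank or rotate docs instead of just stopping
        parts.append(f"## {path.name}\n{text}")
        used += len(text)
    docs = "\n\n".join(parts)
    return (
        "You are a search assistant. Using only the documents below, "
        "answer the query and cite the source file names.\n\n"
        f"{docs}\n\nQuery: {query}\n"
    )

# query_llm(prompt) would be the call into your small-model runtime
# (llama.cpp, a hosted endpoint, an ASIC, etc.) -- placeholder, not a real API.
```

The appeal is that the whole "index" is just the files on disk; the obvious limits are the context window and per-query cost, which is exactly where cheap, fast inference hardware would change the math.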