I don’t know if this counts as tiny but I use llama 3B in prod for summarization (kinda). Its effe...

kianN • 01/22/2025 • 0 replies • view on HN

I don’t know if this counts as tiny but I use llama 3B in prod for summarization (kinda).

Its effective context window is pretty small but I have a much more robust statistical model that handles thematic extraction. The llm is essentially just rewriting ~5-10 sentences into a single paragraph.

I’ve found the less you need the language model to actually do, the less the size/quality of the model actually matters.

alt Hacker News