logoalt Hacker News

weystromyesterday at 8:22 PM0 repliesview on HN

While I think that local LLMs are the future, i think these setups are insane. You shouldn't be trying to push the SOTA, most people underestimate how much you can get out of small LLMs.

Why ask FABLE 5000 to "summarize this email thread" when a tiny model can do the job?

Sure Codex3000 can oneshot your backlog, but why not use a subsidized subscription to do it for now? We're clearly not at the peak of these model's capabilities yet.