Hacker News

deaux · yesterday at 6:01 PM · 1 reply

> If you just do a tiny amount of tok/day and can wait for the answer to be computed overnight or so

But they can't? The usage pattern is the polar opposite: most people running these models locally just ask it a few questions throughout the day. They want the answers now, or at least within a minute.


Replies

zozbot234 · yesterday at 6:49 PM

If you want the answer right now, that alone ups your compute needs to the point where you're probably better off just using a free hosted AI service, unless the prompt is trivial enough that a tiny local model can answer it quickly.
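
To make that tradeoff concrete, here is a back-of-envelope sketch (every number below is an illustrative assumption, not from the thread): an interactive "answer within a minute" target demands a far higher sustained decode rate than queuing the same questions for an overnight batch.

    # Back-of-envelope: sustained tok/s needed for interactive use vs. an
    # overnight batch. All constants are illustrative assumptions.
    INTERACTIVE_RESPONSE_TOKENS = 500   # assumed typical answer length
    INTERACTIVE_LATENCY_BUDGET_S = 60   # "within a minute", per the parent

    OVERNIGHT_TOKENS = 20 * 500         # assumed ~20 queued answers per day
    OVERNIGHT_WINDOW_S = 8 * 3600       # assumed 8-hour overnight window

    interactive_rate = INTERACTIVE_RESPONSE_TOKENS / INTERACTIVE_LATENCY_BUDGET_S
    batch_rate = OVERNIGHT_TOKENS / OVERNIGHT_WINDOW_S

    print(f"interactive: ~{interactive_rate:.1f} tok/s sustained")  # ~8.3
    print(f"overnight:   ~{batch_rate:.2f} tok/s sustained")        # ~0.35
    print(f"ratio: ~{interactive_rate / batch_rate:.0f}x")          # ~24x

Under these assumed numbers, interactive use needs roughly 24x the sustained decode throughput of the overnight pattern, which is the gap that pushes people toward hosted services.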