Hacker News

KoolKat23 · yesterday at 10:30 PM

I struggle to see the incentive to do this; I have similar thoughts about locally run models. The only use cases I can imagine are small jobs at scale (perhaps something like autocomplete integrated into your deployed application), or extreme privacy, honouring NDAs, etc.

Otherwise, if it's a short prompt or answer, a SOTA (state-of-the-art) model will be cheap anyway; and if it's a long prompt/answer, a weaker model is far more likely to be wrong, and a lot more time/human cost is spent on checking/debugging any issue or hallucination, so again SOTA is better.


Replies

lukan · yesterday at 11:03 PM

"or for extreme privacy"

Or for any privacy/IP protection at all? There is zero privacy when using cloud-based LLMs.
