You don't necessarily, but each token costs money for the AI to spit out, and likely costs even more when that output gets fed back in as input later. Delegating to a library makes sense financially.
With local inference on the pretty decent models we have nowadays (Qwen-3.5 and better), it's not much of a concern anymore.