Hacker News

dist-epoch, today at 1:03 PM

> control over the model used

But you lose access to the most capable models; you can only run the small ones.


Replies

bel8, today at 4:41 PM

And they run slower and quantized.