logoalt Hacker News

adrian_byesterday at 9:54 PM1 replyview on HN

The 397B model can be run at home with the weights stored on an SSD (or on 2 SSDs, for double throughput).

Probably too slow for chat, but usable as a coding assistant.


Replies

xienzeyesterday at 10:05 PM

I think you have that backwards. Agentic coding is way more demanding than simple chat. The request/response loops (tool calling) are much tighter and more numerous, and the context is waaaaay bigger in general.