Have you tried anything with https://codeberg.org/ikawrakow/illama / https://github.com/ikawrakow/ik_llama.cpp and their 4-bit quants?
Or maybe even Microsoft's BitNet? https://github.com/microsoft/BitNet
https://github.com/ikawrakow/ik_llama.cpp/pull/337
https://huggingface.co/microsoft/bitnet-b1.58-2B-4T-gguf
That would be an interesting comparison for running local LLMs on such low-end/edge devices, or on common office machines with only an iGPU.
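For reference, something like this is roughly what I'd benchmark with on such a box. It's only a minimal sketch: the GGUF filename and the llama-cli binary/flags are assumptions based on the usual llama.cpp conventions, not verified against either repo, so check the model page and build docs first.

```python
# Rough sketch (assumptions marked): download the BitNet GGUF from Hugging Face
# and run it through an ik_llama.cpp / BitNet build of the llama.cpp CLI.
import subprocess
from huggingface_hub import hf_hub_download

model_path = hf_hub_download(
    repo_id="microsoft/bitnet-b1.58-2B-4T-gguf",
    filename="ggml-model-i2_s.gguf",  # assumed filename, verify on the model page
)

# Standard llama.cpp-style flags: -m model, -p prompt, -n tokens to generate, -t CPU threads
subprocess.run([
    "./llama-cli",                    # assumed path to your ik_llama.cpp (or BitNet) build
    "-m", model_path,
    "-p", "Explain BitNet b1.58 in one sentence.",
    "-n", "128",
    "-t", "4",
], check=True)
```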