It's the hardware more than the software that is the limiting factor at the moment, no? Hardwar...

noelwelsh • last Friday at 6:44 PM • 2 replies • view on HN

It's the hardware more than the software that is the limiting factor at the moment, no? Hardware to run a good LLM locally starts around $2000 (e.g. Strix Halo / AI Max 395) I think a few Strix Halo iterations will make it considerably easier.

Replies

ramesh31 • last Friday at 6:53 PM

>Hardware to run a good LLM locally starts around $2000 (e.g. Strix Halo / AI Max 395) I think a few Strix Halo iterations will make it considerably easier.

And "good" is still questionable. The thing that makes this stuff useful is when it works instantly like magic. Once you find yourself fiddling around with subpar results at slower speeds, essentially all of the value is gone. Local models have come a long way but there is still nothing even close to Claude levels when it comes to coding. I just tried taking the latest Qwen and GLM models for a spin through OpenRouter with Cline recently and they feel roughly on par with Claude 3.0. Benchmarks are one thing, but reality is a completely different story.

colecut • last Friday at 6:47 PM

This is rapidly improving

https://simonwillison.net/2025/Jul/29/space-invaders/

➕ show 1 reply

alt Hacker News

Replies