There also exists an in-between possibility, that is, if you get 128GB of vram (there are now multip...

GTP • yesterday at 7:48 PM • 1 reply • view on HN

There also exists an in-between possibility, that is, if you get 128GB of vram (there are now multiple options in the market to get that amount with a unified memory architecture) you can run DeepSeek V4 flash at good speed via DwarfStar. I'm not going to spend money on this, but my gut feeling is that this would be the right compromise for a lot of people.

Replies

jonaustin • yesterday at 10:53 PM

I just started using it on an m4 max 128 and it's the first time since buying the machine a year ago that it feels like local llm "just works" for reasonably decent coding.

Use pi though; claude code has way too much bootstrap context; slows everything way down.

alt Hacker News

Replies