Qwen3-Coder-Next works well on my 128GB Framework Desktop. It seems better at coding Python than Qwe...

cpburns2009 • yesterday at 3:16 PM • 2 replies • view on HN

Qwen3-Coder-Next works well on my 128GB Framework Desktop. It seems better at coding Python than Qwen3.5 35B-A3B, and it's not too much slower (43 tg/s compared to 55 tg/s at Q4).

27B is supposed to be really good but it's so slow I gave up on it (11-12 tg/s at Q4).

Replies

vlowther • yesterday at 8:33 PM

The 8 bit MLX unsloth quant of qwen3-coder-next seems to be a local best on an MBB M5 Max with 128GB memory. With oMLX doing prompt caching I can run two in parallel doing different tasks pretty reasonably. I found that lower quants tend to lose the plot after about 170k tokens in context.

➕ show 1 reply

UncleOxidant • yesterday at 6:20 PM

Agreed. Qwen3-coder-next seems like the sweetspot model on my 128GB Framework Desktop. I seem to get better coding results from it vs 27b in addition to it running faster.

alt Hacker News

Replies