logoalt Hacker News

nozzlegeartoday at 7:05 PM2 repliesview on HN

> I have been moving more and more to K2.7 Code and GLM-5.2 the last few weeks. They are often good enough for assistance, very fast, and cheap.

I've moved completely to local models that I run with my M1 Mac Studio (64gb ram) some time ago. But for the rare times when I feel the local, quantized Qwen3.6 isn't enough, I just connect to Openrouter and use something like Kimi, GLM or Deepseek for a fraction of the price of Anthropic et al.


Replies

plasticsopranotoday at 8:07 PM

Which quant do you use? I have a similar setup and the speed is atrocious at 4-bit.

show 1 reply
kamranjontoday at 7:22 PM

This is the way

show 1 reply