Hacker News

nurettin, today at 9:06 AM

I also run a Qwen 3.6 MoE A4B on old hardware. I set it up with

numactl --membind=1

so its memory allocations are constrained to a single NUMA node (node 1), which speeds up token generation a little.
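
For anyone who wants to try this, here is a rough sketch of a full invocation. Only `--membind=1` comes from my setup above; the `--cpunodebind` flag, the `RUN` switch and the `./run-model` launcher name are assumptions for illustration, so swap in whatever command starts your model. Run `numactl --hardware` first to see which nodes your machine actually has.

```shell
#!/bin/sh
# Sketch: pin both the threads and the memory of an inference process
# to one NUMA node.
# Assumptions (not from the comment above): node 1 exists on this machine,
# and ./run-model is a hypothetical placeholder for the real launcher.
NODE=1
MODEL_CMD="${MODEL_CMD:-./run-model}"

# --membind keeps allocations on the chosen node's memory.
# --cpunodebind keeps the threads on that node's cores, so memory reads
# stay local instead of crossing to the other node.
CMD="numactl --cpunodebind=$NODE --membind=$NODE $MODEL_CMD"

# Print the command by default; set RUN=1 to actually execute it.
if [ "${RUN:-0}" = "1" ]; then
    exec $CMD
else
    echo "$CMD"
fi
```

Binding the CPUs as well is optional. With `--membind` alone the scheduler can still place threads on the other node, in which case every read goes remote, so it is worth comparing tokens per second both ways.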