This sounds very appealing. What size Mac mini would I need for that?
A PC with an nvidia card with 16gb vram works just fine for Qwen MoE models, and these have worked great as a daily driver for me.
Good summary blog: https://maloyan.xyz/blog/running-qwen-locally-mac-mini-m4
I am curious if you implicitly assumed they are Macs or if that's what you are looking for specifically?
Personally, I would always max out the RAM you can fit into your budget. You might get lower bandwidth (= slower generation) than you do on a Mac if you choose a Strix Halo or DGX Spark, but there are always new tweaks being discovered to speed things up. That being said, with 32GB you should be able to fit an ok quant of 35B-A3B or 27B with some context, with 64GB you should be golden.