logoalt Hacker News

cat_plus_plustoday at 3:18 AM0 repliesview on HN

That's why I run a local Qwen3-Next model on an NVIDIA Thor dev kit (Apple Silicon and DGX Spark are other options but they are even more expensive for 128GB VRAM)