
qeternity • yesterday at 10:58 PM

> DeepSeek and Qwen will function on cheap GPUs that other models will simply choke on.

Uh, DeepSeek will not (unless you are referring to one of their older R1 fine-tuned variants). Any flagship DeepSeek model will require 16x A100/H100+ with NVL in FP8.
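
For anyone wondering where a number like 16 comes from, here is a rough back-of-the-envelope sketch. The figures are my assumptions, not something stated above: roughly 671B total parameters (the published size of DeepSeek-V3/R1), 1 byte per parameter in FP8, 80 GB cards, and a guessed 20% of each card reserved for KV cache and activations. The function names are made up for illustration.

```python
import math


def min_gpus(params_billion: float, bytes_per_param: float,
             gpu_mem_gb: float, reserved_fraction: float = 0.2) -> int:
    """Smallest GPU count whose usable memory holds the weights.

    reserved_fraction is an assumed share of each card kept free for
    KV cache and activations; real deployments will differ.
    """
    # 1e9 params * bytes per param / 1e9 bytes per GB = GB of weights
    weights_gb = params_billion * bytes_per_param
    usable_gb_per_gpu = gpu_mem_gb * (1 - reserved_fraction)
    return math.ceil(weights_gb / usable_gb_per_gpu)


def round_up_to_nodes(gpus: int, gpus_per_node: int = 8) -> int:
    """Round a GPU count up to whole nodes (assuming 8-GPU servers)."""
    return math.ceil(gpus / gpus_per_node) * gpus_per_node


# Assumed inputs: 671B params, FP8 (1 byte each), 80 GB per card.
needed = min_gpus(params_billion=671, bytes_per_param=1.0, gpu_mem_gb=80.0)
print(needed, round_up_to_nodes(needed))
```

Even with nothing reserved, the weights alone exceed the 640 GB of a single 8x 80 GB node, which is why the count lands on two nodes. This only accounts for weight memory; it says nothing about throughput or interconnect.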
