logoalt Hacker News

throwa356262yesterday at 5:27 PM0 repliesview on HN

Deepseek v4 via deepseek themselves is significantly cheaper.

Because (1) Huawei collab and (2) vLLM etc dont implement half of the inference optimisations deepseek proposed in their paper.