Hacker News

impure, yesterday at 5:18 PM (2 replies)

I recently built an AI agent and, surprisingly, coding with DeepSeek V4 Flash is quite cheap. That probably has to do with the aggressive prompt caching. I'm using OpenRouter with Novita AI as the preferred provider.


Replies

throwa356262, yesterday at 5:27 PM

DeepSeek V4 directly from DeepSeek themselves is significantly cheaper.

That is because of (1) the Huawei collaboration and (2) vLLM and similar engines not implementing half of the inference optimisations DeepSeek proposed in their paper.

kagamino, yesterday at 5:21 PM

Same here, DeepSeek V4 Flash on opencode go. It's cheap, fast, and good enough to follow my instructions.
