Hacker News

dist-epoch today at 1:48 PM

What good is an open-weights DeepSeek model if you have nowhere to run it?

OpenAI / Google / Anthropic / xAI also have a ton of compute. That is the real moat.


Replies

eli today at 2:24 PM

It's quite expensive to self-host, but you have many places to run it. OpenRouter alone lists a dozen different providers for DeepSeek V4 Pro: https://openrouter.ai/deepseek/deepseek-v4-pro/providers

So long as there is demand, there are always going to be providers competing to offer it at a low cost. My understanding is that the median price on there is in the ballpark of what it costs to run the inference. This is very different from e.g. Opus, which you can basically only buy from Anthropic at the price they set.

nmfisher today at 2:03 PM

antirez running (quantized) DeepSeek V4 Pro on a Mac Studio M3 Ultra with 512GB of RAM:

https://bsky.app/profile/antirez.bsky.social/post/3mlzwmvlov...

It's much closer than you think. We're going to see specialized hardware in the next 24 months capable of running 2025-era frontier models. That's big.

wolttam today at 2:06 PM

I just got into self-hosting DeepSeek V4 Flash on a single DGX Spark via antirez’s DwarfStar 4 project.

It feels great to finally have access to something local.

amanaplanacanal today at 2:03 PM

That seems pretty temporary if people can just build more compute.