Hacker News

kamranjon, yesterday at 11:16 AM

The Hugging Face models are already up and seem to be the original models with the speculative decoding module built in, which is very cool:

Flash: https://huggingface.co/deepseek-ai/DeepSeek-V4-Flash-DSpark

Pro: https://huggingface.co/deepseek-ai/DeepSeek-V4-Pro-DSpark

Excited to see if this makes it into DwarfStar for local inference. I have been using the Flash model extensively since antirez made the 2-bit quants available.


Replies

ilaksh, yesterday at 2:02 PM

Any chance they will have this for Qwen 27B also?
