Hacker News

NoImmatureAdHom, 01/20/2025

Is there a "base" version of DeepSeek that just does straight next-token prediction, or does that question not make sense given how it's made?

What is the best available "base" next-token predictor these days?


Replies

zamadatix, 01/21/2025

DeepSeek-V3-Base is the literal answer to what you're looking for (on both counts)... but hats off if you actually have the hardware to run it :).
