DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning

213 points • by victorbuilds • today at 8:54 AM • 70 comments

Comments

Shouldn’t there be a lot of skepticism here?

All the problems they claim to have solved are on the Internet, and they explicitly say they crawled them. They do not mention doing any benchmark decontamination or excluding 2024/2025 competition problems from training.

IIRC, OpenAI/Google did not have access to the 2025 problems before testing their experimental math models.

victorbuilds • today at 9:28 AM

Notable: they open-sourced the weights under Apache 2.0, unlike OpenAI and DeepMind whose IMO gold models are still proprietary.

yorwba • today at 9:50 AM

Previous discussion: https://news.ycombinator.com/item?id=46072786 (218 points, 3 days ago, 48 comments)

ilmj8426 • today at 9:34 AM

It's impressive to see how fast open-weights models are catching up in specialized domains like math and reasoning. I'm curious if anyone has tested this model for complex logic tasks in coding? Sometimes strong math performance correlates well with debugging or algorithm generation.

simianwords • today at 11:41 AM

It's worth noting that this model is not general purpose, whereas the ones Google and OpenAI used were.

terespuwash • today at 9:55 AM

Why isn’t OpenAI’s gold medal-winning model available to the public yet?

H8crilA • today at 10:45 AM

How do you run this kind of model at home? On a CPU, on a machine that has about 1 TB of RAM?

letmetweakit • today at 11:59 AM

Does anyone know if this will become available on OpenRouter?

sschueller • today at 11:02 AM

How is OpenAI going to be able to serve ads in ChatGPT without everyone immediately jumping ship to another model?
