logoalt Hacker News

mangolietoday at 11:57 AM3 repliesview on HN

https://x.com/deepseek_ai/status/1995452646459858977

Boom


Replies

andy12_today at 1:31 PM

Do note that that is a different model. The one we are talking about here, DeepSeekMath-V2, is indeed overcooked with math RL. It's so eager to solve math problems, that it even comes up with random ones if you prompt it with "Hello".

https://x.com/AlpinDale/status/1994324943559852326?s=20

simianwordstoday at 12:02 PM

Oh you may be correct. Are these models general purpose or fine tuned for mathematics?