Notable: they open-sourced the weights under Apache 2.0, unlike OpenAI and DeepMind whose IMO gold models are still proprietary.
Previous discussion: https://news.ycombinator.com/item?id=46072786 218 points 3 days ago, 48 comments
It's impressive to see how fast open-weights models are catching up in specialized domains like math and reasoning. I'm curious if anyone has tested this model for complex logic tasks in coding? Sometimes strong math performance correlates well with debugging or algorithm generation.
A bit important that this model is not general purpose whereas the ones Google and OpenAI used were general purpose.
Why isn’t OpenAI’s gold medal-winning model available to the public yet?
How do you run this kind of a model at home? On a CPU on a machine that has about 1TB of RAM?
Does anyone know if this will become available on OpenRouter?
How is OpenAI going to be able to serve ads in chatgpt without everyone immediately jumping ship to another model?
Shouldn’t there be a lot of skepticism here?
All the problems they claim to have solved are on are the Internet and they explicitly say they crawled them. They do not mention doing any benchmark decontamination or excluding 2024/2025 competition problems from training.
IIRC correctly OpenAI/Google did not have access to the 2025 problems before testing their experimental math models.