logoalt Hacker News

madiatortoday at 2:17 PM2 repliesview on HN

Check out OpenThoughts. It has a widely used dataset, a model that beats the deepseek's smaller reasoning models, and a paper that talks in detail about the data curation methodology.

https://www.open-thoughts.ai/


Replies

lambdatoday at 5:31 PM

Oh, neat, I hadn't heard of that.

From the blog, it looks like there hasn't been much progress for a few months, but if you check their HF it looks like they have a series of 32B models trained on top of Qwen3 32B with different numbers of training examples that they've uploaded a few days ago: https://huggingface.co/collections/open-thoughts/openthinker...

So looks a little bit more research oriented than intended for production use, but still neat to see this effort.

yogthostoday at 3:18 PM

neat