If all you have is the starting point and the finishing point, the lack of the path taken from one p...

CuriouslyC • yesterday at 2:37 PM • 5 replies • view on HN

If all you have is the starting point and the finishing point, the lack of the path taken from one point to another limits your ability to train models that can efficiently recreate the work, and increases its cost enough that it's possible the US labs can progress capabilities faster than Chinese labs can distill that behavior.

Replies

drewda • yesterday at 10:08 PM

As of this month, everyone has 100+ pages from Microsoft on how they trained their MAI-Thinking-1 model: https://microsoft.ai/pdf/mai-thinking-1.pdf

OpenAI and Anthropic may have gone silent on how they build their models, but other companies have different incentives.

lumost • yesterday at 9:36 PM

This just looks like a capex problem. There is no evidence that Anthropic has secret sauce above and beyond access to capital. If there is secret sauce, it's unclear that it changes the required amount of capital by all that much.

China will spend all of the money required to catch up, Google and OpenAI will both spend money to catch up as well. NVidia and others will not allow a frontier lab to become the AI bottleneck.

sealeck • yesterday at 4:24 PM

> lack of the path taken from one point to another limits your ability to train models that can efficiently recreate the work

Isn’t this the problem inference (training) a model is designed to solve :)))

➕ show 1 reply

wahnfrieden • yesterday at 4:12 PM

That’s already the case. Chinese ingenuity allowed them to achieve what they did without access to reasoning outputs

nyrikki • yesterday at 7:08 PM

[dead]

alt Hacker News

Replies