logoalt Hacker News

CuriouslyCyesterday at 2:37 PM5 repliesview on HN

If all you have is the starting point and the finishing point, the lack of the path taken from one point to another limits your ability to train models that can efficiently recreate the work, and increases its cost enough that it's possible the US labs can progress capabilities faster than Chinese labs can distill that behavior.


Replies

drewdayesterday at 10:08 PM

As of this month, everyone has 100+ pages from Microsoft on how they trained their MAI-Thinking-1 model: https://microsoft.ai/pdf/mai-thinking-1.pdf

OpenAI and Anthropic may have gone silent on how they build their models, but other companies have different incentives.

lumostyesterday at 9:36 PM

This just looks like a capex problem. There is no evidence that Anthropic has secret sauce above and beyond access to capital. If there is secret sauce, it's unclear that it changes the required amount of capital by all that much.

China will spend all of the money required to catch up, Google and OpenAI will both spend money to catch up as well. NVidia and others will not allow a frontier lab to become the AI bottleneck.

sealeckyesterday at 4:24 PM

> lack of the path taken from one point to another limits your ability to train models that can efficiently recreate the work

Isn’t this the problem inference (training) a model is designed to solve :)))

show 1 reply
wahnfriedenyesterday at 4:12 PM

That’s already the case. Chinese ingenuity allowed them to achieve what they did without access to reasoning outputs

nyrikkiyesterday at 7:08 PM

[dead]