logoalt Hacker News

FergusArgylltoday at 3:14 PM4 repliesview on HN

There are 2 things worth separating.

1) China distills and is therefore morally bad.

As you rightly point out, that's not a great argument.

2) China distills and is therefore possibly not that competent.

I think that makes sense. If they only catch up to the frontier through distillation then 1) Their model will never be as good as the model they are distilling from. 2) They will never reach the frontier - they need someone else to do it first.


Replies

_aavaa_today at 3:23 PM

This is literally a repeat of the whole “China only make low quality cheap stuff” argument.

“All they do is copy.”

And now, oops they are world leaders in EVs, batteries, solar, drones, just to name a few on the biggest consumer facing things.

show 1 reply
Lerctoday at 3:26 PM

>2) China distills and is therefore possibly not that competent.

I think deepseek at least has done enough innovative work that you could grant them a baseline of competency.

In general, there are enough papers coming out of China to suggest that there are quite a few people there who know what they are doing.

show 1 reply
xyzsparetimexyztoday at 5:00 PM

Deepseek models are on the Pareto frontier of cost/performance. Thats the far more important one than just making a top scoring model.

surgical_firetoday at 3:30 PM

> China distills and is therefore possibly not that competent.

I heard that argument more than one year ago, when chain of thought and reasoning cycles started to be hudden to protect against distillation.

Meanwhile, models as DeepSeek and MiMo are nothing short of excellent nowadays.

Ever since I switched away from OpenAI to DeepSeek I never felt the need to go back.

show 1 reply