Knowledge Distillation of Black-Box Large Language Models (2024)

64 points • by babelfish • yesterday at 10:32 PM • 12 comments • view on HN

Comments

dmezzetti • today at 12:53 AM

Well-Read Students Learn Better: On the Importance of Pre-training Compact Models

Related paper that's a good read: https://arxiv.org/abs/1908.08962

Alifatisk • yesterday at 11:02 PM

Why is this published again? Is this a reference to recent events?

➕ show 1 reply

linolevan • yesterday at 10:58 PM

Can we note that this is a 2024 paper in the title?

TimXare • today at 2:59 AM

[dead]

duendefm • yesterday at 10:53 PM

The Chinese are really going strong on destroying the American AI economy bubble. Honestly, despite the fact that I'm totally pro USA and anti China, I think we should help them crashing the American AI bubble. They are controlling everything and we can't even buy a new computer nowadays while getting no benefit from this. I wish some influential programmers stimulated coders everywhere to skip Claude and Chatgpt subscriptions for Chinese ones, at scale. If we programmers united we could help this bubble burst, I'm sure.

➕ show 2 replies

alt Hacker News

Knowledge Distillation of Black-Box Large Language Models (2024)

Comments