Hacker News

Knowledge Distillation of Black-Box Large Language Models (2024)

64 points by babelfish yesterday at 10:32 PM | 12 comments

Comments

dmezzetti today at 12:53 AM

Well-Read Students Learn Better: On the Importance of Pre-training Compact Models

Related paper that's a good read: https://arxiv.org/abs/1908.08962

Alifatisk yesterday at 11:02 PM

Why is this published again? Is this a reference to recent events?

linolevan yesterday at 10:58 PM

Can we note that this is a 2024 paper in the title?

TimXare today at 2:59 AM

[dead]

duendefm yesterday at 10:53 PM

The Chinese are really going strong on destroying the American AI economy bubble. Honestly, despite the fact that I'm totally pro-USA and anti-China, I think we should help them crash the American AI bubble. They are controlling everything, and we can't even buy a new computer nowadays while getting no benefit from this. I wish some influential programmers would encourage coders everywhere to skip Claude and ChatGPT subscriptions for Chinese ones, at scale. If we programmers united, we could help this bubble burst, I'm sure.
