Shy of an algorithmic breakthrough, open source isn't going to catch up with SOTA. Their main trick for model improvement is distilling the SOTA models, which is why they have perpetually been "right behind".
> their main trick for model improvement is distilling the SOTA models
Could you elaborate? How is this done and what does this mean?
Too bad, so sad for the Mister Krabs secret recipe-pilled labs. Shy of something fundamental changing, it will always be possible to make a distillation that is 98% as good as a frontier model for ~1% of the cost of training the SOTA model. Some technology just wants to be free :)
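To make "distillation" concrete, here's a minimal sketch of the textbook logit-matching form: a small student is trained to match a large teacher's output distribution via a temperature-scaled KL loss. Everything here (the toy `teacher`/`student` linear layers, the random batch, the hyperparameters) is a placeholder for illustration, not any lab's actual pipeline; in practice, when closed-model logits aren't available, "distillation" often just means fine-tuning on text generated by the frontier model.

```python
# Minimal sketch of logit-based knowledge distillation (PyTorch).
# "teacher", "student", and the data are stand-ins, not a real training setup.
import torch
import torch.nn as nn
import torch.nn.functional as F

vocab_size, hidden = 1000, 64
teacher = nn.Linear(hidden, vocab_size)   # stand-in for a large frontier model
student = nn.Linear(hidden, vocab_size)   # stand-in for a much smaller model
optimizer = torch.optim.AdamW(student.parameters(), lr=1e-4)

temperature = 2.0  # softens the teacher's distribution so more signal survives

for step in range(100):
    x = torch.randn(32, hidden)           # placeholder batch of hidden states
    with torch.no_grad():
        teacher_logits = teacher(x)       # teacher is frozen; only queried
    student_logits = student(x)

    # KL divergence between temperature-scaled distributions: the student
    # learns to match the teacher's full output distribution, not just the
    # argmax token, which is where the cheap transfer comes from.
    loss = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * (temperature ** 2)

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
```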
They don't need to catch up. They just need to be good enough and fast as fuck. The vast majority of useful LLM tasks have nothing to do with how smart the model is.
GPT-5 models have been the most useless models of any released this year despite being SOTA, and it's because they're slow as fuck.