Because they distill | alt Hacker News

gregorygoc • yesterday at 7:41 PM • 4 replies • view on HN

Because they distill

Replies

it doesn't matter the reason. This is a race and nobody will care or remember how the winners got there.

Mistral looks like it's fading away to irrelevance unless they can play alongside the similar sized models, or have some unique advantage other than being in Europe, for Europe. I was really excited for them back when they were startup that had the biggest European venture round ever. This space will have a few winners, and many losers. Google, plus either Anthropic or OpenAI most likely. Big models will see breakthroughs in inference performance/cost fall precipitously and small models will only exist on devices (Pixels and iPhones, cars, watches, bluetooth speakers, etc)

➕ show 1 reply

k__ • yesterday at 9:54 PM

Why doesn't Mistral distill?

➕ show 1 reply

losvedir • yesterday at 10:37 PM

I feel like there's an implication here that distillation is a problem but I don't understand what you mean. I thought distillation was generating text from a model and then training another model on it. Is the something unethical in that? You're paying the API costs to generate the tokens, right?

Or I guess more to the point: is this something frontier labs have said is (or tried to paint at any rate) problematic? This feels like an "out of the loop" situation because I've only ever heard "distillation" with a positive connotation before.

opsnooperfax • yesterday at 10:23 PM

I suppose losing with dignity is a consolation.