I would love Andrej's take on the fast models we got this year. Gemini 3 Flash and Grok 4 Fast have no business being as good + cheap + fast as they are. For Andrej's prediction about LLMs communicating with us via a visual interface, we're going to need fast models, but I feel like AI twitter/HN has mostly ignored them.
Just guessing here, but these fast models may essentially be distillations of larger ones, and that could be where their power comes from: use a large model to generate synthetic reasoning traces (or soft output distributions), then train a small model on those.
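To make the distillation idea concrete, here's a toy sketch. Nothing here is LLM-specific: a fixed linear "teacher" stands in for the large model, its soft outputs stand in for reasoning traces or answer distributions, and a "student" of the same shape (imagine far fewer parameters in the real setting) is trained to match them. All names and numbers are made up for illustration.

```python
import math
import random

random.seed(0)

# Toy "teacher": a fixed 2-feature linear classifier whose soft outputs
# play the role of a large model's answer distribution.
TEACHER_W = [2.0, -1.5]

def sigmoid(z):
    return 1.0 / (1.0 + math.exp(-z))

def teacher_prob(x):
    return sigmoid(TEACHER_W[0] * x[0] + TEACHER_W[1] * x[1])

# Step 1: use the teacher to label a synthetic training set
# (the analogue of generating synthetic reasoning traces).
inputs = [[random.uniform(-1, 1), random.uniform(-1, 1)] for _ in range(500)]
synthetic = [(x, teacher_prob(x)) for x in inputs]

# Step 2: train the student to match the teacher's soft outputs
# with plain SGD on the cross-entropy between the two distributions.
w = [0.0, 0.0]
lr = 0.5
for _ in range(200):
    for x, p in synthetic:
        q = sigmoid(w[0] * x[0] + w[1] * x[1])
        g = q - p  # gradient of cross-entropy w.r.t. the logit
        w[0] -= lr * g * x[0]
        w[1] -= lr * g * x[1]

# Step 3: check agreement with the teacher on held-out points.
held_out = [[random.uniform(-1, 1), random.uniform(-1, 1)] for _ in range(100)]
agree = sum(
    (teacher_prob(x) > 0.5) == (sigmoid(w[0] * x[0] + w[1] * x[1]) > 0.5)
    for x in held_out
)
print(f"student/teacher agreement: {agree}/100")
```

The student ends up agreeing with the teacher on nearly all held-out points, which is the whole trick: matching soft targets transfers the teacher's decision behavior far more efficiently than learning from raw hard labels alone.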