Seems like the industry is moving further toward low-latency, high-speed models for direct interaction, and slow, long-thinking models for longer tasks and deeper reasoning.
Quick/instant LLMs for human use (think UI); slow, deep-thinking LLMs for autonomous agents.
Are they really thinking, or are they just sprinkled with Sleep(x) calls?
You always want faster feedback. If it's not a human leveraging the fast cycles, it's another automated system (e.g., CI).
Slow, deep tasks are mostly for flashy one-shot demos with little to no practical use in the real world.