> task focused small models This is tangential: and forgive my ignorance here, but is there an ...

greyskull • today at 6:37 PM • 1 reply • view on HN

> task focused small models

This is tangential: and forgive my ignorance here, but is there an inherent reason why there aren't smaller, focused models from the frontier model providers?

I'm thinking something like a software-specific subset of Opus that is the default for use in Claude Code. Smaller, cheaper to deploy and consume, maybe faster.

Replies

pavpanchekha • today at 6:54 PM

OpenAI used to make Codex-specific models, but they stopped. What I've gathered from interviews and similar is that training two models isn't worth the (small) lift from having a coding-specific model. You're pre-training on everything anyway, and coding RL is reasonably useful for general-purpose models too.

➕ show 1 reply

alt Hacker News

Replies