logoalt Hacker News

pavpanchekhayesterday at 6:54 PM1 replyview on HN

OpenAI used to make Codex-specific models, but they stopped. What I've gathered from interviews and similar is that training two models isn't worth the (small) lift from having a coding-specific model. You're pre-training on everything anyway, and coding RL is reasonably useful for general-purpose models too.


Replies

greyskullyesterday at 7:13 PM

Interesting. I'd have guessed there would be meaningful opex benefits to serving smaller models.

show 2 replies