Hacker News

runeblaze yesterday at 2:09 AM

Is it, though? There is a reason GPT has Codex variants: RL on a specific task raises performance on that task.


Replies

jjmarr yesterday at 2:30 AM

Post-training doesn't transfer when a new base model arrives, so anyone who adopted a task-specific LLM gets burned when a new generational advance comes out.
