logoalt Hacker News

QubridAItoday at 3:31 PM0 repliesview on HN

Honestly, this is pretty much how most of the new models operate nowadays: a base model combined with RL and some product-layer magic.