Honestly, this is pretty much how most of the new models operate nowadays: a base model combined with RL and some product-layer magic.