The question is: if the SOTA models disappear, do these follow-on models have the ability to improve themselves without distillation?