DeepSeek showed that distillation is possible. Their results are possible without someone else doing the leading-edge training.