Efficiency gains can be used to make existing models more profitable, or to make new larger and more intelligent models.
Some yes, others no. Distillation and quantization can't be used to make new base models since they require a preexisting one.
Some yes, others no. Distillation and quantization can't be used to make new base models since they require a preexisting one.