it enables models larger than was previously possible.
No, because the base model from which the distilled or quantized models are derived is larger.