So this is the norm: quantized version of the SOTA model is previous model. Full model becomes latest model. Rinse and repeat.