> In general, quantizing down to 6 bits gives no measurable loss in performance.
...this can't be literally true or no one (including e.g. OpenAI) would use > 6 bits, right?