Better performance than TQ and better quality than FP16?
Am I reading this right??
Why isn't this a PR for vLLM?