simjnd • yesterday at 6:27 PM • 1 reply

I don't think any models are natively INT4? I don't see the point of nerfing the model out of the box.

zozbot234 • yesterday at 6:39 PM

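It's not nerfed; it's trained natively at that quantization, a.k.a. quantization-aware training (QAT). The model sees the INT4 rounding during training and learns weights that work well under it, instead of being trained at higher precision and quantized afterwards.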
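
For anyone wondering what that looks like mechanically, here is a rough sketch of the usual idea, not any particular model's recipe. The function names (`fake_quant_int4`, `ste_grad`, `qat_step`), the single per-tensor scale, and the toy linear-regression loss are all assumptions made for illustration; real setups typically use per-group scales and a training framework. The forward pass rounds the weights to a 16-level grid, and the backward pass uses a straight-through estimator so the float "shadow" weights still get updated.

```python
import numpy as np

def fake_quant_int4(w):
    """Round w onto a symmetric INT4 grid (integers in [-8, 7]) and map back to float."""
    max_abs = float(np.max(np.abs(w)))
    scale = max_abs / 7.0 if max_abs > 0 else 1.0  # one scale per tensor (assumption)
    q = np.clip(np.round(w / scale), -8, 7)
    return q * scale, q.astype(np.int8), scale

def ste_grad(grad_out, w, scale):
    """Straight-through estimator: treat rounding as identity inside the clip range."""
    inside = (w / scale >= -8) & (w / scale <= 7)
    return grad_out * inside

def qat_step(w, x, y, lr=0.01):
    """One gradient step on mean squared error, with the forward pass using quantized weights."""
    w_q, _, scale = fake_quant_int4(w)
    err = x @ w_q - y                      # the loss sees the INT4-rounded weights
    grad_wq = 2.0 * x.T @ err / len(x)     # gradient w.r.t. the quantized weights
    grad_w = ste_grad(grad_wq, w, scale)   # passed straight through to the float weights
    return w - lr * grad_w
```

After training you keep only the integer codes and the scale, so the shipped INT4 weights are the ones the loss was actually optimized against.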