Hacker News

simjnd yesterday at 6:27 PM

I don't think any models are natively INT4? I don't see the point of nerfing a model out of the box.


Replies

zozbot234 yesterday at 6:39 PM

It's not nerfed; it's natively trained at that quantization, a.k.a. quantization-aware training (QAT).
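For anyone wondering what "trained at that quantization" can look like in practice, here is a rough sketch, not any particular model's recipe. It assumes symmetric per-tensor INT4 and a straight-through estimator, which is one common way to do QAT. The names `fake_quant_int4` and `qat_step` are made up for illustration.

```python
def fake_quant_int4(weights):
    """Snap float weights to a symmetric 4-bit grid, then map back to floats."""
    max_abs = max(abs(w) for w in weights)
    if max_abs == 0.0:
        return list(weights)
    # INT4 holds integers in -8..7; dividing by 7 keeps the grid symmetric
    # (assumption: per-tensor scale, no zero point).
    scale = max_abs / 7.0
    quantized = [max(-8, min(7, round(w / scale))) for w in weights]
    return [q * scale for q in quantized]


def qat_step(weights, x, target, lr=0.01):
    """One SGD step on squared error for the toy model y = dot(w_q, x)."""
    # The forward pass only ever sees the INT4-rounded weights.
    w_q = fake_quant_int4(weights)
    pred = sum(w * xi for w, xi in zip(w_q, x))
    err = pred - target
    # Straight-through estimator (assumption): treat rounding as the identity
    # in the backward pass, so the gradient computed at w_q is applied
    # directly to the full-precision "shadow" weights.
    return [w - lr * 2.0 * err * xi for w, xi in zip(weights, x)]
```

The point is that the loss is always computed through the rounded weights, so the optimizer learns values that still work after rounding, instead of rounding a finished full-precision model after the fact.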