Hacker News

cphoover, today at 5:02 PM

A 5-10% change in accuracy is like the difference between a usable model and an unusable one.


Replies

samwho, today at 5:22 PM

Definitely could be, but in the time I spent talking to the 4-bit models, compared with the 16-bit original, they still seemed surprisingly capable. I do recommend benchmarking quantized models on the specific tasks you care about.
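
For what it's worth, a task-specific check along those lines could be as small as the sketch below. Everything in it is an assumption for illustration: `Model` is a hypothetical stand-in for whatever function sends a prompt to your model and returns its answer, and exact-match scoring is only one possible metric, so swap in whatever fits your task.

```python
from typing import Callable, Dict, Iterable, List, Tuple

# Hypothetical interface: any function that maps a prompt to an answer string.
Model = Callable[[str], str]
Case = Tuple[str, str]  # (prompt, expected answer)


def accuracy(model: Model, cases: Iterable[Case]) -> float:
    """Fraction of cases where the answer matches exactly (case-insensitive)."""
    case_list: List[Case] = list(cases)
    if not case_list:
        raise ValueError("need at least one test case")
    correct = sum(
        model(prompt).strip().lower() == expected.strip().lower()
        for prompt, expected in case_list
    )
    return correct / len(case_list)


def compare(baseline: Model, quantized: Model, cases: Iterable[Case]) -> Dict[str, float]:
    """Score both models on the same cases and report the accuracy drop."""
    case_list = list(cases)
    base = accuracy(baseline, case_list)
    quant = accuracy(quantized, case_list)
    return {"baseline": base, "quantized": quant, "drop": base - quant}


if __name__ == "__main__":
    # Toy stand-ins for real models, used only to show the call shape.
    answers = {"2+2": "4", "capital of France": "Paris"}
    full = lambda prompt: answers.get(prompt, "")
    small = lambda prompt: "4" if prompt == "2+2" else "unsure"
    print(compare(full, small, answers.items()))
```

The point is to run both the original and the quantized model over the same prompts from your own workload, so the accuracy gap you see reflects your task rather than a generic benchmark.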

amelius, today at 7:36 PM

Yes, I was wondering why they mentioned those numbers without mentioning their practical significance.