Hacker News

cubefox, today at 6:07 PM

No, because the base model from which the distilled or quantized models are derived is larger.