alt
Hacker News
cubefox
•
today at 6:07 PM
•
0 replies
•
view on HN
No because the base model from which the distilled or quantized models are derived is larger.