
irthomasthomas, today at 5:58 PM

It enables models larger than was previously possible.


Replies

cubefox, today at 6:07 PM

No, because the base model from which the distilled or quantized models are derived is larger.