Hacker News

cubefox, today at 2:24 PM

Some yes, others no. Distillation and quantization can't be used to make new base models since they require a preexisting one.


Replies

irthomasthomas, today at 5:58 PM

It enables models larger than was previously possible.
