BloondAndDoom, yesterday at 7:59 PM

This is a bit more akin to distill - https://github.com/samuelfaj/distill

The advantage of putting an SLM (small language model) in between is that some outputs cannot be compressed without losing context, so a small model does that job. It works, but most of these solutions still have some tradeoffs in real-world applications.


Replies

thebeas, yesterday at 8:48 PM

[dead]