Hacker News

orbital-decay, last Saturday at 10:22 PM

It really isn't. You can improve by distilling a weaker model.


Replies

anon373839, yesterday at 10:47 AM

Self-distillation is also a technique.