Hacker News

chipgap98, today at 3:30 AM

DeepSeek showed that distillation is possible. Their results are possible without someone else doing the leading-edge training.