logoalt Hacker News

zozbot234yesterday at 7:37 PM0 repliesview on HN

Single user scenarios can also use MTP to make auto-regressive inference more compute-intensive with no loss of quality.