logoalt Hacker News

fcanesintoday at 5:11 PM0 repliesview on HN

Yes, DFlash is currently a SOTA speculative decoding method that Xiaomi just used in their MiMo model for >1000tkps