Der_Einzige, yesterday at 2:54 PM

You clearly didn't read the recent speculative decoding papers, because it's been possible to use any model to speculate for any other model for a while. They solved the tokenization problems that prevented this in the past.