logoalt Hacker News

bigyabaiyesterday at 9:33 PM0 repliesview on HN

>90% of inference hardware is faster if you run an MOE model.