estearum, today at 11:12 AM

> a departure from Mamba-2, which optimized for training speed.

?


Replies

cubefox, today at 1:43 PM

Yes? Mamba-2 optimized for training speed compared to Mamba-1. Mamba-3 adds optimization for inference. These are pretty much version numbers.