logoalt Hacker News

Nemotron 3 Ultra: Open Moe Hybrid Mamba-Transformer for Agentic Reasoning [pdf]

14 pointsby victormustartoday at 1:06 PM1 commentview on HN

Comments

throwa356262today at 2:34 PM

Is this the one from Jensens Computex presentation the other day?

It is significantly bigger than Qwen for the same level of intelligence, but I think the key strength was inference speed.