dilap, yesterday at 5:55 PM

DeepSeek R1 was a publicly available MoE model that was getting a ton of attention before Llama 4. Llama 4 didn't get much attention because it wasn't good.