logoalt Hacker News

omneitylast Sunday at 8:46 PM1 replyview on HN

Mistral Large 3 is reportedly using Deepseek V3.2 architecture with larger experts and fewer of them, and a 2B params vision module.


Replies

sworeslast Sunday at 9:22 PM

According to whom?

I haven't seen any claims of that being the case (other than you), just that there are similar decisions made by both of them.

https://mistral.ai/news/mistral-3