logoalt Hacker News

Jackson__today at 9:01 PM1 replyview on HN

So they spent all of their R&D to copy deepseek, leaving none for the singular novel added feature: vision.

To quote the hf page:

>Behind vision-first models in multimodal tasks: Mistral Large 3 can lag behind models optimized for vision tasks and use cases.


Replies

Ey7NFZ3P0nzAetoday at 9:12 PM

Well, behind "models" not "langual models".

Of course models purely made for image stuff will completely wipe it out. The vision language models are useful for their generalist capabilities