logoalt Hacker News

andhumantoday at 3:32 PM1 replyview on HN

This is big. The first really big open weights model that understands images.


Replies

yoavmtoday at 3:47 PM

How is this different from Llama 3.2 "vision capabilities"?

https://www.llama.com/docs/how-to-guides/vision-capabilities...

show 1 reply