Hacker News

Jackson__ today at 8:10 AM

Speaking as your local vision nut: their claims about "SOTA" vision are absolute BS in my tests.

Sure, it's SOTA on standard vision benchmarks. But on tasks that require proper image understanding (see, for example, BabyVision [0]), it appears very much lacking compared to Gemini 3 Pro.

[0] https://arxiv.org/html/2601.06521v1


Replies

nostrebored today at 4:23 PM

Gemini remains the only usable vision fm :(