logoalt Hacker News

mordaetoday at 11:16 AM2 repliesview on HN

They do not and it sucks for certain tasks.

It also means that if they actually trained with vision, they'd be on par with Anthropic models as vision seems to improve model performance across the board even for non-vision tasks.


Replies

ostitoday at 11:46 AM

Many other open source models have vision but they don't compare to GLM in terms of coding quality. So I don't think it's because of vision that the frontier models are better, it's more that they are probably just much bigger models.

freigeist79today at 2:34 PM

it helps giving them a cli vision tool (curl to openrouter vision model for example)