Hacker News

FeepingCreature, today at 12:52 AM

Your kid, it should be noted, has a massively bigger brain than the LLM. I think the surprising thing here maybe isn't that the vision models don't work well in corner cases but that they work at all.

Also, my bet would be that video-capable models are better at this.