logoalt Hacker News

dartos02/20/20250 repliesview on HN

Hopefully in the next decade we’ll get there.

Vision+language multimodal models seem to solve some of the hard problems.