Hopefully in the next decade we’ll get there. Vision+language multimodal models seem to solve some... | alt Hacker News

alt Hacker News

dartos • 02/20/2025 • 0 replies • view on HN

Hopefully in the next decade we’ll get there.

Vision+language multimodal models seem to solve some of the hard problems.