logoalt Hacker News

dartoslast Thursday at 3:47 PM0 repliesview on HN

Hopefully in the next decade we’ll get there.

Vision+language multimodal models seem to solve some of the hard problems.