Hacker News

gruez, today at 2:48 PM

I thought all the recent models were "multimodal"? Is the image part just sticking an image recognizer in front of the text model?