the models can accept images directly as tokens. not a description of an image, the actual image its...

spongebobstoes • today at 12:57 AM • 0 replies • view on HN

the models can accept images directly as tokens. not a description of an image, the actual image itself.

yes, the visual intelligence is limited, but they do actually have vision capabilities.

alt Hacker News