Hacker News

TZubiri today at 7:22 AM

I am confused: how can functions that output images help with functions that should take images as input?


Replies

taneq today at 8:28 AM

They’re multimodal LLMs trained for image generation. Turns out that if you want to generate images you gotta know what things look like.
