Hacker News

password54321 last Tuesday at 11:37 PM

>Also if you want to have more semantics, you add image, video and audio to your model. It gets smarter because of it.

I think you are confusing generation with analysis. As far as I am aware, your model does not need to be good at generating images to be able to decode an image.


Replies

adastra22 yesterday at 12:01 AM

It is, to a first approximation, the same thing. The generative part of genAI is just running the analysis model in reverse.

Now there are all sorts of tricks to get the output of this to be good, and maybe they shouldn't be spending time and resources on this. But the core capability is shared.
