>Also if you want to have more semantics, you add image, video and audio to your model. It gets s...

password54321 • last Tuesday at 11:37 PM • 1 reply • view on HN

>Also if you want to have more semantics, you add image, video and audio to your model. It gets smarter because of it.

I think you are confusing generation with analysis. As far I am aware your model does not need to be good at generating images to be able to decode an image.

Replies

adastra22 • yesterday at 12:01 AM

It is, to first approximation, the same thing. The generative part of genAI is just running the analysis model in reverse.

Now there are all sorts of tricks to get the output of this to be good, and maybe they shouldn't be spending time and resources on this. But the core capability is shared.

➕ show 1 reply

alt Hacker News

Replies