SOTA open source model for image and vid generation. Beats all others but is too big to run on most people’s computers at 64b params.
Still impressive nonetheless given its artificially generated training sets.
Beats nano banana 1 but not yet competitive with 2 or seedance2, grok imagine,etc.
Great summary. I find image and video generation models are a more understandable reality check for how close local models are to frontier models.
It's sadly ironic I no longer even bother clicking on HN posts that are obvious product announcements from large corporations and instead just go to the replies. Corporate product announcements somehow fail to even clearly communicate the basic facts you did in your first nine words.