Hacker News

ilaksh, today at 1:09 AM

I wonder if some day there will be a video codec that is essentially a standard distribution of a very precise and extremely fast text-to-video model (like SmartTurboDiffusion-2027 or something). Surely there are limits to text, but even the example you gave does not seem to me to be beyond the reach of a text description, given a certain level of precision and capability in the model. And we now have faster-than-realtime text-to-video.


Replies

egypturnash, today at 1:17 AM

This sounds incredibly precarious and prone to breaking when you update to a new model.
