Open source has already caught up with SOTA:
https://www.reddit.com/r/StableDiffusion/comments/1hav4z3/op...
These are even unfair comparisons because they're leveraging text-to-video instead of the more powerful image-to-video. In the latter case, the results are indistinguishable.
Video generation is about to be everywhere, and we're about to have the "Stable Diffusion" moment for video.
Look at the comments: people are already fawning over open source being uncensored.
Cat's out of the bag.