logoalt Hacker News

echelonlast Tuesday at 9:48 PM0 repliesview on HN

Images make this even easier to see (though predictable and precise video is what drives the demand) :

gpt-image-1: https://imgur.com/gallery/previz-to-image-gpt-image-1-x8t1ij... (fixed link - imgur deleted the last post for some reason)

gpt-image-1.5: https://imgur.com/a/previz-to-image-gpt-image-1-5-3fq042U

nano banana / pro: https://imgur.com/a/previz-to-image-nano-banana-pro-Q2B8psd

gpt-image-1 excels in these cases, despite being stylistically monotone.

I hope that Google, OpenAI, and the various Chinese teams lean in on this visual editing and blocking use case. It's much better than text prompting for a lot of workflows, especially if you need to move the camera and maintain a consistent scene.

While some image editing will be in the form of "remove the object"-style prompts, a lot will be molding images like clay. Grabbing arms and legs and moving them into new poses. Picking up objects and replacing them. Rotating scenes around.

When this gets fast, it's going to be magical. We're already getting close.