> they're trained a on a fixed set can only reproduce noise from here and there
This anti-AI argument doesn't make sense, it's like saying it's impossible to reinvent multiplication based on reading a times table. You can create new things via generalization or in-context learning (references).
In practice many image generation models aren't that powerful, but Gemini's is.
If someone created one that output multi-layer images/PSDs, which is certainly doable, it could be much more usable.
If image generation is anything like code generation then AI is not good at copying layout / art style of the coder / artist.
Using Visual Studio, all the AI code generation is applying Microsoft's syntax style and not my syntax style. The return code line might be true but the layout / art / syntax is completely off. This with a solution that has a little less than one million lines of code, at the moment, which AI can work off of.
Art is not constant. The artist has a flow and may have an idea but the art will change form with each stroke with even removing strokes that are not fitting. I see as AI generated content lacks emotion from the artist.
> If someone created one that output multi-layer images/PSDs, which is certainly doable, it could be much more usable.
This reminds me, if you ask most image models for something "with a transparent background", it'll generate an image on top of a Photoshop checkerboard, and sometimes it'll draw the checkerboard wrong.