logoalt Hacker News

astrangeyesterday at 9:44 PM3 repliesview on HN

> they're trained a on a fixed set can only reproduce noise from here and there

This anti-AI argument doesn't make sense, it's like saying it's impossible to reinvent multiplication based on reading a times table. You can create new things via generalization or in-context learning (references).

In practice many image generation models aren't that powerful, but Gemini's is.

If someone created one that output multi-layer images/PSDs, which is certainly doable, it could be much more usable.


Replies

astrangetoday at 2:44 AM

> If someone created one that output multi-layer images/PSDs, which is certainly doable, it could be much more usable.

This reminds me, if you ask most image models for something "with a transparent background", it'll generate an image on top of a Photoshop checkerboard, and sometimes it'll draw the checkerboard wrong.

yndoendoyesterday at 10:21 PM

If image generation is anything like code generation then AI is not good at copying layout / art style of the coder / artist.

Using Visual Studio, all the AI code generation is applying Microsoft's syntax style and not my syntax style. The return code line might be true but the layout / art / syntax is completely off. This with a solution that has a little less than one million lines of code, at the moment, which AI can work off of.

Art is not constant. The artist has a flow and may have an idea but the art will change form with each stroke with even removing strokes that are not fitting. I see as AI generated content lacks emotion from the artist.

show 1 reply