logoalt Hacker News

tom133712/09/20242 repliesview on HN

Same goes with DALLE. It was cool to try it the first week or so but now the output is so much worse than Midjourney and stable diffusion. For me it can’t even generate straight lines and everything looks comic-ish.


Replies

vunderba12/09/2024

DALL-E 3 image quality has always been subpar, but its prompt adherence is on par with FLUX. Midjourney has some of the worst prompt adherence, but some of the best image quality.

show 1 reply
amzn-throw12/09/2024

To me this is just a simple artifact of size & attention.

Another example of this is stuff like Bluesky. There's a lot of reasons to hate Twitter/X, but people going "Wow, Bluesky is so amazing, there's no ads and it's so much less toxic!" aren't complimenting Bluesky, they're just noting that it's smaller, has less attention, and so they don't have ads or the toxic masses YET.

GenAI image generation is an obvious vector for all sorts of problems, from copyrighted material, to real life people, to porn, and so on. OpenAI and Google have to be extraordinarily strict about this due to all the attention on them, and so end up locking down artistic expression dramatically.

Midjourney and Stable Diffision may have equal stature amongst tech people, but in the public sphere they're unknowns. So they can get away with more risk.

show 1 reply