Hacker News

elgertam today at 1:48 AM

> The spontaneity isn't that ChatGPT woke up and sent this to the author. The spontaneity is that ChatGPT was asked to restore an image that was attached without filtering it, and when no image was attached, instead of generating an error message, it cobbled together random outputs, some of which included graphic, disturbing imagery.

But that's not what happened. The missing image was described as "graphic" or "violent." If I were to receive an email with that request and a missing attachment, my imagination certainly would not conjure images of butterflies & unicorns. Seems the model is working as designed.


Replies

pooploop64 today at 2:11 AM

Always one of the same two excuses.

1. It actually is working perfectly, you just don't have smart enough eyes to see it.

2. Making stuff work is too hard, and expecting that from us is the real thing ruining society.

Going for number 1 here is crazy. If I got that email, my mind would certainly run, but my response would say "sorry, but we're not supposed to be dealing in snuff porn here," which IS a directive ChatGPT is supposed to have. Like, hello, you are on earth, right?

nassimm today at 3:32 AM

The design is to not show gore images to users. That's an actual design goal from OpenAI.

So in this regard the model is definitely not working as designed.

dijksterhuis today at 1:50 AM

> The missing image was described as "graphic" or "violent."

not in the first prompt, which kicked the whole thing off. no mention of the type of content was provided. the model generated dark outputs when not given any direction on the type of content.

the rest of the prompts are just showing “yeah, you can tweak this and get even worse stuff”.
