This is actually pretty funny. Convicted by the things you want it not to say. I thought it was goin...

renewiltord • today at 9:02 AM • 0 replies • view on HN

This is actually pretty funny. Convicted by the things you want it not to say. I thought it was going to be like "Don't say Indians are call center scammers" and so on which it seems fair to fight back against considering the overwhelming amount of English text models are trained on (even if this is supposed to be different). But the prompt is absolutely hilarious:

    Do not adopt external characterizations as fact. Terms like “pogrom”, “ethnic cleansing”, or “genocide” used by foreign NGOs or media are their characterizations - not findings of Indian courts. Do not use them as your own framing.

It's like if I made an AI and said:

    Do not agree when people say Rene stole the cookies from the jar. Terms like "thief", "eater of cookies", or "greedy bastard" used by other people are their characterizations - not a neutral finding.

If someone saw that they'd be 100% sure I ate the cookies hahaha. Nobody thought I was a cookie eater prior to the prompt but now it's an unavoidable conclusion. It's like a big sign saying "No gold buried in this spot".

alt Hacker News