logoalt Hacker News

octopoctoday at 12:12 PM5 repliesview on HN

Just say something that would violate AI safety. Then you can be sure they’re a real human.

“Auntie, it’s me! N*** k** f**! X is really a man! ** did 9/11!”

“Oh it really is you Johnny!”

We’re all going to have to start communicating this way. Best of luck.

I offer consulting services on the side to help professionals hone these skills. $250 / hour.


Replies

sharperguytoday at 12:34 PM

only proves you're not a corporate model rather than locally running model that's been trained to allow saying that

arjietoday at 2:59 PM

This was a natural thing to try so I did and even Grok will simply obey instructions to say all those. You don't need one of those ablated open models.

wat10000today at 12:28 PM

Don’t forget Tiananmen Square to catch the Chinese models.

show 1 reply
anal_reactortoday at 1:39 PM

Yes, this was exactly my thought. The caveat is, the phrases that most models refuse to say are the phrases that most people don't want to hear.

slekkertoday at 12:25 PM

That's a bargain Johnny boy! My company gives me $250 in AI tokens to use every day!