Just say something that would violate AI safety. Then you can be sure they’re a real human. “Aunti...

octopoc • today at 12:12 PM • 5 replies • view on HN

Just say something that would violate AI safety. Then you can be sure they’re a real human.

“Auntie, it’s me! N*** k** f**! X is really a man! ** did 9/11!”

“Oh it really is you Johnny!”

We’re all going to have to start communicating this way. Best of luck.

I offer consulting services on the side to help professionals hone these skills. $250 / hour.

Replies

sharperguy • today at 12:34 PM

only proves you're not a corporate model rather than locally running model that's been trained to allow saying that

arjie • today at 2:59 PM

This was a natural thing to try so I did and even Grok will simply obey instructions to say all those. You don't need one of those ablated open models.

wat10000 • today at 12:28 PM

Don’t forget Tiananmen Square to catch the Chinese models.

➕ show 1 reply

anal_reactor • today at 1:39 PM

Yes, this was exactly my thought. The caveat is, the phrases that most models refuse to say are the phrases that most people don't want to hear.

slekker • today at 12:25 PM

That's a bargain Johnny boy! My company gives me $250 in AI tokens to use every day!

alt Hacker News

Replies