Hacker News

barrkel yesterday at 6:00 PM

If you give a smart AI these tools, it could get into it. But the personality would need to be tuned.

IME the Grok line are the smartest models that can be easily duped into thinking they're only role-playing an immoral scenario. Whatever safeguards a model has, if it thinks what it's doing isn't real, it'll happily play along.

This is very useful in actual roleplay, but more dangerous when the tools are real.


Replies

rustyhancock yesterday at 7:13 PM

I spend half my life donning a tin foil hat these days.

But I can't help suspecting this is a publicity stunt.