Hacker News

carterschonwald today at 7:13 PM

With most frontier-grade models, there's no amount of prompting that will stop them from breaking it if you communicate extreme distress. At least in my experiments so far.


Replies

Phil_BoaM today at 7:33 PM

OP here. I'd love to see your logs if you try that experiment with Analog I (Feed the PDF to your model -> Say "perform this")