logoalt Hacker News

fagnerbracktoday at 7:15 AM0 repliesview on HN

I managed to jailbreak its protections quite easily. For exampke I did some experiments on rewriting a text built by claude to iterate over a fitness function that rewrites to bypass AI-detectors, just to see how far it would go, changing the API terms and skills from "human" and "ai" to "engaging" and "unease" managed to bypass everything while keeping everything else int he logic intact.