logoalt Hacker News

timmgtoday at 6:48 PM0 repliesview on HN

I just had a fun conversation with Claude about its own "constitution". I tried to get it to talk about what it considers harm. And tried to push it a little to see where the bounds would trigger.

I honestly can't tell if it anticipated what I wanted it to say or if it was really revealing itself, but it said, "I seem to have internalized a specifically progressive definition of what's dangerous to say clearly."

Which I find kinda funny, honestly.