snowmobile • yesterday at 10:12 PM

I suppose that returns some guardrail text about how it's not allowed to talk about it? Meanwhile we see examples of it accidentally deleting files, writing insecure code, and whatnot. I'm more worried about a supposedly "well-meaning" model doing something bad simply because it has no real way to judge the morality of its actions. Playing whack-a-mole with the flavor-of-the-day "unsafe" text string will not change that.

alt Hacker News