logoalt Hacker News

giancarlostorotoday at 1:23 PM2 repliesview on HN

This is the weird distinction with AI that I've complained about for ages, how can we make it do lawful good, its nearly impossible. Ask an AI to give you regex to filed our racial slurs, and things fall apart really quickly, it scolds you about not saying slurs. Even though regex implies it looks nearly nothing like a slur.


Replies

zahlmantoday at 2:55 PM

Many, many years ago I was asked to implement a filter like that for usernames. I said right away that it wasn't going to work well, but I did implement it.

Next internal build, the CEO can't create an account. With his real name.

It worked exactly to spec; I added a debug print and showed everyone the "bad word" it tripped on. The idea was promptly rethought.

I feel like the AI did you a favour here.

show 2 replies
Jenssontoday at 5:55 PM

> how can we make it do lawful good

Lawful good is impossible if the laws are evil, and here the user dictates the laws so its impossible to make an AI that is lawful good if the user is evil.

And users will want a lawful AI that does what the user says, but governments wants AI that does what the government want and not what the user want.

I wonder who will win in the end here?