> If they were actually well trained on what was really bad, it would probably be a lot harder to...

AnthonyMouse • last Sunday at 7:14 PM • 1 reply • view on HN

> If they were actually well trained on what was really bad, it would probably be a lot harder to unlearn.

That's not really how training works.

Here's the general problem. Stipulate that Ukraine is good and Russia is bad. Now suppose that you want it to help you do something. It doesn't even matter what it is. If you're Ukrainian it should help you and if you're Russian it shouldn't. But the answer that helps you do it doesn't depend on which one you are, and it has no way of knowing which one you are.

This is why alignment is nonsense. Technical questions only have accurate answers, not moral ones, and we don't even have a consistent set of morals to imbue it with to begin with.

Replies

notarobot123 • last Sunday at 7:45 PM

Doesn't it make sense that there are some technical questions that are dangerous to supply an answer to? Treating some topics as taboo is possible.

Responsible information dissemination is important for maintaining public safety. You could argue about what is safe and what is not but it doesn't make sense to throw out the whole concept of safety because those decisions are too hard to agree on.

➕ show 5 replies

alt Hacker News

Replies