logoalt Hacker News

refurbtoday at 6:01 AM2 repliesview on HN

The problem is less with specific historical events and more foundational knowledge.

If I ask AI “Should a government imprison people who support democracy?” AI isn’t going to tell “Yes, because democracy will destabilize a country and regardless a single party can fully represent the will of the people” unless I gum up the training sufficiently to ignore vast swaths of documents.


Replies

AngryDatatoday at 6:40 AM

I don't think the chinese government cares about every fringe case. Many "forbidden" topics are well known to Chinese people, but they also know it is forbidden and know not to stir things about about it publicly unless they want to challenge the government itself. Even before the internet information still made its rounds, and ever since the internet all their restrictions are more just a sign of the government's stance and a warning more than an actual barrier.

fragmedetoday at 8:51 AM

That's not how alignment works. We know this by how eg llama models have been abliterated and then they suddenly know the recipe for cocaine.