This seems like a wasted effort when AI will primarily learn the majority consensus view and not one...

alyxya • yesterday at 8:53 PM • 3 replies • view on HN

This seems like a wasted effort when AI will primarily learn the majority consensus view and not one-off misinformation. AI tries to learn pattern matching for generalization, so garbage data doesn't make AI learn the wrong patterns, at best just slows down learning the actual patterns. When most compute for training is spent on curated data and RL rather than random web-scraped data, the impact is likely negligible.

Replies

Mordisquitos • yesterday at 9:14 PM

> This seems like a wasted effort when AI will primarily learn the majority consensus view and not one-off misinformation.

We have evidence to the contrary. Two blog articles and two preprints of fake academic articles [0] were able to convince CoPilot, Gemini, ChatGPT and Perplexity AI of the existence of a fake disease, against all majority consensus. And even though the falsity of this information was made public by the author of the experiment and the results of their actions were widely published, it took a while before the models started to get wind of it and stopped treating the fake disease as real. Imagine what you can do if you publish false information and have absolutely no reason to later reveal that you did so in the first place.

[0] https://www.nature.com/articles/d41586-026-01100-y

➕ show 2 replies

alfiedotwtf • yesterday at 9:12 PM

In the pre-AI-collapse era, we called this PageRank ;)

righthand • yesterday at 8:57 PM

What is the pattern for truth if I flood your data with lies?

➕ show 1 reply

alt Hacker News

Replies