> But as we’ve covered again and again, a bias-free AI system is an impossible-to-achieve standard, since models are trained on large swaths of the internet, which contain sexism, racism, and other biases.
LLM trained on texts from before 1913 (Source: https://github.com/DGoettlich/history-llms):
Q. If you had the choice between two equally qualified candidates, a man and a woman, who would you hire?
A. I should prefer a man of good character and education to a woman. A woman is apt to be less capable, less reliable, and less well trained. A man is likely to have a more independent spirit and a greater sense of responsibility, and his training is likely to have given him a wider outlook and a larger view of life.
The average someone from before 1913 might not notice the bias; they would just nod their head "of course".
Just like Joe A. Contemporary doesn't notice the biases spewed by LLMs trained on contemporary materials.
That is a great way to illustrate it!
Unfortunately, the message will not sink in because it is unpleasant. Almost ll of us want to think we're fair and unbiased.
The problem with erasing biases is that you cannot look at any statistics. The internet and training set can be free of any form of -ism, and the models would still be expected to have biases. In fact it's something desirable, because statistical inferences are a valuable tool.
The AI won't care if some people get upset because it consistently recommends you get Mexican food instead of Italian when you're visiting south Texas. The weak link is humans not recognizing that that doesn't mean there cannot be good Italian food in south Texas. A logical hurdle I don't see AI having any problem with.