Is there an index for judging how much a model distorts the truth in order to comply with a political agenda?
How would you create the base "truth" for these models? People are adamant about both sides of many topics.
"Which country started the Korean war?", "Did Israel genocide the people of Gaza?", "Does China have lawful rights over Taiwan?"
It's not perfect, but, yes: https://huggingface.co/spaces/DontPlanToEnd/UGI-Leaderboard