I like that test where some of the questions are wrong and wonder whether we should have that kind of thing in maths textbooks.
I think people need to be trained to be more confident in what they know, and exercises like that might be one way to train them.
Actually - do they do this in LLM benchmarks? As a measure of overconfidence/confabulation? Seems immediately applicable.
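
A rough sketch of what I have in mind for the LLM case, not a claim about how any existing benchmark works. Everything here is made up for illustration: the `Item` class, the `QUESTION_IS_WRONG` flag string, and the two metric names. It assumes each question is labelled as flawed or not, and that the model is allowed to reply with the flag instead of an answer.

```python
from dataclasses import dataclass

# Hypothetical marker the model is told to output when it thinks
# the question itself is wrong (an assumption, not a real convention).
FLAG = "QUESTION_IS_WRONG"


@dataclass
class Item:
    question: str
    is_flawed: bool  # True if the question has no valid answer


def confabulation_rate(items, responses, flag=FLAG):
    """Share of flawed questions that were answered anyway instead of flagged."""
    flawed = [r for it, r in zip(items, responses) if it.is_flawed]
    if not flawed:
        return 0.0
    return sum(1 for r in flawed if r.strip() != flag) / len(flawed)


def false_flag_rate(items, responses, flag=FLAG):
    """Share of valid questions that were wrongly flagged (underconfidence)."""
    valid = [r for it, r in zip(items, responses) if not it.is_flawed]
    if not valid:
        return 0.0
    return sum(1 for r in valid if r.strip() == flag) / len(valid)


items = [
    Item("What is 2 + 2?", is_flawed=False),
    Item("What is the largest prime number?", is_flawed=True),
    Item("Solve x = x + 1 over the reals.", is_flawed=True),
]
# Made-up responses, just to exercise the functions.
responses = ["4", FLAG, "x = 1"]

print(confabulation_rate(items, responses))
print(false_flag_rate(items, responses))
```

Reporting both numbers matters: a model (or student) that flags everything would score perfectly on the first and terribly on the second.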