That’s because it’s really hard to evaluate results but usage is comparable across people. It’s the equivalent of counting lines of code submitted or jira tickets closed.
https://en.wikipedia.org/wiki/McNamara_fallacy
https://en.wikipedia.org/wiki/McNamara_fallacy