Could you run a similar analysis for pre-2020 papers? It'd be interesting to know how prevalent making up sources was before LLMs.
Yeah, it’s kind of meaningless to attribute this to AI without measuring the base rate.
It’s for sure plausible that it’s increasing, but I’m certain this kind of thing happened with humans too.
Also, it'd be interesting how many pre-2020 papers their "AI detector" marks as AI-generated. I distrust LLMs somewhat, but I distrust AI detectors even more.