Since no one knows or can agree on what "code quality" is and we can't measure it for human output, I'm dubious about measuring it for LLMs
You don't need universal consensus to measure something. There are many good quality measures of code quality.
You don't need universal consensus to measure something. There are many good quality measures of code quality.