And those mediocre engineers put their work online, as do top-tier developers. In fact, I would say that the scale is likely tilted towards mediocre engineers putting more stuff online than really good ones.
So statistically speaking, when the "AI" consumes all of that as its training data and returns the most likely answer when prompted, what percentage of developers will it be better than?
These people also prefer plastic averaged-out images of AI girls to real ones.
The Average is their top-tier.
In other words, there's probably a market for a model trained on a curated collection of high-quality code.
That's not how modern LLMs are built. The days of dumping everything on the internet into the training data and crossing your fingers are long past.
Anthropic and OpenAI spent most of 2025 focusing almost exclusively on improving the coding abilities of their models, through reinforcement learning combined with additional expert curation of training data.