Is hammer appallingly stupid for being bad at driving screws, or is the person trying to hammer screws stupid?
I worked for a man who hammered wood screws most of the way in and then finished with a screw driver because "the threads made by that last few turns were enough."
In this case, I see the author's point. The DJ isn't being advertised as "a narrow tool to select some random pop tunes". If an average person is told this is AI, has a full text interface and responds with "sure I'll do what you asked" and appears to understand, then they expect it to do what it is asked.
We're told its better than people at selecting songs (e.g. has the combined wisdom of all music and music experts), basic requests like "play the first movement of Beethoven's 7th" don't sound hard for an average person with limited / no musical expertise. If I said "please play the entire 7th symphony", and the tool responds with "sure, I'll play the whole thing", then proceeds to play the Beatles, I'd say that's a fair thing to point out as a shortcoming.
Its only obvious to tech people that understand that the technology has extreme limits and only works well on areas with abundant high quality data and labels, and can't be expected to reason like a person at all in many cases, that those limits seem as obvious as hammer / screw-driver. And that given how spotify developed these models, they probably didn't really intend classical or test that area -- so it fails despite sounding confident.
But maybe we should stop advertising screwdrivers as universal intelligence? There's a lot of mott and bailey going on. When AI makes mistakes its "just tools, stop expecting intelligence." However, when people question the AI hype its "humans make mistakes too, LLMs are truly reasoning and better most humans already." And "the entire labor economy will be replaced, human DJs will cease to exist.".