The thing is, if LLMs are stochastic parrots predicting the next word (aka, a partially decent auto complete), there's no reason it can't complete <specific question it can't answer> as "I don't know" - as that's a perfectly valid sentence too.
That's why I'm still cautiously optimistic about LLMs somewhere being good enough. I don't know if or when someone will manage to do it, but I'm hopeful.
Damn, did I say something wrong or unpopular to get a downvote?
AI models moved beyond next word predictors recently. Considering them to just be partially decent auto complete is completely missing many recent innovations.