With my optimistic hat on maybe it realized "wavelet tree or similar structure that can efficiently represent and query for subset membership" doesn't actually describe wavelet tree and this was its way of backtracking. I.e. it might have learned to respond like this to a prior inconsistent series of tokens.
But ya, I'm aware of the issue with them saying they don't know things they do know.