I can't tell you how many times a week Claude Opus 4.8 (high effort) has to apologise for being wrong. I ask it about something narrow and specific that I want it to research, and instead it blurts out broad context from its training material along with incorrect conclusions and assumptions. This is happening all the time. Someone needs to create a repository of its apologies to remind us all of its limitations.
I’ve noticed the same thing at work with Opus 4.8.
ChatGPT on my personal plan does it too. Just yesterday I asked it to give some places fitting specific criteria. The first criterion was that they were within a 2-hour drive of my city. 75% of the locations it gave me were more than 2x that distance. It kept doing this across multiple different searches. I tried high and pro with no difference.
My company instituted a monthly "best use of AI" award to encourage people to share how they're using it. I suggested we should also have a "most wrong AI output" award to remind everyone they can't just trust it blindly, but that hasn't happened yet for some reason ...