This could be the reason https://petergpt.github.io/bullshit-benchmark/viewer/index.v... Claude bullshits the least of all models. ChatGPT does it more than half the time.
That's a nice benchmark + website and wow ChatGPT scores worse than I thought.
That's a nice benchmark + website and wow ChatGPT scores worse than I thought.