This could be the reason | alt Hacker News

alt Hacker News

smusamashah • today at 5:43 AM • 1 reply • view on HN

This could be the reason https://petergpt.github.io/bullshit-benchmark/viewer/index.v... Claude bullshits the least of all models. ChatGPT does it more than half the time.

Replies

sunaookami • today at 9:02 AM

That's a nice benchmark + website and wow ChatGPT scores worse than I thought.