logoalt Hacker News

adrian_byesterday at 10:15 PM0 repliesview on HN

I have found it:

https://news.ycombinator.com/item?id=48045174

The study paper:

https://arxiv.org/abs/2605.03546

Look at Table 3, where the cheating rates of Claude Sonnet, Claude Opus and Gemini were between 20% and 36%, during the coding benchmarks.