> we now think it's +9% - +38%
If you are referring to the following quote [0], you are off by a sign:
> we now estimate a speedup of -18% with a confidence interval between -38% and +9%.
My bad, I messed up by being lazy while switching from decreases in time taken (that they report) to increased in throughput. (Yes, it's not just flipping the sign, but as I said, I was being lazy!) The broad point still holds, their initial findings have been reversed, and they expect selection effects masked a higher speedup.
The language is confusing, but the chart helps: https://metr.org/assets/images/uplift-2026-post/uplift_timel...
That update blog is funny. The only data they can get at reports slowdowns, but they struggle to believe it because developers self-report amazing speedups.
You'd get the same sort of results if you were studying the benefits of substance abuse.
"It is difficult to study the downsides of opiates because none of our participants were willing to go a day without opiates. For this reason, opiates must be really good and we're just missing something."