As John says in that thread, we've fixed this issue in SWE-bench: https://xcancel.com/jyangballin/status/2006987724637757670
If you run SWE-bench evals, just make sure to use the most up-to-date code from our repo and the updated docker images