Or patch it over to Python; I assume LLMs are even better at Python.
Don't assume. Empirically, they are not. (As of this post, Feb 2026; this may change in the future, yadda yadda.)
See: AutoCodeBench
https://github.com/Tencent-Hunyuan/AutoCodeBenchmark/tree/ma...