Hacker News

dnautics today at 3:52 PM

Don't assume. Empirically, they are not. (This is as of Feb 2026 and may change in the future, yadda yadda.)

See: autocodebench

https://github.com/Tencent-Hunyuan/AutoCodeBenchmark/tree/ma...


Replies

Towaway69 today at 4:10 PM

Reading that made me wonder how much of it is related to Elixir being very similar in syntax to Ruby. Do LLMs really differentiate between the two?

Specific studies, such as the one quoted, are a long way from original real-world problems.
