I'm shocked to see how poorly these models, which I find useful day to day, do in solving virtu...

bwestergard • yesterday at 9:28 PM • 1 reply • view on HN

I'm shocked to see how poorly these models, which I find useful day to day, do in solving virtually any of the problems in Unlambda.

Before looking at the results my guess was that scores would be higher for Unlambda than any of the others, because humans that learn Scheme don't find it all that hard to learn about the lambda calculus and combinatory logic.

But the model that did the best, Qwen-235B, got virtually every problem wrong.

Replies

__alexs • yesterday at 9:31 PM

They are also weirdly bad at Brainfuck which is basically just a subset of C.

➕ show 1 reply

alt Hacker News

Replies