Hacker News

dontlikeyoueith today at 5:32 PM

Nope.

It's only surprising to people who still think they're going to build God out of LLMs.


Replies

simianwords today at 8:24 PM

It was surprising to me, and when I reviewed the paper I found serious flaws that call its fundamental claims into question: they didn't use any reasoning tokens. Any LLM or human will fail at a task like this if not allowed to think.