Hacker News

ben_w 01/20/2025

> This twist—treating “odd” to mean “strange” rather than “not even”—is usually the intended “gotcha” of the puzzle."

I like this one.

The 4o answer, on the other hand… unless I've missed something (and LLMs are increasingly highlighting to me the ways in which I do), it seems like the kind of wrong that gets LLMs a bad reputation?


Replies

lynguist 01/22/2025

It is! 4o is unfortunately often very dumb in tricky circumstances, or is biased toward pundit-like opinions that are wrong. I'm not sure why that is the case, but the full o1 always has a "weight"/"presence" to it when I chat with it that feels to me like a real intelligence. It can also solve difficult puzzles that 4o and I struggle with.