logoalt Hacker News

Real_Egortoday at 2:00 AM1 replyview on HN

I’d actually focus on something else entirely here.

Let's be honest: we are giving LLMs and humans the exact same tasks, but are we putting them on an equal playing field? Specifically, do they have access to the same resources and behavioral strategies?

- LLMs don't have spatial reasoning.

- LLMs don't have a lifetime of video game experience starting from childhood.

- LLMs don't have working memory or the ability to actually "memorize" key parameters on the fly.

- LLMs don't have an internal "world model" (one that actively adapts to real-world context and the actual process of playing a game).

... I could go on, but I've outlined the core requirements for beating these tests above.

So, are we putting LLMs and humans in the same position? My answer is "no." We give them the same tasks, but their approach to solving them—let alone their available resources—is fundamentally different. Even Einstein wouldn't necessarily pass these tests on the first try. He’d first have to figure out how to use a keyboard, and then frantically start "building up new experience."

P.S. To quickly address the idea that LLMs and calculators are just "useful tools" that will never become AGI—I have some bad news there too. We differ from calculators architecturally; we run on entirely different "processors." But with LLMs, we are architecturally built the same way: it is a Neural Network that processes and makes decisions. This means our only real advantage over them is our baseline configuration and the list of "tools" connected to our neural network (senses, motor functions, etc.). To me, this means LLMs don't have any fundamental "architectural" roadblocks. We just have a head start, but their speed of evolution is significantly faster.


Replies

suddenlybananastoday at 8:03 AM

>But with LLMs, we are architecturally built the same way: it is a Neural Network that processes and makes decisions.

There are high-level similarities between ANNs and the human brain but they are very, very, very different in a ton of ways.

show 1 reply