Yann LeCun was saying 3 years ago that because token generation is auto-regressive, its mathematical...

nok22kon • today at 8:13 AM • 6 replies • view on HN

Yann LeCun was saying 3 years ago that because token generation is auto-regressive, its mathematically impossible to generate a long stream of coherent tokens, because errors amplify exponentially.

and then models learned that they can back track and error correct

so much for "mathematically impossible..."

Replies

threethirtytwo • today at 9:25 AM

Stop attacking Yann. I would say like 90% of the HN crowd was parroting Yann too.

shevy-java • today at 9:14 AM

You insinuate here AI "learned".

I doubt that this was AI self-improvement.

➕ show 2 replies

charcircuit • today at 8:26 AM

I think it was largely the introduction of tool calling that allowed models to mitigate the issue of errors amplifying exponentially since it allows the model to understand if what it generated is correct or has issues that it needs to address. This addresses the potential lack of or low quality of world model by being able to reference the current state of the world.

➕ show 1 reply

TMWNN • today at 8:53 AM

> and then models learned that they can back track and error correct

You mean "Human developers learned ...", yes? Or was there really an all AI-driven, self-improving aspect to this?

➕ show 1 reply

waldarbeiter • today at 8:35 AM

[dead]

jiggawatts • today at 8:27 AM

Also, almost any argument against LLM intelligence also applies to humans.

I very commonly see someone make some small mistake and end up going in the wrong direction, “accumulating stupid” as they go, sometimes for years.

➕ show 2 replies

alt Hacker News

Replies