readitalready • today at 3:16 AM • 2 replies

Pretraining + RL itself is the scaling limit. If you feed it the entire dataset before 1905, LLMs aren't going to come up with general relativity. They have no concept of physics, or even of time.

AGI happens when you DON'T need to scale pretraining + RL.

Replies

acuozzo • today at 3:41 AM

> If you feed it the entire dataset before 1905, LLMs aren't going to come up with general relativity.

Link?

rishabhaiover • today at 4:20 AM

AGI maybe not, but it is reaching disruption-level intelligence in the SWE domain.