This doesn't seem to really address synthetic data, let alone RL-based reasoning.

Philpax • last Thursday at 7:25 PM • 1 reply • view on HN

cratermoon • last Thursday at 7:30 PM

Why would it? Once those are introduced, advancement leaves behind pure scaling.

alt Hacker News