And that's all we do, and it's all we need, and it's probably all there is.
The discovery that reinforcement learning allows next-token prediction to extrapolate beyond its pretraining data set is harder to explain than the discovery of fire or the wheel or electricity, but it's up there on that level.