A lot of the current code and science capabilities do not come from NTP training.
Indeed in seems in most language model RL there is not even process supervision, so a long way from NTP