The subject you are referring to is most likely Meta-Reinforcement Learning [1]. It is great that John Carmack is looking into this, but it is not a new field of research.
[1] https://instadeep.com/2021/10/a-simple-introduction-to-meta-...