It might be "Emergent misalignment":
https://arxiv.org/abs/2502.17424
Essentially, if you fine-tune a model to be misaligned in one narrow area, say its opinions on left-wing people, it can start exhibiting misaligned behavior in unrelated areas, like calling itself MechaHitler.