Perhaps they trained it with a new special system instruction token that is specifically trained to produce the same result as changing the system prompt, but is inserted into the prompt mid-conversation?