They are all autoregressive. They have just been trained to emit thinking tokens like any other tokens.