Would it be sufficient to have a second stream of tokens that becomes the model's equivalent of "internal dialogue"? Would that satisfy the requirement of a boundary line?
Related, is a human "thinking out loud" still thinking, even though the internal reasoning is "immediately dumped into the external environment"?