I feel like this is pretty pointless. Rather than trying to convince the model to do all of this, why not just run the tests automatically?