Has anyone tried "turning off some cores" (e.g., using the Multi-Instance GPU (MIG) feature) to see whether, and by how much, that improves reliability?