Harness/fill in the gaps coding: you define a bunch of tests/reference output/validation procedures, let the AI spin until all lights are green