Take a look at SkyPilot. Good for running these batch workloads. You can use spot instances to save costs.