My friends and I are working on Norma. It helps you curate a dataset that captures as much signal as possible for model training.
See norma.grouplabs.ca