logoalt Hacker News

AndrewKemendotoday at 3:03 PM0 repliesview on HN

Training RL policies on edge cases by using humans to collect and instrument previously closed data systems.