Here's one that was flagged for me: a question about a niche Reinforcement Learning paper from 2012
I've been reading the option-option model paper by David Silver. It appears that they achieved quite an effective result. Why hasn't there been more work on it since?