Nethack has been widely used to test reinforcement learning agents, starting from at least 2020; there was a Nethack challenge at NeurIPS 2021. https://nethackchallenge.com/report.html
For a more recent test, see https://kenforthewin.github.io/blog/posts/nethack-agent/ .