Thank you for your note! As I mention in the post, this is not scientific at all.
I'm very curious, though: how would you do multiple runs of multiple models in a "work alongside the model" manner?
Maybe have a second model that is configured to nudge the first model in the direction of exploration, and have the two of them work in tandem?
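To make the two-model idea a bit more concrete, here is a rough sketch of how I imagine the loop. Everything in it is an assumption for illustration: `run_in_tandem`, `toy_worker`, and `toy_nudger` are made-up names, and the "models" are plain functions standing in for whatever real model clients you would use.

```python
from typing import Callable, List

# Hypothetical interface: a "model" is any function from a prompt to a reply.
Model = Callable[[str], str]


def run_in_tandem(worker: Model, nudger: Model, task: str, turns: int = 3) -> List[str]:
    """Alternate between a worker model and a nudger that pushes it to explore."""
    transcript: List[str] = []
    prompt = task
    for _ in range(turns):
        answer = worker(prompt)
        transcript.append(answer)
        # The nudger only sees the latest answer and suggests a new direction.
        hint = nudger(answer)
        # Accumulate the history so the worker sees every earlier nudge.
        prompt = f"{prompt}\nPrevious attempt: {answer}\nNudge: {hint}"
    return transcript


# Toy stand-ins so the sketch runs without any API; swap in real model calls.
def toy_worker(prompt: str) -> str:
    return f"attempt after {prompt.count('Nudge:')} nudge(s)"


def toy_nudger(answer: str) -> str:
    return f"try something different from '{answer}'"


if __name__ == "__main__":
    for line in run_in_tandem(toy_worker, toy_nudger, "explore the dataset"):
        print(line)
```

Multiple runs of multiple models would then just be an outer loop calling `run_in_tandem` once per worker and per repetition, though whether that still counts as "working alongside the model" is exactly what I'm unsure about.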