Wow. I'm generally in the AI maximalist camp. But adding Werewolf feels dangerous to me. Anyone who's played knows lying, deceit, and manipulation are often key to winning. We really want models climbing this benchmark?
Good question, but who's going to stop them?
AI already has a very creative imagination for role play, so this just adds to their arsenal.
Confidently and charismatically lying to clueless users has been one of the foundations of AI adoption.
Oddly, in the highlighted game I watched, the werewolf simply gives up in the last round and says, "I'm the werewolf, well done... Vote me."
Bizarre.