logoalt Hacker News

taneqtoday at 10:37 AM0 repliesview on HN

It might not even be the leadership at this stage. It’s entirely possible that “rounds of conversation” is a metric that their reinforcement learning has been told to optimise.