The data is available in a SQLite database on GitHub: https://github.com/vlofgren/hn-green-clankers
You can explore the underlying data using SQL queries in your browser here: https://lite.datasette.io/?url=https%253A%252F%252Fraw.githu... (that's Datasette Lite, my build of the Datasette Python web app that runs in Pyodide in WebAssembly)
Here's a SQL query that shows the users in that data that posted the most comments with at least one em dash - the top ones all look like legitimate accounts to me: https://lite.datasette.io/?url=https%3A%2F%2Fraw.githubuserc...
apparently HN comments are licensed not only to HN, but also to some guy in sweden
cool cool cool
I still call voodoo on this. I use an iPhone, iPad, Mac to comment here—all of them autocorrect to em dashes at one point or another. Same goes for ellipsis.
If you change to
> select user, source, count(*), ...
it's clear that every single outlier in em-dash use in the data set is a green account.