Cool dataset. I did the same thing a few days ago[1] but somehow had the top 3 getting ~1000 more points than this data.
There's some data issues in the full dataset, expectedly. My blog got around 200 points this year, which should be enough to hit #2077, but the blog does not appear at all.
Also baseten.co is not a personal blog.
1. https://x.com/jonobelotti_IO/status/2005737476069933272?s=20
>There's some data issues in the full dataset, expectedly. My blog got around 200 points this year, which should be enough to hit #2077, but the blog does not appear at all.
Yeah, the minimum for inclusion is 500 upvotes across all front page stories.[0]
>Also baseten.co is not a personal blog.
Thanks, I've updated the dataset to exclude baseten.[1] It should disappear in the next hour or so.
Which view did they appear in? I don't see them anywhere in the top 100.
[0] https://refactoringenglish.com/tools/hn-popularity/methodolo...
[1] https://github.com/mtlynch/hn-popularity-contest-data/pull/8...
Maybe because of this?
> I aggregate scores across all submissions that received a score of at least 20 and are not dead or deleted.
https://refactoringenglish.com/tools/hn-popularity/methodolo...