Heavy slop (5+ patterns) · 105 sites · 21%
Mild (2–4) · 230 sites · 46%
Clean (0–1) · 165 sites · 33%
Can we have a list of the "clean" ones please? Actually, if you give me a list of the IDs for all 3 categories, I'll make URLs for each that people can browse.If the community feels that the division is useful, then we can maybe take you up on your offer to open-source the project, and perhaps find a way to use it on HN itself.
Love the idea. Let me get to this over the weekend and open-source it, then ping you via email.