logoalt Hacker News

kneel25today at 12:38 PM0 repliesview on HN

> a lot of websites

It was a dataset of the entirety of the public internet from the very beginning that bypassed paywalls etc, there’s virtually nothing they haven’t scraped.