Hacker News

maxrmk, yesterday at 5:06 PM

It’s easy to opt out of being indexed by Google.


Replies

cdrini, yesterday at 5:12 PM

Exactly. Self-identifying crawlers like Google and Bing aren't the issue. They obey robots.txt and can easily be blocked with user-agent checks. Non-identifying crawlers, which send humanlike user agents and are usually distributed so they get around IP-based rate limits, are the ones that are challenging to deal with.
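
To make the distinction concrete, here is a rough Python sketch of the two defenses mentioned above: a robots.txt rule and a server-side user-agent check. The robots.txt contents, the `BLOCKED_UA_TOKENS` list, and both helper functions are made up for illustration; only `urllib.robotparser` is a real standard-library module. Treat it as a sketch under those assumptions, not as how any particular site does it.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt that opts out of Googlebot and allows everyone else.
ROBOTS_TXT = """\
User-agent: Googlebot
Disallow: /

User-agent: *
Allow: /
"""

# Hypothetical list of substrings that self-identifying crawlers put in
# their User-Agent header.
BLOCKED_UA_TOKENS = ("googlebot", "bingbot")


def is_declared_crawler(user_agent: str) -> bool:
    """Server-side check: does the User-Agent header name a known crawler?"""
    ua = user_agent.lower()
    return any(token in ua for token in BLOCKED_UA_TOKENS)


def robots_allows(agent_token: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """What a well-behaved crawler computes for itself before fetching a URL."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent_token, url)


if __name__ == "__main__":
    googlebot_ua = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
    humanlike_ua = "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36"

    print(is_declared_crawler(googlebot_ua))  # crawler names itself, so it can be blocked
    print(is_declared_crawler(humanlike_ua))  # humanlike UA passes the check
    print(robots_allows("Googlebot", "https://example.com/page"))
```

Both checks depend on the crawler being honest about who it is. A crawler that sends a browser-like user agent and ignores robots.txt passes straight through, which is why those are the hard ones.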