logoalt Hacker News

sofixayesterday at 7:44 PM1 replyview on HN

This is literally the point of robots.txt. It was created to allow site owners to configure how and which parts of their website can be scraped by what bot, and all the "decent" ones (Google, Bing) respect it.


Replies

bfleschyesterday at 7:54 PM

Spoiler: They don't.

show 1 reply