logoalt Hacker News

knowitnonetoday at 12:15 AM2 repliesview on HN

you mean AI crawlers from Microsoft, owners of Github?


Replies

PaulDavisThe1sttoday at 2:25 PM

I have no idea where they are from. I'd surprised if MS is using a network of 1M+ residential IP addresses, but they've surprised me before ...

haiku2077today at 12:32 AM

The big companies tend to respect robots.txt. The problem is other, unscrupulous actors use fake user agents and residential IPs and don't respect robots.txt or act reasonably.

show 1 reply