Hacker News

knowitnone 05/15/2025

You mean AI crawlers from Microsoft, owners of GitHub?


Replies

haiku2077 05/15/2025

The big companies tend to respect robots.txt. The problem is other, unscrupulous actors use fake user agents and residential IPs and don't respect robots.txt or act reasonably.

PaulDavisThe1st 05/15/2025

I have no idea where they are from. I'd be surprised if MS is using a network of 1M+ residential IP addresses, but they've surprised me before ...