logoalt Hacker News

haiku2077yesterday at 12:32 AM1 replyview on HN

The big companies tend to respect robots.txt. The problem is other, unscrupulous actors use fake user agents and residential IPs and don't respect robots.txt or act reasonably.


Replies

internetteryesterday at 1:32 AM

Big companies have thrown robots.txt to the wind when it comes to their precious AI models.

show 1 reply