Hacker News

Ensorceled today at 2:18 PM

Worse, the constant AI scraping is actually costing content providers additional money for no return. At least Google/Bing/Yahoo scraping would then be used to provide links back to your content.


Replies

devsda today at 4:48 PM

How do you distinguish Google/MS scraping for Gemini/Copilot from scraping for Google Search/Bing? In the case of Google, the UA is the same, and you are entirely at their mercy to honor the Google-Extended instructions in robots.txt.
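For illustration, here is a minimal sketch of what that opt-out looks like, checked with Python's standard `urllib.robotparser`. The robots.txt content, the `allowed` helper, and the example.com URL are all hypothetical. It only shows what the file declares: `Google-Extended` is a robots.txt control token rather than a separate request user agent, so nothing here can verify that a crawler actually obeys it.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: opt out of Google-Extended, allow everyone else.
ROBOTS_TXT = """\
User-agent: Google-Extended
Disallow: /

User-agent: *
Allow: /
"""


def allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    url = "https://example.com/article"  # placeholder URL
    for agent in ("Google-Extended", "Googlebot"):
        print(agent, allowed(agent, url))
```

The rules differ per token, but the requests arriving at the server would look the same either way, so the distinction lives only in how the crawler operator chooses to interpret the file.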

Google has further complicated it with its new search announcement, blurring the line between regular search and AI search. And AI likes to not honor any licenses or instructions when it is hungry for training material.

It is once again an example of Google using its dominant position to abuse and promote cross functional products.

bolangi today at 3:20 PM

Not only costing money. Constant AI scraping constitutes a denial-of-service attack that has brought down websites.

fiedzia today at 3:12 PM

> At least Google/Bing/Yahoo scraping would then be used to provide links back

That doesn't work anymore. Google provides an AI-generated summary, and nobody looks at the original site.